Skip to content

Bh3ky/content-research-agent

Repository files navigation

Content Research Agent

This project is a content research agent designed to surface near real-time insights from Reddit based on keywords and topics you care about.

Built with:

  • Python
  • LangChain (integration to Google Gemini LLM), LangGraph (for easy Tool Calling)
  • Jupyter (rapid prototyping)
  • Postegres (database)
  • Django (web app & automation coordinator)
  • BrightData Search Engine AI (SERP) & Crawl API (extracting Reddit posts)
  • Redis (caching & queues)
  • Celery (background tasks)
  • Django QStash (serverless background tasks)
  • Cloudfare Tunnels (public domain to accept webhooks)

The system continuously discovers, collects, and structures trending discussions across Reddit and specific subreddits.

The agent automates the research workflow by identifying emerging themes, popular conversations, and high-engagement posts, transforming raw social data into actionable intelligence for content creators, researchers, and product teams.

Key Capabilities

  • 🔍 Track trending keywords and topics across Reddit and targeted subreddits

  • ⚡ Fetch near real-time discussion data using scalable web scraping infrastructure

  • 🧠 Apply LLM-powered reasoning to summarize, categorize, and analyze trends

  • 🖥️ Manage and explore results through a Django-based backend and Jupyter notebooks

This project serves as a foundation for AI-assisted content discovery, enabling faster, data-driven decisions in content strategy, market research, and trend analysis.

Note:

Still in development

About

Build a Reddit Content Research Agent with LLMs, LangChain, SERP, Jupyter, Django, Bright Data, Celery, Django QStash etc

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors