Transform trending topics into golden blog posts with just a few keywords!
I wanted to write SEO-friendly blog posts for my portfolio website pratim.me about the latest technology trends, especially in AI and software engineering. However, keyword research, content research, and writing well-structured, SEO-friendly blog posts are time-consuming. As a lazy engineer, I decided to leverage an LLM to help automate the process. And thus, BlogForge was created!
### Keyword Research & Search
- Takes trending Google search keywords (up to 5 at a time).
- Performs Google searches using googlesearch-python, focusing on blog posts related to these topics.
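The search step can be sketched roughly like this, using `googlesearch-python`'s `search` helper. The query template and the helper names are illustrative, not taken from the actual source:

```python
def build_blog_query(keyword: str) -> str:
    """Bias results toward blog posts for a trending keyword (illustrative filter)."""
    return f"{keyword} blog post"

def find_blog_urls(keywords: list[str], per_keyword: int = 7) -> list[str]:
    """Search Google for each keyword (capped at 5) and collect result URLs."""
    from googlesearch import search  # pip install googlesearch-python
    urls: list[str] = []
    for kw in keywords[:5]:  # the tool takes up to 5 keywords at a time
        urls.extend(search(build_blog_query(kw), num_results=per_keyword))
    return urls
```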
### Content Crawling
- Crawls the first n pages (configurable, currently set to 7) using Jina AI's Public API.
- Alternatively, uses Crawl4AI's API (configured via Docker Compose) for self-hosted crawling.
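Fetching a page through Jina's public Reader API amounts to prefixing the target URL with `https://r.jina.ai/` and, optionally, sending an API key. A minimal sketch (the function names are mine, not the project's):

```python
import os
import urllib.request

JINA_READER = "https://r.jina.ai/"

def reader_url(page_url: str) -> str:
    """Jina's Reader API is invoked by prefixing the target URL."""
    return JINA_READER + page_url

def fetch_markdown(page_url: str) -> str:
    """Fetch a page as LLM-friendly markdown via Jina's public Reader API."""
    req = urllib.request.Request(reader_url(page_url))
    api_key = os.getenv("JINA_API_KEY")
    if api_key:  # works without a key, but a key raises the rate limits
        req.add_header("Authorization", f"Bearer {api_key}")
    with urllib.request.urlopen(req) as resp:
        return resp.read().decode("utf-8")
```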
### Data Processing & Storage
- Chunks the crawled data.
- Embeds them using Google Gemini’s embeddings.
- Stores the processed data in a local ChromaDB vector store.
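The chunk → embed → store pipeline could look roughly like the sketch below. The chunk sizes, collection name, and embedding model name are assumptions for illustration, not read from the source:

```python
def chunk_text(text: str, size: int = 1000, overlap: int = 200) -> list[str]:
    """Split crawled markdown into overlapping character chunks."""
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap
    return chunks

def store_chunks(chunks: list[str], collection_name: str = "blogforge") -> None:
    """Embed chunks with Gemini and persist them in a local ChromaDB collection."""
    import chromadb
    import google.generativeai as genai  # assumes GOOGLE_API_KEY is configured
    client = chromadb.PersistentClient(path="./chroma")
    collection = client.get_or_create_collection(collection_name)
    embeddings = [
        genai.embed_content(model="models/text-embedding-004", content=c)["embedding"]
        for c in chunks  # model name is illustrative
    ]
    collection.add(
        ids=[f"chunk-{i}" for i in range(len(chunks))],
        documents=chunks,
        embeddings=embeddings,
    )
```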
### AI-Generated Blog Writing
- Supports multiple LLM providers:
- Gemini via Google AI
- LLaMA models via Groq
- DeepSeek models via DeepSeek AI
- Local models via Ollama
- Generates structured, SEO-optimized blog posts using your preferred LLM
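A provider switch like the one in `rag/llm.py` might be sketched as follows. The LangChain wrappers and the DeepSeek OpenAI-compatible base URL are assumptions about the implementation, not a reading of the actual source:

```python
import os

def make_llm(provider: str, model: str):
    """Return a chat-model client for the configured provider (hypothetical factory)."""
    provider = provider.upper()
    if provider == "GEMINI":
        from langchain_google_genai import ChatGoogleGenerativeAI
        return ChatGoogleGenerativeAI(model=model)
    if provider == "GROQ":
        from langchain_groq import ChatGroq
        return ChatGroq(model=model)
    if provider == "DEEPSEEK":
        from langchain_openai import ChatOpenAI  # DeepSeek exposes an OpenAI-compatible API
        return ChatOpenAI(model=model, base_url="https://api.deepseek.com",
                          api_key=os.environ["DEEP_SEEK_API_KEY"])
    if provider == "OLLAMA":
        from langchain_ollama import ChatOllama
        return ChatOllama(model=model)
    raise ValueError(f"Unknown LLM provider: {provider}")
```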
### Interactive Refinement
- Provides a Streamlit-based UI that lets users refine the blog post interactively.
- Allows users to chat with the AI to fine-tune content.
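The refinement chat can be sketched with Streamlit's chat elements. The prompt wording and function names below are illustrative, assuming a draft-plus-instruction loop:

```python
def refine_prompt(draft: str, instruction: str) -> str:
    """Build the refinement prompt sent back to the LLM (wording is illustrative)."""
    return f"Revise the blog post below.\n\nInstruction: {instruction}\n\nDraft:\n{draft}"

def chat_ui() -> None:
    """Minimal Streamlit chat loop for interactive refinement (sketch)."""
    import streamlit as st
    st.title("BlogForge")
    if "messages" not in st.session_state:
        st.session_state.messages = []
    for msg in st.session_state.messages:  # replay the conversation so far
        with st.chat_message(msg["role"]):
            st.markdown(msg["content"])
    if instruction := st.chat_input("How should the post change?"):
        st.session_state.messages.append({"role": "user", "content": instruction})
        # ...call the configured LLM with refine_prompt(...) and append its reply...
```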
### Session Management & Persistence
- Uses Supabase DB for storing crawled website content and user chat history.
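Persisting a chat turn with the Supabase Python client is roughly a one-liner; the table and column names below are illustrative, not the project's actual schema:

```python
import os

def history_row(session_id: str, role: str, content: str) -> dict:
    """Shape of a chat-history record (table and column names are illustrative)."""
    return {"session_id": session_id, "role": role, "content": content}

def save_message(session_id: str, role: str, content: str) -> None:
    """Persist one chat turn to Supabase."""
    from supabase import create_client  # pip install supabase
    client = create_client(os.environ["SUPABASE_URL"], os.environ["SUPABASE_KEY"])
    client.table("chat_history").insert(history_row(session_id, role, content)).execute()
```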
Install pipenv:

```shell
pip install pipenv
```

Then create the environment and install the dependencies:

```shell
pipenv shell
pipenv install
```
Copy the example environment file:

```shell
cp example.env .env
```
Populate `.env` with the required API keys and settings:

- `GOOGLE_API_KEY` - Required for Google Search and Gemini models
- `GROQ_API_KEY` - Required for Groq LLM models
- `DEEP_SEEK_API_KEY` - Required for DeepSeek models
- `JINA_API_KEY` - Required when using Jina for crawling
- `SUPABASE_URL` and `SUPABASE_KEY` - Required for database functionality
- `LLM_TO_USE` - Choose your LLM provider: "GEMINI", "GROQ", "DEEPSEEK", or "OLLAMA"
- `LLM_MODEL` - Specify the model name for your chosen provider
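A filled-in `.env` might look like this (all values are placeholders, and the model name is just an example):

```env
GOOGLE_API_KEY=your-google-api-key
GROQ_API_KEY=your-groq-api-key
DEEP_SEEK_API_KEY=your-deepseek-api-key
JINA_API_KEY=your-jina-api-key
SUPABASE_URL=https://your-project.supabase.co
SUPABASE_KEY=your-supabase-key
LLM_TO_USE=GEMINI
LLM_MODEL=gemini-1.5-flash
```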
Start the application:

```shell
pipenv run start
```

To use Crawl4AI for self-hosted crawling instead of Jina:

- Set `CRAWLER_PROVIDER=CRAWL4AI` in `.env`.
- Generate a random secret and set it as `CRAWL4AI_API_TOKEN`.
```
├── Pipfile
├── Pipfile.lock
├── README.md
├── __init__.py
├── app.py
├── assets
│   └── logo.webp
├── config.py
├── crawler
│   ├── __init__.py
│   ├── crawler.py
│   └── processor.py
├── db
│   ├── __init__.py
│   ├── supabase.py
│   └── vector_store.py
├── docker-compose.yaml
├── main.py
├── rag
│   ├── __init__.py
│   ├── chains.py
│   ├── chat_history.py
│   ├── embeddings.py
│   ├── llm.py
│   └── prompts.py
├── tools
│   ├── __init__.py
│   ├── crawl4ai.py
│   ├── jina.py
│   └── search.py
└── utils
    ├── __init__.py
    ├── helpers.py
    └── logger.py
```

- ✅ Enable classic LLM chat-like streaming.
- ✅ Improve overall performance.
- ✅ Enable Docker deployment.
We welcome contributions! To contribute:
- Fork the repository.
- Create a new branch (`git checkout -b feature-branch`).
- Commit your changes (`git commit -m 'Add new feature'`).
- Push to your branch (`git push origin feature-branch`).
- Create a pull request.
Please make sure your contributions adhere to best coding practices and include necessary documentation.
This project is licensed under the MIT License.
Big thanks to:
- Jina AI for providing free API access.
- Google AI for Gemini models and embeddings.
- Groq for high-performance LLaMA model inference.
- DeepSeek for their powerful AI models.
- Ollama for local model support.
- Supabase for database storage.
- ChromaDB for the vector store.
- Streamlit for the frontend framework.
Happy Blogging! 🚀
