LLM Docs Generator

Generate LLM-friendly documentation files from any documentation website.

This tool downloads a documentation site, extracts the useful content, and produces a clean text file optimized for AI agents and LLMs.

This project follows the concept proposed by the llms.txt initiative — a simple standard for providing LLM-friendly context files for AI systems.
Learn more: https://llmstxt.org/

The process follows a simple pipeline:


Documentation Website
↓
Mirror site (HTTrack)
↓
Extract clean content (Trafilatura)
↓
Build LLM-friendly documentation file

The output can be used as context for AI agents, copilots, and chat assistants.

Requirements

Install the following dependencies.

System

HTTrack

Python

Python 3.10+
pip

Python packages

pip install trafilatura tqdm

Installation

Clone the repository:

git clone https://github.com/vreoo/llms-dot-txt.git
cd llms-dot-txt

Make the script executable:

chmod +x run.sh

Usage

Run the script with:

./run.sh --name <project-name> --url <docs-url>

Example:

./run.sh --name rabbitmq --url https://rabbitmq.com/docs

Output

After the process finishes, the following structure will be created:

llm-docs/
└── rabbitmq/
    ├── site/        # mirrored documentation website
    ├── extracted/   # extracted text content
    └── docs/
        └── rabbitmq-llm.txt
        └── rabbitmq-rag.jsonl

The generated file:

rabbitmq-llm.txt

contains clean documentation formatted for LLM consumption.

Using the Documentation with AI

Example prompt:

Context:
docs/rabbitmq-llm.txt

Task:
Create a Python producer that sends messages to a RabbitMQ queue.

This allows AI agents to reason using official documentation as context.

License

Generated documentation is derived from the original documentation website and follows the license of the source documentation.

Scripts in this repository are licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Docs Generator

Requirements

System

Python

Python packages

Installation

Usage

Output

Using the Documentation with AI

License

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM Docs Generator

Requirements

System

Python

Python packages

Installation

Usage

Output

Using the Documentation with AI

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages