A technical documentation generation tool that transforms video transcripts into clear, structured documentation using AI.
Simmons uses AI agents to convert spoken knowledge from video transcriptions into well-formatted technical documentation. It is suited to recorded meetings, tutorials, technical presentations, and educational content.
- 🤖 AI-Powered Processing: Uses multiple specialized AI agents for content creation, review, and formatting
- 🧠 Advanced Reasoning: Two of the agents use reasoning, with up to 3 attempts, to improve content generation and formatting
- 📝 Multi-Language Support: Generate documentation in multiple languages
- 🔄 Sequential Workflow: Technical writer → Content reviewer → Markdown specialist
- 📋 Structured Output: Creates well-formatted Markdown documentation with proper sections
- 🎯 Topic-Focused: Tailors content generation to specific technical topics
- 📊 Comprehensive Coverage: Automatically captures all topics mentioned in transcriptions, including implicit ones
- 🚀 Command Line Interface: Easy-to-use CLI for quick processing
Simmons uses the CrewAI framework to orchestrate three specialized AI agents with enhanced reasoning capabilities:
- Technical Content Writer: Interprets and restructures transcription content into clear, technical language with advanced reasoning
- Content Reviewer: Reviews and refines the content for clarity and technical accuracy
- Markdown Specialist: Converts the reviewed content into well-formatted Markdown documentation with reasoning-based optimization
- Enhanced Reasoning: Technical Content Writer and Markdown Specialist agents use advanced reasoning with up to 3 attempts for optimal results
- Comprehensive Coverage: Automatically includes all topics mentioned in transcriptions, even implicit ones
- Quality Assurance: Multi-layer review process ensures technical accuracy and clarity
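The hand-off between the three agents can be pictured with a minimal sketch in plain Python. This is not the CrewAI API and not the project's code: the function names and their bodies are hypothetical stand-ins, chosen only to show that each stage consumes the previous stage's output.

```python
# Hypothetical stand-ins for the three agents; the real agents call an LLM.

def technical_writer(transcript: str) -> str:
    """Stand-in for the writer: collapse whitespace into one clean draft."""
    return " ".join(transcript.split())


def content_reviewer(draft: str) -> str:
    """Stand-in for the reviewer: make sure the draft ends with a period."""
    return draft if draft.endswith(".") else draft + "."


def markdown_specialist(reviewed: str, topic: str) -> str:
    """Stand-in for the formatter: wrap the reviewed text in a Markdown page."""
    return f"# {topic}\n\n{reviewed}\n"


def run_pipeline(transcript: str, topic: str) -> str:
    """Run the stages in order: writer -> reviewer -> Markdown specialist."""
    draft = technical_writer(transcript)
    reviewed = content_reviewer(draft)
    return markdown_specialist(reviewed, topic)


document = run_pipeline("Docker  packages\napps", "Docker Containers")
print(document)
```

The sketch leaves out the reasoning attempts described above, since how those are configured is specific to CrewAI.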
- Python 3.10 to 3.13
- An OpenAI API key (for the GPT-4o-mini model)
- Clone the repository:
git clone https://github.com/thrsouza/simmons.git
cd simmons
- Install dependencies:
# Using uv (recommended)
uv sync
# Or using pip
pip install -e .
- Set up your OpenAI API key:
export OPENAI_API_KEY="your-api-key-here"
# Generate documentation from a transcript file
python -m simmons.main -t "Python Programming" -f transcript.txt -l "en-US"
# Or using the installed package
simmons -t "Docker Containers" -f meeting_transcript.txt -l "pt-BR"
- -t, --topic: Topic for the documentation generation (e.g., "Python Programming", "Docker Containers")
- -f, --file: Path to the transcription file (required)
- -l, --language: Language for the documentation (default: "en-US")
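The three options can be reproduced with a minimal `argparse` sketch. It is an illustration of the documented flags, not the project's actual parser: `build_parser` is a hypothetical name, and treating `--topic` as required is an assumption, since only `--file` is documented as required.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser that mirrors the documented -t, -f, and -l options."""
    parser = argparse.ArgumentParser(
        prog="simmons",
        description="Generate documentation from a transcript file.",
    )
    # Assumption: the topic is treated as required in this sketch.
    parser.add_argument("-t", "--topic", required=True,
                        help="Topic for the documentation generation")
    parser.add_argument("-f", "--file", required=True,
                        help="Path to the transcription file")
    parser.add_argument("-l", "--language", default="en-US",
                        help="Language for the documentation")
    return parser


# Parse an explicit argument list instead of sys.argv, for illustration.
args = build_parser().parse_args(
    ["-t", "Docker Containers", "-f", "meeting_transcript.txt"]
)
print(args.topic, args.file, args.language)
```

Omitting `-l` falls back to the documented default language.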
python -m simmons.main \
--topic "REST API Development" \
--file tutorial_transcript.txt \
--language "en-US"
This will generate a file named REST API Development.md with structured documentation.
python -m simmons.main \
--topic "Arquitetura de Microsserviços" \
--file reuniao_transcript.txt \
--language "pt-BR"
python -m simmons.main \
--topic "Machine Learning Fundamentals" \
--file workshop_transcript.txt \
--language "en-US"
Simmons accepts plain text transcription files. The transcription should contain:
- Spoken content from videos, meetings, or presentations
- Natural language explanations of technical concepts
- Any verbal knowledge you want to convert to documentation
Example transcript file:
So today we're going to talk about Docker containers. Docker is a containerization platform that allows you to package applications with their dependencies. Let me explain how this works...
Simmons generates Markdown documentation with the following structure:
# 🎯 [Topic Name]
**📋 TL;DR**
- Summary point 1
- Summary point 2
- Summary point 3
## 📝 [Section Title]
Content with proper formatting...
## 🔧 [Another Section]
More structured content...
The AI agents are configured through YAML files in simmons/config/:
- agents.yaml: Defines the roles, goals, and backstories of the AI agents
- tasks.yaml: Defines the tasks each agent performs, their expected outputs, and processing rules
- Enhanced Topic Coverage: The system now automatically includes all topics mentioned in transcriptions, even if not explicitly stated as section titles
- Improved Model Configuration: Updated to use explicit OpenAI model specification (openai/gpt-4o-mini) for better reliability
- Advanced Reasoning: Technical Content Writer and Markdown Specialist agents now use reasoning capabilities with multiple attempts for optimal results
You can customize these configurations to adjust the behavior and output style of the generated documentation.
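When editing agents.yaml, it can help to check that every agent still defines a role, a goal, and a backstory. The sketch below assumes the file has already been parsed into a dictionary keyed by agent name; the agent keys and the sample values are hypothetical, and the real file layout may differ.

```python
# Fields the README says agents.yaml defines for each agent.
REQUIRED_AGENT_FIELDS = ("role", "goal", "backstory")


def missing_agent_fields(agents: dict[str, dict[str, str]]) -> dict[str, list[str]]:
    """Return, per agent, the required fields that are absent or blank."""
    problems: dict[str, list[str]] = {}
    for name, spec in agents.items():
        missing = [
            field for field in REQUIRED_AGENT_FIELDS
            if not str(spec.get(field, "")).strip()
        ]
        if missing:
            problems[name] = missing
    return problems


# Hypothetical parsed configuration; the second agent lacks a backstory.
sample_agents = {
    "technical_writer": {
        "role": "Technical Content Writer",
        "goal": "Restructure the transcript into clear technical language",
        "backstory": "An experienced documentation author",
    },
    "content_reviewer": {
        "role": "Content Reviewer",
        "goal": "Review the draft for clarity and accuracy",
    },
}

print(missing_agent_fields(sample_agents))
```

An empty result means every agent entry carries all three fields.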
Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.
- Fork the repository
- Create your feature branch (git checkout -b feature/AmazingFeature)
- Commit your changes (git commit -m 'Add some AmazingFeature')
- Push to the branch (git push origin feature/AmazingFeature)
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
Thiago Souza
- Email: dev@thiagosouza.com
- GitHub: @thrsouza
Made with ❤️ for the developer community