The Automaton Auditor is a production-grade multi-agent swarm designed for the autonomous governance and forensic analysis of AI-generated repositories. It evaluates codebases against architectural best practices (specifically LangGraph patterns) using a dialectical judicial process.
In high-velocity AI-assisted development, architectural drift is common. Systems built with complex frameworks like LangGraph often suffer from "vibe coding"—where the outward structure looks correct, but core mechanisms like parallel state reducers, typed state transitions, and deterministic synthesis are missing or broken. Manual auditing of these patterns is slow and error-prone.
Automaton Auditor automates this governance by deploying a swarm of specialized agents:
- Detectives: Perform deep forensic analysis (AST parsing, Git history, PDF RAG).
- Judges: Evaluate evidence through conflicting personas (Prosecutor, Defense, Tech Lead).
- Supreme Court: Synthesizes a final verdict with deterministic rules and security caps.
graph TD
A[Repo URL / PDF Path] --> B[Pre-Audit Check]
B -->|Changes?| C{Delta Decision}
C -->|Yes / New| D[Detective Swarm]
C -->|No| E[Supreme Court]
subgraph Detectives
D1[Repo Investigator]
D2[Doc Analyst]
D3[Vision Inspector]
end
D1 & D2 & D3 --> F[Evidence Aggregator]
F --> G[Judges Bench]
subgraph Judges
J1[Prosecutor]
J2[Defense]
J3[Tech Lead]
end
J1 & J2 & J3 --> E[Supreme Court]
E --> H[Final Report & Metadata]
- Python 3.10+
- uv (Recommended package manager)
- OpenAI API Key (Configured in
.env)
# Clone the repository
git clone <repo-url>
cd Automation_Auditor
# Install dependencies using uv
uv sync# Run a full audit on a local directory or remote repo
uv run python main.py . --pdf path/to/report.pdf --thread-id audit-001# Launch the monitoring dashboard
uv run streamlit run app.pyTo verify the system's own sophistication, the Auditor was subjected to a "Self-Audit" before and after a major focus on architectural excellence.
| Milestone | Overall Score | Key Improvements |
|---|---|---|
| v1.0 (Baseline) | 1.7 / 5.0 | Basic graph, missing reducers, brittle PDF handling. |
| v2.0 (Upgraded) | 4.0 / 5.0 | Parallel branches, state reducers, robust fallback discovery, judicial nuance. |
Tip
Why the score jumped? The upgrade introduced AST-level connectivity analysis for fan-out/fan-in detection and verified the existence of Annotated state reducers, moving the project from "vibe coding" to "engineered architecture."
- State Reducers: See
src/state.py. We useoperator.addfor list merges andmerge_evidencesfor dictionary synchronization. - AST Visitor: See
src/tools/repo_tools.py. TheLangGraphVisitortracksnode_sourcesandnode_targetsfor true connectivity verification. - Synthesis Logic: See
src/nodes/supreme_court.py. Deterministic rules like theRule of Securitycan cap scores regardless of judicial opinion.


