-
Notifications
You must be signed in to change notification settings - Fork 0
agentic frameworks
Nicolas Cravino edited this page Apr 22, 2026
·
5 revisions
id: agentic-frameworks title: Agentic Frameworks tags: [python, langgraph, langchain, azure, openai, mlx, audio, cli, stt, tts, rag, benchmark] created: 2026-04-11 updated: 2026-04-22 freshness: fresh
Seven repositories demonstrating diverse agentic AI patterns — from a mental model visualization (Agent Stack) through workflow conversion (N8n2LangGraph), coding assistants (DeepAgent Azure CLI), voice analysis research loops (SST-AutoResearch), SAP implementation onboarding automation (ProjectPulse), experimental multi-agent requirements generation (AutoRequirementsAgent), and a head-to-head benchmark of agentic-RAG vs three retrieval baselines (Wiki-vs-RAG). LangGraph and AutoGen are the dominant orchestration frameworks; tool-loop budgets and Karpathy-style autoresearch loops appear repeatedly.
- Agent Stack decomposes agentic systems into 10 layers, reframing the LLM as a fallible component within a larger deterministic system. ^src:agent-stack
- DeepAgent Azure CLI provides a full coding agent (file R/W, shell, glob, grep, sub-agents) wired to Azure OpenAI with Textual TUI, reasoning effort control (low/medium/high), and SQLite session persistence. ^src:deepagent-azure-cli
- DeepAgent Azure CLI can run fully offline using a local MLX server that emulates the Azure OpenAI API. ^src:deepagent-azure-cli
- N8n2LangGraph converts n8n workflow JSON into standalone LangGraph Python scripts, with optional reflection loops and LLM-assisted translation for unsupported nodes. ^src:n8n2langraph
- SST-AutoResearch implements a Karpathy-style autoresearch loop where the LLM drives hypothesis → design → execute → evaluate → reflect cycles over speaker voice dynamics (Takens' embedding, Lyapunov exponents, recurrence analysis). ^src:sst-autoresearch
- ProjectPulse processes meeting recordings through a 6-phase pipeline (ingest → transcribe → analyze → wiki → podcast) with 79 integration tests, checkpoint/resume, and token budget enforcement. ^src:projectpulse
- ProjectPulse generates a full Karpathy-style interlinked wiki with per-meeting summaries, SAP Activate phase roll-ups, cross-cutting themes, stakeholder profiles, decision log, and onboarding Q&A. ^src:projectpulse
- AutoRequirementsAgent experiments with strict JSON-format inter-agent messaging and state-transition functions, using nested GroupChat with local Ollama LLMs (Nemotron 70B, Hermes 70B) for deterministic multi-agent coordination. ^src:autogen-requirements-agent
- Wiki-vs-RAG benchmarks four agent patterns over the sw30labs wiki corpus (single-shot RAG, agentic-RAG with ≤8-turn tool loop, wiki navigation with precomputed sitemap, and QMD local hybrid retriever) with Claude Sonnet 4.6 as the agent and Claude Opus 4.7 as the LLM-judge. ^src:wiki-vs-rag
- Agentic-RAG is the Pareto winner on the 30-query eval — same quality as wiki-nav (4.77 vs 4.69, Δ inside noise) at ~45% of the per-query cost; beats single-shot RAG 11–2 with 17 ties for 4.2× the spend. ^src:wiki-vs-rag
- QMD (BM25 FTS5 + EmbeddingGemma + Qwen3 reranker, fully on-device) is the cheapest arm at $0.005/query with zero embedding API spend, ties on narrow factoids, and collapses on branch / cross-branch synthesis (quality 2.83 / 3.37 vs 4.5+ for the other three) — confirming its "AI grep" framing. ^src:wiki-vs-rag
- Could N8n2LangGraph be used to convert workflows into DeepAgent Azure CLI custom tools?
- How does SST-AutoResearch's audio pipeline compare with ProjectPulse's transcription layer?
- Could Wiki-vs-RAG's tool-loop harness be repurposed to benchmark DeepAgent Azure CLI's agent loop against the same corpus?
- Does the Wiki-vs-RAG result generalize to corpora that aren't already organized as a Karpathy-style wiki?
- Could N8n2LangGraph be used to convert workflows into DeepAgent Azure CLI custom tools?
- How does SST-AutoResearch's audio pipeline compare with ProjectPulse's transcription layer?
- ai-security-pentesting (Agent Stack mental model, DeepAgent Azure context)
- local-inference-mlx (SST-AutoResearch and DeepAgent use MLX backends)
- oscal-compliance (shared LangGraph patterns)
- timeline — 2026-02-11. ^src:agentic-pipeline-code-optional
- timeline — 2025-06-21. ^src:automated-ai-news-workflow
- timeline — 2026-03-08. ^src:one-person-conglomerate-reality
- timeline — 2026-02-24. ^src:best-ai-unrestricted-access