Agent-Driven Design (ADD)

A conceptual framework for designing systems where LLM-based agents are first-class architectural citizens — and a collection of runnable examples that show how the framework applies in practice.

The central thesis: every agent is composed of exactly two parts — a Model and a Harness. Getting their responsibilities right is the core design problem.

What is ADD?

Domain-Driven Design gave us vocabulary for decomposing complex software around business domains. ADD does the same for agentic systems — answering questions DDD was not designed for:

When does logic belong in the model's reasoning versus in code?
Where do you draw the boundary between two agents?
When is a single agent enough, and when do you decompose?
How do RAG, fine-tuning, evals, and observability fit into the architecture?

ADD is not a framework library. It is a design language.

Core Concepts

Concept	Definition
Model	The LLM. Responsible for reasoning, generation, and judgment.
Harness	All code surrounding the Model: prompts, tools, memory, routing, validation.
Agent	Always `Model + Harness`. Neither alone is an agent.
Agent Context Boundary	The conceptual scope of what an agent knows, can do, and owns.
Agent Topology	How agents are arranged and connected in a system.
Harness-Driven Design (HDD)	Improve the agent by changing the Harness. Default strategy.
LLM-Driven Design (LLMDD)	Improve the agent by changing the Model. Applied after HDD is exhausted.
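
The `Model + Harness` composition in the table above can be sketched in a few lines of Python. This is a minimal illustration, not code from this repository: `EchoModel`, `Harness.build_prompt`, `Harness.validate`, and the `ANSWER:` output contract are all hypothetical names chosen for the example, and the Harness here covers only prompt construction and validation (no tools, memory, or routing).

```python
from dataclasses import dataclass
from typing import Protocol


class Model(Protocol):
    """Anything that turns a prompt into text: the reasoning component."""

    def complete(self, prompt: str) -> str: ...


class EchoModel:
    """Hypothetical stand-in Model; a real one would call an LLM API."""

    def complete(self, prompt: str) -> str:
        # Echo the last line of the prompt back in the agreed output format.
        return f"ANSWER: {prompt.splitlines()[-1]}"


@dataclass
class Harness:
    """Code surrounding the Model. Only prompt building and validation here."""

    system_prompt: str

    def build_prompt(self, user_input: str) -> str:
        return f"{self.system_prompt}\n{user_input}"

    def validate(self, output: str) -> str:
        # Contract enforcement lives in code, not in the Model's reasoning.
        prefix = "ANSWER:"
        if not output.startswith(prefix):
            raise ValueError("model output violated the output contract")
        return output[len(prefix):].strip()


@dataclass
class Agent:
    """Always Model + Harness; neither alone is an agent."""

    model: Model
    harness: Harness

    def run(self, user_input: str) -> str:
        prompt = self.harness.build_prompt(user_input)   # Harness
        raw = self.model.complete(prompt)                # Model
        return self.harness.validate(raw)                # Harness


agent = Agent(model=EchoModel(), harness=Harness(system_prompt="You are a helpful assistant."))
result = agent.run("What is ADD?")
print(result)
```

Swapping `EchoModel` for another implementation of `complete` changes the Model without touching the Harness, and vice versa, which is the separation the two improvement strategies rely on.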

The decision rule: Does this require reasoning or judgment? → Model. Is this structure, flow, or contract? → Harness. Does this serve the Harness? → Infra.

The improvement rule: Is this a context, tool, or routing problem? → HDD. Is the Model reasoning incorrectly despite correct context? → LLMDD.
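
As a rough sketch, the improvement rule can be read as a triage function over a failed agent run. The `FailureReport` fields and the `choose_strategy` name are assumptions made for this example; in practice each flag would come from inspecting traces and eval results rather than being set by hand.

```python
from dataclasses import dataclass
from enum import Enum


class Strategy(Enum):
    HDD = "change the Harness"
    LLMDD = "change the Model"


@dataclass
class FailureReport:
    """Hypothetical summary of what went wrong in one failed run."""

    context_was_correct: bool   # did the Model see the information it needed?
    tools_were_correct: bool    # were the right tools available and working?
    routing_was_correct: bool   # did the request reach the right agent or step?


def choose_strategy(report: FailureReport) -> Strategy:
    """Apply the improvement rule: fix the Harness first, the Model last."""
    harness_ok = (
        report.context_was_correct
        and report.tools_were_correct
        and report.routing_was_correct
    )
    # Only when the Harness did its job and the Model still reasoned
    # incorrectly does the rule point to LLMDD.
    return Strategy.LLMDD if harness_ok else Strategy.HDD


missing_context = FailureReport(False, True, True)
bad_reasoning = FailureReport(True, True, True)
print(choose_strategy(missing_context).name)
print(choose_strategy(bad_reasoning).name)
```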

Where Everything Fits

Repository Structure

agent-driven-design/
├── core/                    # Framework concepts and glossary
├── patterns/
│   ├── rag/                 # Retrieval-Augmented Generation as a Harness pattern
│   ├── integration/         # Connecting agents to external systems
│   ├── memory/              # Episodic, semantic, and working memory
│   └── loops/               # Agentic loop patterns (ReAct, plan-execute, reflection)
├── production/
│   ├── evals/               # Model eval vs Agent eval vs System eval
│   ├── observability/       # What the Harness must expose and why
│   └── fine-tuning/         # When and why to move logic from Harness into Model
├── guides/                  # How everything connects to ADD
└── examples/                # Runnable code
    ├── single-agent/        # Claude, OpenAI, LangChain
    ├── multi-agent/         # LangGraph, hierarchical
    ├── loops/               # ReAct, plan-execute, reflection — manual + LangGraph
    └── observability/       # Langfuse, LangSmith, Phoenix, OpenTelemetry

Reference System

The reference/ directory is a complete, runnable system that demonstrates every ADD concept working together. It is the ADD equivalent of a DDD reference application: a real fraud detection agent, not a toy.

It shows:

Three versions of the same agent (v1 → v2 → v3), each representing an HDD iteration with working evals before and after
Output evals, trajectory evals, and LLM-as-Judge — all running against a 20-case golden dataset
Observability — every agent run captures a full trace; every eval score attaches to a trace ID
The LLMDD decision point — documented with evidence of what Harness exhaustion actually looks like

# Run evals on the latest agent version
python reference/scripts/run_evals.py --version v3

# Compare all three versions side by side (shows HDD progression)
python reference/scripts/compare_versions.py
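
To make the link between evals and traces concrete, here is a small self-contained sketch of an output eval and a trajectory eval whose scores attach to a trace ID. It does not reproduce the code in reference/: `stub_agent`, `Trace`, `EvalScore`, `run_evals`, and the three made-up cases are illustrative assumptions, and the real golden dataset has 20 cases.

```python
import uuid
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# A hypothetical agent signature: input -> (final label, list of tool calls made).
AgentFn = Callable[[Dict], Tuple[str, List[str]]]


@dataclass
class Trace:
    trace_id: str
    case_id: str
    output: str
    steps: List[str]


@dataclass
class EvalScore:
    trace_id: str   # every score points back at the run that produced it
    name: str
    value: float


def stub_agent(case_input: Dict) -> Tuple[str, List[str]]:
    """Stand-in agent: flags large amounts after one (pretend) tool call."""
    steps = ["lookup_history"]
    label = "fraud" if case_input["amount"] > 1000 else "legit"
    return label, steps


def run_evals(agent: AgentFn, golden: List[Dict]) -> Tuple[List[Trace], List[EvalScore]]:
    traces: List[Trace] = []
    scores: List[EvalScore] = []
    for case in golden:
        output, steps = agent(case["input"])
        trace = Trace(uuid.uuid4().hex, case["id"], output, steps)
        traces.append(trace)
        # Output eval: did the final answer match the expected label?
        scores.append(EvalScore(trace.trace_id, "output_match", float(output == case["expected"])))
        # Trajectory eval: did the run call the tool we expected it to use?
        scores.append(EvalScore(trace.trace_id, "trajectory", float(case["expected_tool"] in steps)))
    return traces, scores


# Made-up cases for illustration only; not taken from the repository.
golden_dataset = [
    {"id": "case-1", "input": {"amount": 5000}, "expected": "fraud", "expected_tool": "lookup_history"},
    {"id": "case-2", "input": {"amount": 20}, "expected": "legit", "expected_tool": "lookup_history"},
    {"id": "case-3", "input": {"amount": 900}, "expected": "fraud", "expected_tool": "lookup_history"},
]

traces, scores = run_evals(stub_agent, golden_dataset)
output_accuracy = sum(s.value for s in scores if s.name == "output_match") / len(golden_dataset)
print(f"output accuracy: {output_accuracy:.2f}")
```

Because each `EvalScore` carries a `trace_id`, a failing case such as `case-3` can be traced back to the exact run, which is what makes before-and-after comparisons across Harness versions possible.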

See reference/README.md for the full walkthrough.

Runnable Examples

All examples are in Python. Each one is annotated to show where Model and Harness responsibilities begin and end.

Example	Provider	Pattern
single-agent/claude	Anthropic	Basic agent with tools
single-agent/openai	OpenAI	Basic agent with tools
single-agent/langchain	Agnostic	LangChain abstraction
multi-agent/langgraph	Agnostic	Orchestrator + workers
multi-agent/hierarchical	Anthropic	Hierarchical topology
loops/react/manual	Anthropic	ReAct loop from scratch
loops/react/langgraph	Agnostic	ReAct with LangGraph
loops/plan-execute/manual	Anthropic	Plan-execute from scratch
loops/reflection/manual	Anthropic	Reflection loop from scratch
observability/langfuse	Any	Tracing with Langfuse
observability/langsmith	Any	Tracing with LangSmith
observability/phoenix	Any	Tracing with Arize Phoenix
observability/opentelemetry	Any	OTel-native tracing
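
For a sense of what the annotation of Model and Harness responsibilities can look like, here is a minimal ReAct-style loop written from scratch. It is a sketch under assumptions, not a copy of loops/react/manual: the Model is replaced by a scripted function so the code runs without an API key, and the `Action:` / `Observation:` / `Final:` line format, `scripted_model`, `react_loop`, and the `add` tool are all hypothetical.

```python
from typing import Callable, Dict, List


def scripted_model(transcript: List[str]) -> str:
    """Stand-in for an LLM: picks the next step from the transcript so far."""
    observations = [line for line in transcript if line.startswith("Observation: ")]
    if not observations:
        return "Action: add 2 3"
    return "Final: " + observations[-1].split(": ", 1)[1]


def add(args: str) -> str:
    """A trivial tool: adds two whitespace-separated integers."""
    a, b = args.split()
    return str(int(a) + int(b))


TOOLS: Dict[str, Callable[[str], str]] = {"add": add}


def react_loop(model: Callable[[List[str]], str], question: str, max_steps: int = 5) -> str:
    transcript = [f"Question: {question}"]
    for _ in range(max_steps):
        # MODEL: reasoning and judgment (which action next, or stop?).
        step = model(transcript)
        transcript.append(step)

        # HARNESS: flow control (detect the final answer and exit the loop).
        if step.startswith("Final: "):
            return step[len("Final: "):]

        # HARNESS: contract (parse the action and dispatch to a known tool).
        _, tool_name, tool_args = step.split(" ", 2)
        if tool_name in TOOLS:
            observation = TOOLS[tool_name](tool_args)
        else:
            observation = f"unknown tool {tool_name}"
        transcript.append(f"Observation: {observation}")

    # HARNESS: step budget, so a confused Model cannot loop forever.
    raise RuntimeError("step budget exhausted")


answer = react_loop(scripted_model, "What is 2 + 3?")
print(answer)
```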

Status

Early-stage research framework. Concepts are stable; documentation and examples are under active development. Contributions, critiques, and counterexamples are welcome.

See CONTRIBUTING.md.

License

MIT
