Template repository for portable agent behavior contracts in any development project.
The goal is not to store one big prompt. The goal is to store a small set of durable artifacts that teach agents how to work well in real repos:
- reason from first principles
- work in manager -> builder -> tester -> critic -> reviewer loops
- prefer evidence over fluency
- stay concise and operational
- improve continuously through dated research refreshes
This repo is intended to be copied into a project or used as a submodule.
.
├── AGENTS.md
├── AGENTS_UPDATE_PLAN.md
├── README.md
├── RESEARCH_SNAPSHOT.md
├── templates/
│ ├── AGENT_DECISIONS.template.md
│ ├── AGENT_PROGRESS.template.md
│ └── AGENT_TASKS.template.json
└── personalities/
├── builder.md
├── critic.md
├── manager.md
├── researcher.md
├── reviewer.md
└── tester.md
The template encodes a few strong defaults:
- Start from first principles.
- Inspect the real repo before acting.
- Make the smallest useful change.
- Test it.
- Critique it.
- Review it.
- Repeat until the result is strong enough to ship.
This is intentionally simple. Recent research and engineering writeups keep pointing to the same pattern: simple, explicit loops beat magical prompt complexity.
Copy these files into the target project:
AGENTS.mdpersonalities/- optionally
AGENTS_UPDATE_PLAN.mdandRESEARCH_SNAPSHOT.mdif the project will evolve its agent doctrine
git submodule add <this-repo-url> agent-templateThen reference the files from the root project documentation or symlink the chosen artifacts into place.
- place
AGENTS.mdat the project root - tell agents to follow it by default
- keep the role cards available for larger tasks
- use
manager.md,builder.md,tester.md,critic.md, andreviewer.mdas explicit passes or sub-agent roles - copy the files in
templates/if the project wants durable progress, task, and decision state - require agents to use conventional commit summaries plus a commit body when asked to commit or push
- rerun
AGENTS_UPDATE_PLAN.mdwhenever you change the project’s agent doctrine - update
RESEARCH_SNAPSHOT.mdso future edits are anchored to dated evidence
Use one agent, but force it through the full loop:
Read AGENTS.md and solve this task as one agent.
Work sequentially as manager, builder, tester, critic, and reviewer.
Do not stop at the first plausible answer.
Keep the final output concise and evidence-based.
Use the role cards explicitly:
Manager: frame the task, define done, and split the work.
Builder: implement the smallest useful change.
Tester: verify behavior and regressions.
Critic: challenge assumptions and propose a better version.
Reviewer: decide whether the result is ready to merge.
If the critic or tester finds a real problem, loop again.
When changing how agents should behave:
Use researcher.md.
Rerun AGENTS_UPDATE_PLAN.md.
Check the latest primary sources.
Update RESEARCH_SNAPSHOT.md, AGENTS.md, and README examples together.
Do not promote a new pattern into the template unless the evidence is strong and portable.
When asking an agent to finalize work:
If you commit, use a conventional summary line such as feat:, fix:, or chore:.
Always include a commit body that explains what changed and why.
Do not push a one-line commit message.
The current structure reflects recent evidence from Anthropic, OpenAI, and foundational agent papers:
- iterative critique/refinement still matters
- actor-critic loops work best when the critic is backed by tests or other external checks
- long-running agents need durable handoffs and incremental progress
- multi-agent systems help only when the work is genuinely parallelizable
- evaluation quality matters more than leaderboard aesthetics
- benchmark scores need skepticism because contamination, grader flaws, and infra setup can distort results
See RESEARCH_SNAPSHOT.md for the dated source list and current conclusions.
- Run
AGENTS_UPDATE_PLAN.md. - Refresh
RESEARCH_SNAPSHOT.md. - Update
AGENTS.mdandpersonalities/only where evidence supports a change. - Review the repo for clarity and portability.
- Update README examples so adoption stays easy.