Add agent-driven template + docs/agents.md#129
Conversation
New template demonstrating agent-backed transforms: - analyst agent reused across MapSynthesis + ReduceSynthesis - reporter agent for FoldSynthesis - Same DAG as 03-team-report but with named agents - Transform prompts define task structure, agent instructions define persona - synix.toml with [agents.*] config + instructions files in prompts/ - Full demo case with plan/build/release/search/rebuild/explain New docs/agents.md covering: - Agent protocol (map/reduce/group/fold + task_prompt composition) - agent_id vs fingerprint_value (separate lifecycles) - SynixLLMAgent + PromptStore integration - Workspace config via [agents.*] in synix.toml - Custom agent implementations - Artifact provenance (agent_id + agent_fingerprint)
|
Note Red Team Review — OpenAI GPT-5.4 | Adversarial review (docs + diff only) Threat assessment — Medium-high risk: this PR introduces a new user-facing abstraction (“agents”) but the shipped template/demo is broken and the docs overstate capabilities the implementation does not provide. One-way doors
Findings
Missing
Verdict — Block: this introduces a public abstraction and template while the flagship example is nonfunctional and the docs promise behavior not demonstrated in code. Review parameters
|
|
Note Architectural Review — Claude Opus | Blind review (docs + diff only) Summary — This PR introduces the Agent abstraction (docs, template, demo case) and applies formatting cleanup across several files. Agents are named execution units with stable identity and fingerprinting that compose with transform prompts — persona (HOW) separated from task (WHAT). The bulk of the diff is a new template ( Alignment — Agents fit the vision well. DESIGN.md's core thesis is Python-first programmability and architecture experimentation. Agents extend the transform interface without replacing it — you can swap agents without changing task prompts, and agent fingerprints feed into cache invalidation (materialization keys capture all inputs, per §3.3). The Observations
Verdict — The Agent concept is a sound addition that fits the architecture, but the PR ships a broken template (golden files prove it fails) and lacks tests for the happy path — it needs the Review parameters
|
…oldens - Pipeline creates PromptStore and seeds from prompts/ directory, then binds to agents (was missing — agents failed without store) - LLMClient._complete_anthropic() now extracts system messages from messages list and passes as system= parameter (Anthropic API requires this, was sending system role in messages) - Remove volatile search golden (LLM output varies without cassette) - Add output_masks for LLM-dependent steps in case.py
|
Note Red Team Review — OpenAI GPT-5.4 | Adversarial review (docs + diff only) Threat assessment — Medium-high risk: this looks small in code, but it introduces a new user-facing abstraction (“agents”) with unclear semantics, docs/runtime drift, and weak test coverage around the actual failure paths. One-way doors
Findings
Missing
Verdict — Block: the core bugfix in Anthropic handling may be fine, but this PR also ships a new abstraction with doc/code drift, boundary violations, and a template that contradicts the project’s own storage model. Review parameters
|
|
Note Architectural Review — Claude Opus | Blind review (docs + diff only) Summary — This PR introduces the Agent abstraction — named, reusable execution personas that compose with transform task prompts. It adds the Alignment — Strong fit. DESIGN.md's Python-first principle (§4.1) and the extension model (custom transforms, prompt functions) both anticipate this. Agents add a composable axis of variation — persona is now independently versionable from task prompt — without violating core invariants. The Observations
Verdict — This is a well-designed extension that adds a meaningful composition axis (persona × task) while preserving cache and provenance invariants; the Anthropic fix and Review parameters
|
|
Note Red Team Review — OpenAI GPT-5.4 | Adversarial review (docs + diff only) Threat assessment — Medium risk: there’s one real behavioral fix here, but the PR mostly papers over an unfinished “agents” abstraction and introduces user-facing template/docs commitments that outpace the implementation. One-way doors
Findings
Missing
Verdict — Ship with fixes: the Anthropic adapter change is probably correct, but the agents/template/docs story is prematurely hardened and currently conflicts with the project’s own design direction. Review parameters
|
|
Note Architectural Review — Claude Opus | Blind review (docs + diff only) Summary This PR introduces the Agent abstraction — named, reusable execution personas that compose with transform task prompts. It adds the Alignment Strong fit. Agents are a clean extension of the "Python-first" and "architecture is a runtime concern" principles from DESIGN.md. The Observations
Verdict Good incremental step that adds a well-designed extension point; the main gap is missing tests for the Anthropic system-message fix and the duplicated template files. Review parameters
|
Summary
Template 09-agent-driven — demo pipeline using agent-backed transforms. Same DAG as 03-team-report but with named agents:
analystagent reused across MapSynthesis + ReduceSynthesis (same persona, different tasks)reporteragent for FoldSynthesissynix.tomlwith[agents.*]config + instruction files inprompts/docs/agents.md — comprehensive agent documentation covering:
Test plan
uv run synix demo run templates/09-agent-driven— all 12 steps passuv run release— full gate including all demos