Replies: 1 comment
-
|
My current answer is that a safe autonomous coding loop should record enough information to answer four questions after the fact:
In practice that means I want at least:
That is the shape MartinLoop is converging on right now: budgets, verifier gates, explicit stop reasons, rollback evidence for repo-backed runs, and JSONL run records under The new examples/docs in flight are meant to make that concrete from a few angles:
If anyone is already running autonomous coding loops, I would still love sharper feedback on which missing record hurts the most in practice: cost provenance, verifier history, scope boundaries, rollback evidence, or something else. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Ralph-style loops are useful because they let AI coding agents keep working until a task is complete.
The failure mode is that unattended loops can keep spending, retrying, mutating files, or drifting from the original task without a clean stop condition.
MartinLoop is trying to define the governance layer around that pattern:
Question for people running Claude Code, Codex, OpenCode, Goose, Aider, or Ralph-style loops:
What should a safe autonomous coding loop record before it is trusted?
Beta Was this translation helpful? Give feedback.
All reactions