Marten

Local-first agent control plane for issue-driven coding and review workflows.

中文文档 · Architecture · Runtime Contracts · RAG Surface · Docs Index

Marten turns Feishu / GitHub / GitLab / MCP / local worktrees into one executable agent chain so Main Agent -> Ralph -> Code Review Agent can work on real issues, real repositories, and real review loops instead of prompt-only demos.

Overview

Agent-first, but not prompt-only
LLM + skill + MCP as the primary capability surface
Local repository execution for coding, testing, and review
Stable runtime contracts instead of brittle workflow sprawl
Retrieval facade kept behind a stable contract for future backend integration

Why Marten

Most agent demos stop at planning or generate text against partial context. Marten is built for a narrower but more useful target: take a real request, route it through issue intake, execute code in a real checkout, run review and repair loops, then notify the right channel only after the chain is actually complete.

The repository is opinionated about that boundary:

MCP is for platform access and external system bridging
LLMs and skills are for reasoning, coding, and review
Local worktrees are for real file context, commands, and validation
JSON-first schemas are preferred over hard-coded orchestration growth

At A Glance

Layer	Responsibility
`channel`	Feishu inbound and outbound delivery
`control plane`	task lifecycle, polling, repair loops, final delivery gates
`runtime`	model access, skills, MCP bridge, token accounting, provider wiring
`agents`	Main Agent, Ralph, Code Review Agent

Core Workflow

flowchart LR
    A["Feishu / API Request"] --> B["Main Agent"]
    B -->|"chat mode"| C["Direct Response"]
    B -->|"coding handoff"| D["GitHub Issue / Control Task"]
    D --> E["Ralph Worker"]
    E --> F["Local Worktree Coding"]
    F --> G["PR + Review Handoff"]
    G --> H["Code Review Agent"]
    H -->|"approved"| I["Final Delivery"]
    H -->|"blocking"| J["Repair Loop"]
    J --> E
    J -->|"3 blocking rounds"| K["needs_attention"]

Highlights

Main Agent separates chat mode from coding_handoff mode
Ralph works in a local worktree and emits structured coding and review artifacts
Code Review Agent produces stable machine-readable and human-readable review payloads
Final delivery is gated on review approval, not just coding completion
Retrieval stays behind a unified contract so future vector backends can be added without changing the main chain

Architecture

Marten is optimized around one stable path:

Feishu / API -> Main Agent -> GitHub issue -> Ralph coding -> local validation -> review -> final delivery

That path is the project center of gravity. If a change does not make this chain stronger, safer, or easier to operate, it should be treated as low priority.

Key references:

Getting Started

Requirements

Python 3.11, 3.12, or 3.13
Git
a usable LLM provider credential
optional GitHub / GitLab / Feishu / MCP configuration

Install

python3.11 -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
pip install -e .

Configure

cp .env.example .env
cp mcp.json.example mcp.json
cp models.json.example models.json
cp platform.json.example platform.json

Configuration responsibilities:

mcp.json: MCP server command, args, env, cwd, adapter, and external tokens
models.json: provider credentials, API base, default model, and profile bindings
platform.json: repository target and runtime behavior overrides
agents.json: optional agent workspace, skills, MCP servers, prompt spec, model profile
.env: deployment-time overrides, not the primary source of truth

Minimal practical setup:

mcp.json: at least one GitHub MCP server with a valid token
models.json: at least one working model provider
platform.json: at least github.repository
.env: only when runtime overrides are needed

Run

uvicorn app.main:app --host 0.0.0.0 --port 8000

Private Server Self-Host

当前第一阶段的目标运行形态是“私有服务器自用”，不是本机临时脚本集合。

Feishu 是默认用户入口
Web/API 是诊断、运维和备用入口
同一时刻只允许一个 active 主任务
新请求必须进入队列或收到明确 busy 语义
目标仓库遵循“请求优先，否则配置默认”

推荐的最小运行模型已经固定：

一个 API/webhook 进程
一个 scheduler/worker 进程
第一阶段不采用单进程内挂 scheduler

Recommended first checks:

GET /health
GET /diagnostics/integrations
GET /control/operator/state
确认 self_host_boot.ready=true 且 self_host_boot.process_model=split_process
确认 feishu.inbound_status=ready 且 feishu.delivery_status=ready
用同一个 Feishu chat 连续发送一次状态查询和一次 coding 请求，确认 session continuity 不断裂
POST /main-agent/intake
POST /control/tasks/{task_id}/actions
POST /workers/sleep-coding/poll

Recommended process startup contract:

API/webhook process:
- uvicorn app.main:app --host 0.0.0.0 --port 8000
scheduler/worker process:
- python scripts/run_worker_scheduler.py

Local-First Execution

The default path is not "stream a large codebase through MCP into a model." The default path is:

Materialize code into a local worktree or checkout.
Let the agent read files and run commands locally.
Use MCP or platform APIs only for issue, PR, comment, and notification bridges.

Important defaults:

Ralph uses the built-in agent runtime as the only supported main-chain execution path
Review is local-first across intermediate rounds
Blocking review feedback immediately enters the next repair loop
Final delivery happens only after review approval

For live end-to-end validation, enable live_test in platform.json and run:

python scripts/run_test_suites.py live

RAG And Retrieval

Marten keeps retrieval behind a stable facade so upper layers do not care which vector store is active.

Provider selection is configuration-driven
Search and fetch mapping stay normalized at the retrieval layer
Collection schema and incremental indexing can be handled per provider behind the same contract
当前保留的是最小 retrieval contract、policy 与 indexing 基础面；具体向量库接入留到真正需要时再补

Design reference:

RAG Provider Surface

API Surface

GET /health
GET /diagnostics/integrations
GET /control/operator/state
POST /gateway/message
POST /webhooks/feishu/events
POST /main-agent/intake
POST /control/tasks/{task_id}/actions
POST /workers/sleep-coding/poll
GET /workers/sleep-coding/claims
GET /control/tasks/{task_id}
GET /control/tasks/{task_id}/events
GET /tasks/sleep-coding/{task_id}
GET /reviews/{review_id}

对当前私有服务器自用阶段，API 是 operator surface，不是第二套业务编排面。默认用户流量优先从 Feishu 进入，再由 control plane 驱动 main-agent -> ralph -> code-review-agent -> delivery 主链。

Testing

Run the default fast suite:

python scripts/run_test_suites.py quick

Run the broader non-live regression suite:

python scripts/run_test_suites.py regression

Run the non-default evolution/example suite separately when needed:

python scripts/run_test_suites.py manual

Run live validation separately:

python scripts/run_test_suites.py live

Important regression areas:

Main Agent intake and mode routing
Feishu webhook intake, endpoint routing, and session continuity
worker polling and claim flow
Ralph local-first execution artifacts
review materialization and repair loop control
session/task registry and token ledger boundaries
shared LLM runtime normalization and pricing
retrieval provider contract stability
MVP end-to-end chain behavior
live-chain remains intentionally isolated because it depends on real local integrations

Documentation

Development Rules

Do not implement directly on main; create or switch to a work branch first
docs/handoffs/ is only for handoff rules and templates
concrete session handoffs belong in local-only docs/internal/
low-value historical execution notes should be deleted instead of force-archived

Current Scope

The repository is intentionally focused on a single-task production path, not a feature buffet.

In scope:

request intake to issue creation
local-first coding and validation
review, repair, and final delivery
unified runtime contracts
pluggable retrieval providers

Not yet expanded:

multi-repository concurrent scheduling
multi-reviewer aggregation
platformized long-term memory and context compression

Roadmap

continue reducing issue-only context assumptions
strengthen source materialization across GitHub and GitLab
keep shrinking Python fallback paths where the agent runtime can carry the work
keep public and runtime payload contracts explicit and stable
keep the repository small enough to debug without losing production utility

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
agents		agents
app		app
docs		docs
scripts		scripts
skills		skills
tests		tests
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
README_CN.md		README_CN.md
SECURITY.md		SECURITY.md
STATUS.md		STATUS.md
agents.json.example		agents.json.example
mcp.json.example		mcp.json.example
models.json.example		models.json.example
platform.json.example		platform.json.example
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Marten

Overview

Why Marten

At A Glance

Core Workflow

Highlights

Architecture

Getting Started

Requirements

Install

Configure

Run

Private Server Self-Host

Local-First Execution

RAG And Retrieval

API Surface

Testing

Documentation

Development Rules

Current Scope

Roadmap

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Marten

Overview

Why Marten

At A Glance

Core Workflow

Highlights

Architecture

Getting Started

Requirements

Install

Configure

Run

Private Server Self-Host

Local-First Execution

RAG And Retrieval

API Surface

Testing

Documentation

Development Rules

Current Scope

Roadmap

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages