Merlion

A lightweight CLI coding agent built as a reference implementation.

Merlion is a working coding agent you can run from the terminal or from WeChat. It is small enough to read end-to-end, yet complete enough to show the real shape of a coding agent: context assembly, tool execution, session persistence, and verification are all here in code you can actually follow.

Compared with broader tools such as Claude Code and Codex CLI, Merlion keeps the product layer intentionally thin so the runtime stays legible. The goal is twofold: keep the core path compact without leaving it partial, and, if coding agents are going to matter, offer a lightweight system that helps us understand what one actually is.

What This Repo Contains

A runtime loop with planning, tool execution, retries, guardrails, and verification
A context system with orientation, compact summaries, path guidance, and layered AGENTS.md / MERLION.md
A builtin tool layer for files, search, shell, git, config, and LSP-assisted edits
A real sandbox stack: OS-level sandboxing for subprocesses plus application-layer policy enforcement for file, fetch, and approval flows
Two transports: terminal REPL first, plus optional WeChat inbox mode
Bench and regression lanes for fixture tests, BugsInPy, and SWE-bench Lite
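
To make the "runtime loop" item above concrete, here is a minimal sketch of the general shape such a loop tends to take: ask the model for a step, run the requested tool, feed the result back, and stop on a final answer or a turn limit. Every name in it (ToolCall, ModelTurn, runTool, runLoop, maxTurns) is hypothetical and chosen for illustration; it does not mirror the code in src/runtime/loop.ts, and it leaves out planning, retries, and verification.

```typescript
// Hypothetical types; not taken from Merlion's source.
type ToolCall = { tool: string; input: string };
type ModelTurn =
  | { kind: "tool"; call: ToolCall }
  | { kind: "final"; text: string };
type Model = (history: readonly string[]) => ModelTurn;
type Tools = Map<string, (input: string) => string>;

// Guardrail: unknown tools and thrown errors become text the model can
// read on its next turn, instead of crashing the loop.
function runTool(tools: Tools, call: ToolCall): string {
  const tool = tools.get(call.tool);
  if (!tool) return `error: unknown tool "${call.tool}"`;
  try {
    return tool(call.input);
  } catch (err) {
    return `error: ${String(err)}`;
  }
}

// Alternate model turns and tool runs until the model returns a final
// answer, or give up once maxTurns is reached.
function runLoop(model: Model, tools: Tools, task: string, maxTurns = 8): string {
  const history: string[] = [`user: ${task}`];
  for (let turn = 0; turn < maxTurns; turn++) {
    const step = model(history);
    if (step.kind === "final") return step.text;
    history.push(`tool ${step.call.tool}: ${runTool(tools, step.call)}`);
  }
  throw new Error(`no final answer after ${maxTurns} turns`);
}
```

The turn limit is the simplest possible guardrail; a real runtime would presumably layer sandbox checks and verification on top of the same skeleton.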

Why It Stays Lightweight

The core path stays short, but the essential pieces are still there: loop, tools, context, sessions, guardrails, verification
The codebase is small enough to read end-to-end without reverse-engineering a large product surface
It runs as a local Node.js runtime rather than depending on a hosted control plane
The tool layer is practical, but still narrow enough to understand without days of setup
The architecture is opinionated on purpose: fewer abstractions, fewer hidden systems, less ceremony

Lightweight here does not mean incomplete. It means the runtime is kept narrow enough that the design decisions are still visible.

Quick Start

Merlion requires Node.js 22 or later.

Global install:

npm install -g merlion
merlion

Project-local install:

npm install merlion
npx merlion

On first run, Merlion opens a setup wizard for provider, API key, and model. It works with OpenAI-compatible endpoints, including custom base URLs.

Common usage:

# one-shot
merlion "read src/index.ts and summarize the startup flow"

# interactive REPL
merlion

# continue a previous session
merlion --resume <session-id>

# restore the last git checkpoint for a session
merlion undo <session-id>

Default CLI execution runs with:

--sandbox workspace-write
--approval on-failure
--network off

That means Merlion can change files in the current workspace, blocks outbound network access by default, and asks to widen the sandbox only after a sandbox or policy failure.

Useful overrides:

# strict read-only investigation
merlion --sandbox read-only --approval never

# allow networked shell/tool execution
merlion --network full

# fully unsandboxed local run
merlion --sandbox danger-full-access --approval never

Legacy flags still work:

--auto-allow maps to --approval never
--auto-deny maps to --approval untrusted
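
The legacy mapping above amounts to a small argument rewrite. The sketch below is one way to express it, assuming the flags arrive as a plain argv array; normalizeLegacyFlags and LEGACY_APPROVAL_FLAGS are hypothetical names, not Merlion's actual CLI parser.

```typescript
// Approval policies named in this README.
type ApprovalPolicy = "untrusted" | "on-failure" | "on-request" | "never";

// Legacy flag -> policy, exactly as listed above.
const LEGACY_APPROVAL_FLAGS = new Map<string, ApprovalPolicy>([
  ["--auto-allow", "never"],
  ["--auto-deny", "untrusted"],
]);

// Hypothetical helper: replace each legacy flag with its
// `--approval <policy>` equivalent and leave other arguments untouched.
function normalizeLegacyFlags(argv: readonly string[]): string[] {
  const out: string[] = [];
  for (const arg of argv) {
    const policy = LEGACY_APPROVAL_FLAGS.get(arg);
    if (policy === undefined) {
      out.push(arg);
    } else {
      out.push("--approval", policy);
    }
  }
  return out;
}
```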

Shell-like tools such as bash and run_script execute through the sandbox backend. File tools (read_file, write_file, edit_file, create_file, and related mutations) use the same sandbox policy at the application layer, so read-only, deny-read, and deny-write still apply even when no shell is involved. The fetch tool also respects --network.
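
An application-layer write check of the kind described above could look roughly like the following. This is a sketch under stated assumptions: WritePolicy, writableRoots, denyWrite, and canWrite are hypothetical names, paths are assumed to be absolute POSIX paths, and deny-write is assumed to win over every mode. It also resolves ".." segments by hand to stay dependency-free; real code would more likely use node:path and resolve symlinks before comparing.

```typescript
type SandboxMode = "read-only" | "workspace-write" | "danger-full-access";

// Hypothetical policy shape; field names are illustrative only.
interface WritePolicy {
  mode: SandboxMode;
  writableRoots: string[]; // absolute POSIX paths, e.g. the workspace root
  denyWrite: string[];     // absolute POSIX paths that are never writable
}

// Split a POSIX path into segments, resolving "." and "..".
function segments(p: string): string[] {
  const out: string[] = [];
  for (const seg of p.split("/")) {
    if (seg === "" || seg === ".") continue;
    if (seg === "..") out.pop();
    else out.push(seg);
  }
  return out;
}

// True when target is root itself or lies beneath it.
function isInside(root: string, target: string): boolean {
  const r = segments(root);
  const t = segments(target);
  return r.length <= t.length && r.every((seg, i) => seg === t[i]);
}

function canWrite(policy: WritePolicy, target: string): boolean {
  // Assumption: explicit deny rules apply in every mode.
  if (policy.denyWrite.some((d) => isInside(d, target))) return false;
  switch (policy.mode) {
    case "read-only":
      return false;
    case "danger-full-access":
      return true;
    case "workspace-write":
      return policy.writableRoots.some((root) => isInside(root, target));
  }
}
```

Comparing whole path segments rather than string prefixes keeps a root like /repo from accidentally matching /repository.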

Sandbox & Approvals

Sandboxing is one of Merlion's core runtime features, not an afterthought.

Merlion separates two concerns:

sandbox: the execution boundary
approval: when Merlion is allowed to widen that boundary

The main modes are:

read-only: no file mutations
workspace-write: writes are limited to the workspace or explicit writable roots
danger-full-access: no filesystem sandbox

Approval policies are:

untrusted: deny escalation
on-failure: ask only after a sandbox or policy failure
on-request: allow interactive escalation requests
never: never ask; stay inside the configured boundary
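
Read as a decision table, the four policies above might be modeled like this. The Trigger and Decision types and the decideEscalation function are hypothetical. One point is an assumption rather than something the README states: the sketch treats on-request as also prompting after a sandbox failure.

```typescript
type ApprovalPolicy = "untrusted" | "on-failure" | "on-request" | "never";

// Hypothetical: why an escalation is being considered.
type Trigger = "sandbox-failure" | "agent-request";
type Decision = "ask-user" | "deny";

function decideEscalation(policy: ApprovalPolicy, trigger: Trigger): Decision {
  switch (policy) {
    case "untrusted":
      return "deny"; // escalation is always refused
    case "never":
      return "deny"; // never ask; stay inside the configured boundary
    case "on-failure":
      // Ask only once something has actually been blocked.
      return trigger === "sandbox-failure" ? "ask-user" : "deny";
    case "on-request":
      // Assumption: interactive requests and failures both prompt.
      return "ask-user";
  }
}
```

Note that untrusted and never produce the same decision in this sketch; the README distinguishes them by intent (refusing escalation versus never prompting), and the real runtime may treat them differently elsewhere.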

This model applies across the runtime:

bash and run_script run inside the sandbox backend
file tools enforce the same policy at the application layer
fetch respects network policy
subagents inherit and can only narrow the parent sandbox
WeChat runs without interactive escalation

Merlion also creates a git checkpoint for writable local sessions and provides merlion undo <session-id> and /undo as a recovery path.

Architecture Entry Points

If you want to read the code rather than just run it, start here:

src/index.ts: CLI bootstrap, config resolution, session wiring
src/runtime/loop.ts: main agent loop
src/runtime/executor.ts: tool execution and model turn handling
src/runtime/query_engine.ts: conversation runtime
src/context/*: orientation, compacting, path guidance
src/tools/*: tool registry and builtin tools
src/transport/wechat/*: WeChat transport

There is also a higher-level technical overview in docs/merlion_runtime_technical_overview.md.

WeChat Mode

Merlion can use WeChat as an agent inbox.

# first time or token refresh
merlion wechat --login

# daily use
merlion wechat

Inside the REPL, you can also trigger login directly:

:wechat
/wechat

Credentials are stored at ~/.config/merlion/wechat.json.

By default, WeChat receives final replies and concise error hints, not internal tool logs. If you want progress updates, set MERLION_WECHAT_PROGRESS=1. For more detailed progress, set MERLION_WECHAT_PROGRESS_VERBOSE=1.
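
One plausible way to read those two environment variables is as a three-level setting. The sketch below assumes that only the value "1" enables a level and that the verbose variable implies progress updates on its own; neither detail is confirmed by the source, and wechatProgressLevel is a hypothetical helper.

```typescript
type ProgressLevel = "off" | "basic" | "verbose";

// Takes the environment as a parameter so it can be tested without
// touching the real process environment.
function wechatProgressLevel(env: Record<string, string | undefined>): ProgressLevel {
  // Assumption: verbose takes precedence and implies progress updates.
  if (env["MERLION_WECHAT_PROGRESS_VERBOSE"] === "1") return "verbose";
  if (env["MERLION_WECHAT_PROGRESS"] === "1") return "basic";
  return "off"; // default: final replies and concise error hints only
}
```

In a Node.js process this would be called as wechatProgressLevel(process.env).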

Interactive terminal approvals are not available in WeChat mode. WeChat sessions run with --approval never and default to --sandbox workspace-write, so the agent can edit files in the current workspace but cannot widen permissions mid-session. To start with a different boundary, pass sandbox flags explicitly when launching WeChat mode, for example:

# cautious
merlion wechat --sandbox read-only

# trusted local automation
merlion wechat --sandbox danger-full-access

What Merlion Is Not

Not a product-comparison project; it is a runtime to read, run, and extend
Not trying to reproduce every workflow and integration from broader agent tools
Not a stable SDK or platform layer yet
Not optimized for non-technical onboarding first
Not interested in hiding architectural tradeoffs behind a black box

Merlion is a small, opinionated runtime meant to stay understandable while still covering the essential shape of a real coding agent.
