agent-rs

A pure-Rust async runtime for shipping LLM agents.

Multi-provider · tool-capable end-to-end · structured permissions · real MCP · zero unsafe.

┌─ user ──┐    ┌─── QueryLoop ───────────────────────────────────────┐
│ prompt  │───▶│ Streaming → ToolDispatch → ToolCollecting → Yield │──▶ Event::*
└─────────┘    │       ↑                                       │     │
               │       └──── auto-compact / hooks / cost ◀─────┘     │
               └─────────────────────────────────────────────────────┘
                  │            │            │            │
                  ▼            ▼            ▼            ▼
              Anthropic    OpenAI-*       Ollama         MCP servers
              (SSE)        (compat)       (local)        (stdio / HTTP)

TL;DR

use agent::prelude::*;
use futures::StreamExt;
use std::sync::Arc;

// Runs inside an async fn that returns a Result
// (e.g. #[tokio::main] async fn main() -> anyhow::Result<()>).
let provider = Arc::new(AnthropicProvider::new(std::env::var("ANTHROPIC_API_KEY")?));
let engine = QueryEngine::new(provider, "claude-opus-4-7").with_system("Be concise.");

let mut stream = engine.run("Summarize Rust's borrow checker in two lines.", AbortController::new()).await?;
while let Some(event) = stream.next().await {
    if let Event::TextDelta { delta } = event? {
        print!("{delta}");
    }
}

That's a complete agent — provider streaming, tool dispatch, hooks, permissions, auto-compaction, USD cost tracking — all wired up. Drop in Files API attachments, MCP servers, or the bundled coding tool pack with one extra line each.


Why agent-rs?

  • 🦀 Rust-native, library-only. No tokio::main hijack, no global state, no panic! on bad input. #![forbid(unsafe_code)] in every crate. Drop it into a CLI, an IDE plugin, a desktop app, or a server — the runtime doesn't care.
  • 🔌 Three providers, one event vocabulary. Anthropic Messages (hand-rolled SSE — full prompt-cache + extended-thinking betas, no SDK dep), async-openai 0.36 (DeepSeek / Moonshot / Groq / OpenRouter / LM Studio), and local Ollama. Stream Event::TextDelta, ToolUse, Usage, Result — same shape from every backend.
  • 🛠 Tool-capable end-to-end. Define a tool and register it; the runtime wires the JSON Schema into the request body, dispatches ToolUse events to your code, and feeds results back. Multi-turn loop with a phase machine. Receipt-order concurrent execution. Permissions and cost tracking are wired through, not bolted on.
  • 🛡 Structured permissions that fail safe. A 7-step decision chain (deny / ask / callback / bypass / allow / default-ask / dont_ask), composable PermissionMatcher rules over tool input shapes (JSON-pointer fields, glob/prefix/regex patterns, AnyOf / AllOf / Not), and a 4-level SafetyClass lattice where Unknown ≡ Destructive for gating — so unclassified tools never slip through.
  • 🔗 MCP that actually plugs in. Full Model Context Protocol client lifecycle: stdio child processes, streamable HTTP, OAuth 2.0 + PKCE, server-initiated elicitation, channel permissions, stale-handle reconnect repair. Tool calls don't serialize on a mutex. close() doesn't deadlock during slow RPCs.
  • 💸 Cost accounting in nanodollar precision. Event::Usage flows into a CostTracker with a model-price catalog (Anthropic + GPT defaults, BYO entries trivially). u128 integer accumulator — no f64 drift across long sessions.
  • 📎 Files API for big attachments. FilesClient trait + AnthropicFilesClient. Smart helpers auto-route between inline base64 and uploaded file_id references based on size. Beta header gets added automatically when any block (including those nested in tool results) carries a file_id.
  • ♻️ Reactive auto-compaction. Token estimator + LLM-driven <analysis> / <summary> summarization, microcompact, session memory, post-cleanup file restoration. Long sessions stay inside the context window without losing critical state.
  • 📦 Optional batteries. Companion agent-tools-code crate ships generic FileRead/Write/Edit, Grep/Glob (gitignore-aware via ignore), Bash, WebFetch, TodoWrite, NotebookEdit (Jupyter .ipynb cells), and ToolSearch for deferred-tool discovery. Every tool declares its SafetyClass; a WorkspacePolicy enforces path containment + size caps + symlink rules. Pull only the features you want.
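The fail-safe rule in the permissions bullet — Unknown ≡ Destructive for gating — is worth seeing as code. This is an illustrative sketch with assumed names, not the crate's actual types:

```rust
// Hypothetical sketch of the 4-level SafetyClass lattice described above.
// Names and variants are illustrative; the real agent-rs API may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum SafetyClass {
    ReadOnly,
    Mutating,
    Destructive,
    Unknown, // unclassified tools
}

impl SafetyClass {
    /// Fail-safe gating: Unknown is treated exactly like Destructive,
    /// so an unclassified tool can never bypass the permission prompt.
    fn requires_approval(self) -> bool {
        matches!(self, SafetyClass::Destructive | SafetyClass::Unknown)
    }
}
```

The point of the lattice is that the *absence* of a classification is itself a signal: a tool that never declared its class is gated as if it were the most dangerous kind.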

Architecture

flowchart TB
    Host[Host application]
    subgraph Runtime[agent crate]
        QL[query - phase machine]
        Prov[provider - Anthropic / OpenAI / Ollama]
        Tool[tool trait + registry]
        Perm[permission - 7-step chain + matchers]
        Hook[hook - 27 typed events]
        Comp[compact - reactive auto-compact]
        Cost[cost - USD accounting]
        Sess[session - JSONL persistence]
        Atch[attachments - Files API]
        MCP[mcp - rmcp connector]
    end
    subgraph Optional[agent-tools-code]
        FS[FileRead / Write / Edit / ...]
        Search[Grep / Glob / ToolSearch]
        Shell[Bash]
        Web[WebFetch]
        Todo[TodoWrite]
    end
    Models[Models - Anthropic / OpenAI / Ollama / MCP]

    Host --> QL
    QL --> Prov
    QL --> Tool
    QL --> Hook
    QL --> Perm
    QL --> Comp
    QL --> Cost
    QL --> Sess
    Perm --> Tool
    Atch --> Tool
    Tool --> Optional
    Prov --> Models
    MCP --> Models

Streaming Events are the universal language: every provider emits the same Event taxonomy, so swap providers without touching tool code.
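A hedged sketch of what that shared vocabulary might look like as a Rust enum — the variant fields below are guessed from the snippets in this README, not taken from the crate:

```rust
// Illustrative subset of the Event taxonomy this README lists
// (TextDelta / ToolUse / Usage / Result). Field shapes are assumptions.
enum Event {
    TextDelta { delta: String },
    ToolUse { name: String },
    Usage { input_tokens: u64, output_tokens: u64 },
    Result { stop_reason: String },
}

/// Provider-agnostic consumer: this match works unchanged whether the
/// stream came from Anthropic, an OpenAI-compatible backend, or Ollama.
fn describe(event: &Event) -> String {
    match event {
        Event::TextDelta { delta } => format!("text: {delta}"),
        Event::ToolUse { name } => format!("tool call: {name}"),
        Event::Usage { input_tokens, output_tokens } => {
            format!("usage: {input_tokens} in / {output_tokens} out")
        }
        Event::Result { stop_reason } => format!("done: {stop_reason}"),
    }
}
```

Because every backend normalizes into this one enum, a consumer like `describe` never needs provider-specific branches.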


Install

Two crates, both versioned together. Pull only what you need.

[dependencies]
# Runtime — always
agent = { git = "https://github.com/ZSeven-W/agent-rs", default-features = false, features = ["anthropic", "session-jsonl"] }

# Optional: ready-made coding tool pack (FileRead/Write/Edit, Grep/Glob, Bash, WebFetch, TodoWrite, ToolSearch)
agent-tools-code = { git = "https://github.com/ZSeven-W/agent-rs", default-features = false, features = ["fs", "search"] }

agent features

| Flag | Pulls in | Notes |
| --- | --- | --- |
| anthropic (default) | reqwest + eventsource-stream | Hand-rolled Anthropic SSE — no SDK dep. |
| openai | async-openai 0.36 | OpenAI-compatible providers. |
| ollama | ollama-rs 0.3 | Local models. |
| mcp | rmcp 1.5 | MCP client + production stdio/HTTP connector + OAuth/PKCE. |
| session-jsonl | fs4 | JSONL persistence with file lock. |
| swarm | fs4 + notify | Sub-agents, mailbox, teams. |
| tiktoken | tiktoken-rs | Real BPE token counts (cl100k / o200k / p50k / r50k). |
| full | all of the above | |

agent-tools-code features

| Flag | Pulls in | Tools |
| --- | --- | --- |
| fs (default) | (none) | FileRead / Write / Edit / ListDir / Mkdir / Move / Remove |
| search (default) | regex + ignore | Grep · Glob (gitignore-aware) |
| shell | shell-words | Bash (timeout, abort, output cap) |
| bash-async | (none) | BashRun + BashOutput + KillShell (background shells, ring-buffer poll) |
| web | reqwest + futures | WebFetch (HTML→text, size cap) |
| web-search | web | WebSearch (pluggable backend, ships Tavily) |
| task | futures | Task (spawn a child QueryLoop) |
| todo | (none) | TodoWrite (in-memory shared state) |
| notebook | (none) | NotebookEdit (Jupyter .ipynb cell-level edits) |
| all | all of the above | |

ToolSearch is always-on (no feature flag) and lets you expose 50+ MCP tools without flooding the model's tool list — it picks them up via select:Name1,Name2 or keyword search.


Examples

Runnable examples live under each crate's examples/ directory:

| Example | Crate | What it shows |
| --- | --- | --- |
| anthropic_basic | agent | Minimal provider + QueryLoop + stream — the README TL;DR as a real binary. |
| with_tools | agent | Wires the bundled coding tool pack into the loop and asks the model to grep + read the workspace. |
| notebook_edit | agent-tools-code | Calls NotebookEditTool directly (no LLM) to edit a synthesized .ipynb. |
| web_search_tavily | agent-tools-code | Tavily Search via WebSearchTool. Needs TAVILY_API_KEY. |

ANTHROPIC_API_KEY=sk-... cargo run --example anthropic_basic --features anthropic -p agent
ANTHROPIC_API_KEY=sk-... cargo run --example with_tools --features anthropic -p agent
cargo run --example notebook_edit --features notebook -p agent-tools-code
TAVILY_API_KEY=tv-... cargo run --example web_search_tavily --features web-search -p agent-tools-code

Quickstart with bundled tools

use agent::prelude::*;
use agent_tools_code::{register_default, WorkspacePolicy};
use futures::StreamExt;
use std::sync::Arc;

// Runs inside an async fn that returns a Result.
let policy = WorkspacePolicy::new(std::env::current_dir()?)?.into_arc();
let mut tools = ToolRegistry::new();
register_default(&mut tools, policy);   // FileRead, Write, Edit, ListDir,
                                        // Mkdir, Move, Remove, Grep, Glob

let provider = Arc::new(AnthropicProvider::new(std::env::var("ANTHROPIC_API_KEY")?));
let qloop = QueryLoop::builder(provider, "claude-opus-4-7")
    .tools(Arc::new(tools))
    .build();

let mut stream = qloop.run("List the .rs files in src/, then summarize main.rs.", AbortController::new()).await?;
while let Some(event) = stream.next().await {
    match event? {
        Event::TextDelta { delta } => print!("{delta}"),
        Event::ToolUse { name, .. } => eprintln!("\n→ calling {name}"),
        _ => {}
    }
}

That's the full picture: registry → provider → loop. The runtime handles tool dispatch, permission gating, hooks, cost tracking, and auto-compaction without you wiring anything else.
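The registry → dispatch shape can be miniaturized for intuition. Everything below is a hypothetical, synchronous stand-in — the real agent-rs Tool trait is async and JSON-Schema-driven, so treat this only as the outline of the flow:

```rust
use std::collections::HashMap;

// Hypothetical mini version of the tool flow this README describes:
// trait → registry → name-based dispatch on a ToolUse event.
trait Tool {
    fn name(&self) -> &'static str;
    fn call(&self, input: &str) -> String;
}

struct Echo;
impl Tool for Echo {
    fn name(&self) -> &'static str { "Echo" }
    fn call(&self, input: &str) -> String { input.to_string() }
}

struct Registry {
    tools: HashMap<&'static str, Box<dyn Tool>>,
}

impl Registry {
    fn new() -> Self { Self { tools: HashMap::new() } }

    fn register(&mut self, t: Box<dyn Tool>) {
        self.tools.insert(t.name(), t);
    }

    /// What the runtime does on a ToolUse event: look up by name, dispatch,
    /// and feed the result back into the conversation. Unknown names yield
    /// None rather than panicking.
    fn dispatch(&self, name: &str, input: &str) -> Option<String> {
        self.tools.get(name).map(|t| t.call(input))
    }
}
```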


Module surface

agent crate — runtime (15+ modules)

Foundation

| Module | Purpose |
| --- | --- |
| provider/ | Multi-provider LLM client. Tool definitions wired into request bodies; capability flags + streaming Event vocabulary. |
| query/ | QueryLoop multi-turn phase machine. Reactive auto-compaction wired in. |
| tool/ | Tool trait, ToolRegistry, SafetyClass lattice. Receipt-order concurrent execution via ToolExecutor. |
| permission/ | 7-step chain + structured PermissionMatcher (Always / Field / ExactJson / AnyOf / AllOf / Not) + StringPattern. External-queue async approval. |
| hook/ | 27 typed HookEvent variants. |
| message/ | DAG-aware MessageStore. ContentBlock::Document for PDFs; ImageSource::File for Files-API references. |
| stream/ | Event taxonomy: TextDelta / Thinking / ToolUse / ToolResult / Result / Usage / Error / Notice. |
| session/ | JSONL persistence (schema v1) with atomic-rename + file lock. |
| swarm/ | Sub-agents / teams. File-locked mailbox, in-process / tmux / iTerm2 backends. |
| compact/ | Reactive auto-compaction. LLM-driven summarization, partial directions, microcompact, session memory. |
| context/ | Sliding-window trim. |
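The 7-step chain in permission/ can be sketched as a first-match resolver. The ordering below is a guess from the step names alone (deny / ask / callback / bypass / allow / default-ask / dont_ask); the real module is richer, so read this as a mental model, not the implementation:

```rust
// Hypothetical first-match sketch of the 7-step permission chain.
// Names and exact precedence are assumptions based on this README.
#[derive(Debug, PartialEq)]
enum Decision { Deny, Ask, Allow }

struct Rules {
    deny: bool,
    ask: bool,
    callback: Option<fn() -> Decision>,
    bypass: bool,
    allow: bool,
    dont_ask: bool,
}

fn decide(r: &Rules) -> Decision {
    if r.deny { return Decision::Deny; }          // 1. explicit deny wins
    if r.ask { return Decision::Ask; }            // 2. explicit ask
    if let Some(cb) = r.callback { return cb(); } // 3. host callback
    if r.bypass { return Decision::Allow; }       // 4. bypass mode
    if r.allow { return Decision::Allow; }        // 5. explicit allow
    if !r.dont_ask { return Decision::Ask; }      // 6. default is to ask
    Decision::Allow                               // 7. dont_ask opt-out
}
```

Note the fail-safe default at step 6: when no rule matched and the host did not opt out, the answer is Ask, never Allow.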

Service layer

| Module | Purpose |
| --- | --- |
| api/ | Retry with decorrelated jitter, error classification, prompt-cache-break detection, secret redaction. |
| cost/ | Model-price-aware USD accounting. u128 nanodollars — no f64 drift. |
| attachments/ | FilesClient + AnthropicFilesClient, smart size-aware routing. |
| tokenizer/ | Pluggable trait. Real tiktoken plugs in via the trait. |
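The nanodollar scheme in cost/ is easy to illustrate. The struct name echoes the README's CostTracker, but the method names and the price figure below are made up; only the integer-accumulator idea comes from this document:

```rust
// Sketch of integer-precision USD accounting: prices and totals in
// nanodollars (1 USD = 1_000_000_000 nanodollars), accumulated in a
// u128 so long sessions never pick up f64 rounding drift.
struct CostTracker {
    nanodollars: u128,
}

impl CostTracker {
    fn new() -> Self {
        Self { nanodollars: 0 }
    }

    /// `price_nanos_per_mtok`: nanodollars per million tokens
    /// (a hypothetical catalog entry, not a real model price).
    fn add(&mut self, tokens: u64, price_nanos_per_mtok: u128) {
        self.nanodollars += (tokens as u128) * price_nanos_per_mtok / 1_000_000;
    }

    /// Convert to floating point only at display time.
    fn usd(&self) -> f64 {
        self.nanodollars as f64 / 1e9
    }
}
```

Keeping the running total as an integer and converting to f64 only for display is what makes the "no drift across long sessions" claim hold.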

Discovery + extensibility

| Module | Purpose |
| --- | --- |
| mcp/ (feature mcp) | Full MCP client + production RmcpConnector. |
| memdir/ | MEMORY.md directory loader with frontmatter + relevance scoring. |
| skills/ · plugins/ · state/ · bootstrap/ · context_analysis/ · tasks/ · memory_extract/ · remote/ | See crates/agent/src/. |
agent-tools-code crate — optional coding tool pack

| Tool | Class | Feature |
| --- | --- | --- |
| FileReadTool | ReadOnly | fs |
| FileWriteTool | Mutating | fs |
| FileEditTool | Mutating | fs |
| ListDirTool | ReadOnly | fs |
| MkdirTool | Mutating | fs |
| MoveTool | Mutating | fs |
| RemoveTool | Destructive | fs |
| GrepTool | ReadOnly | search |
| GlobTool | ReadOnly | search |
| BashTool | Mutating | shell |
| BashRunTool | Mutating | bash-async |
| BashOutputTool | ReadOnly | bash-async |
| KillShellTool | Mutating | bash-async |
| WebFetchTool | ReadOnly | web |
| WebSearchTool | ReadOnly | web-search |
| TaskTool | Mutating | task |
| TodoWriteTool | Mutating | todo |
| NotebookEditTool | Mutating | notebook |
| ToolSearchTool | ReadOnly | (always) |

A shared WorkspacePolicy enforces path containment, file-size caps, and symlink rules. register_default(registry, policy) bulk-registers every enabled tool.
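As a sketch of what path containment means here — hypothetical code, not the crate's implementation, and deliberately omitting the symlink and size-cap checks the README also mentions:

```rust
use std::path::{Component, Path, PathBuf};

// Hedged sketch of a containment check a WorkspacePolicy might perform:
// normalize the candidate path lexically, then verify it stays under the
// workspace root. Real code would also resolve symlinks before any I/O.
fn is_contained(root: &Path, candidate: &Path) -> bool {
    // Resolve `..` and `.` components lexically (no filesystem access).
    let mut resolved = PathBuf::new();
    for comp in root.join(candidate).components() {
        match comp {
            Component::ParentDir => { resolved.pop(); }
            Component::CurDir => {}
            other => resolved.push(other),
        }
    }
    resolved.starts_with(root)
}
```

A `..`-laden input like `src/../../etc` normalizes to a path outside the root and is rejected, which is the property the policy enforces before any tool touches the disk.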


Design principles

Library-only. No global state, no tokio::main, no panic! on bad input — every error path is a typed AgentError.

Provider-agnostic at the runtime layer. Concrete tools live outside the agent crate. The runtime defines the trait; companions ship implementations.

Streaming first. Every provider is a streaming source. Multi-turn / tool dispatch / compaction are coordinated through one Event vocabulary, no polling.

Cancellation everywhere. Every async surface honors an AbortController — including the tokio::task::spawn_blocking workers used by Grep / Glob.

No unsafe. #![forbid(unsafe_code)] in both crates.

Defensive against the model. Permissions fail safe (Unknown ≡ Destructive for gating). Tool schemas validated before reaching the wire. Path operations canonicalize before any I/O. Idempotent writes detect no-ops.

Cost-aware. Tool schemas, prompt cache, and token usage feed an integer-precision USD accumulator. Long-running sessions don't drift.


Testing

cargo test --workspace --all-features
# 844 unit · 13 integration · 5 doc · 4 ignored (real-API gates)

cargo clippy --workspace --all-targets --all-features -- -D warnings
cargo fmt --all -- --check
cargo deny --all-features check

The 4 #[ignore]-gated tests hit real APIs (Anthropic / OpenAI / Ollama / Anthropic Files) when their environment variables are set. CI runs the full suite with mocks; real-API runs are manual.


Status

This is a pre-release project — every change lives under Unreleased in CHANGELOG.md until 0.1.0 ships. The runtime API surface has stabilized and 863 tests guard it; the open work is wiring more tools into the optional companion crate and tagging a release.

See openpencil-docs/agent-rs/notes/2026-05-02-claude-code-non-tui-gaps.md for what's intentionally host-side vs. what's pending.


Contributing

PRs welcome. Two ground rules:

  1. No product-specific imports in the agent crate. Generic concepts only — anything tied to a specific app belongs in a downstream crate.
  2. Adversarial review every change. Open an issue first for anything bigger than a small fix so we can align on direction.

MIT licensed. See LICENSE.

Built with caffeine, codex review, and an unreasonable number of tests.
