sb-runtime

A single binary that runs your AI agent inside an OS-level sandbox and signs a receipt for every decision it makes.

What it does, in plain English

You want to run an AI agent — a coding assistant, an autonomous workflow, a scripted LLM — but you don't fully trust what it'll do. sb-runtime wraps the agent and does three things:

Checks a policy before the agent runs. Is it allowed to invoke python? Can it touch /etc? Can it open a network socket? You write the rules once in Cedar, the open-source policy language AWS created for Amazon Verified Permissions.
Confines the agent at the OS level while it runs. On Linux the agent literally cannot open files outside the allowed list, cannot make most syscalls, cannot open network sockets — not because the agent cooperates, because the kernel refuses. (Backend: Landlock + seccomp. Opt-in for v0.1-alpha; see the platform matrix below.)
Signs a receipt for every decision. Ed25519-signed, hash-chained, verifiable offline by anyone with the public key. When something goes wrong you can prove — not claim — what happened.

You don't modify the agent. You wrap it:

sb exec --policy dev.cedar -- python my_agent.py
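
To make the "check before the agent runs" step concrete, here is a minimal Python sketch of a policy gate. It is not Cedar and not sb-runtime's code: the `Rule` type, the `decide` function, and the action names are hypothetical stand-ins. It borrows only Cedar's documented evaluation semantics, namely that nothing is allowed by default and that a matching forbid overrides any permit.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase


@dataclass(frozen=True)
class Rule:
    effect: str    # "permit" or "forbid"
    action: str    # hypothetical action name, e.g. "exec" or "read"
    pattern: str   # glob matched against the resource path


def decide(rules, action, resource):
    """Return "allow" or "deny" for a single request.

    Deny by default; any matching forbid wins over every permit.
    """
    matching = [r for r in rules
                if r.action == action and fnmatchcase(resource, r.pattern)]
    if any(r.effect == "forbid" for r in matching):
        return "deny"
    if any(r.effect == "permit" for r in matching):
        return "allow"
    return "deny"


# Illustrative rules only; a real policy would be written in Cedar.
RULES = [
    Rule("permit", "exec", "/usr/bin/python*"),
    Rule("permit", "read", "/home/agent/*"),
    Rule("forbid", "read", "/home/agent/.ssh/*"),
]

print(decide(RULES, "exec", "/usr/bin/python3"))         # allow
print(decide(RULES, "read", "/home/agent/.ssh/id_rsa"))  # deny
print(decide(RULES, "read", "/etc/shadow"))              # deny
```

The third call is denied without any rule mentioning /etc: with a default-deny engine, the policy only has to list what the agent may do.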

Who this is for

Security teams running AI coding assistants they didn't write.
Compliance teams who need tamper-evident evidence of agent behaviour.
CI and edge deployments where Docker / k3s / OpenShell is too heavy.
Anyone nervous about letting an LLM run commands on their machine.

How it relates to `protect-mcp`

sb-runtime is the OS sandbox around the agent. protect-mcp is the policy check inside the agent — a hook that sits between the LLM and its tool registry in Claude Code / MCP. They're complementary:

┌─────────────────────────────────────────────────┐
│  sb-runtime   ← OS refuses forbidden syscalls   │
│  ┌───────────────────────────────────────────┐  │
│  │  agent process (Claude Code, Python, …)   │  │
│  │  ┌─────────────────────────────────────┐  │  │
│  │  │  protect-mcp                        │  │  │
│  │  │    ← Cedar decides per tool call,   │  │  │
│  │  │      receipts every decision        │  │  │
│  │  └─────────────────────────────────────┘  │  │
│  └───────────────────────────────────────────┘  │
└─────────────────────────────────────────────────┘

Use protect-mcp alone if you wrote the agent and trust its framework to honour decisions. Use sb-runtime alone if you didn't write the agent and want the OS to contain it regardless. Use both for belt-and-braces.

Status: v0.1.0-alpha.1 — design-partner preview. Honest platform matrix:

Platform	Sandbox	Cedar + receipts
Linux x86_64	Landlock + seccomp — opt-in via `--features linux-sandbox` (see issue #1)	✓
Linux aarch64	Refuses (see issue #1)	✓
macOS	Stub (`--allow-unsandboxed`) — issue #3	✓
Windows	Stub (`--allow-unsandboxed`)	✓

The linux-sandbox cargo feature is off by default in v0.1-alpha while the Landlock/seccomp backend is stabilized. The JCS-canonical, Ed25519-signed, hash-chained receipts (plus Cedar policy evaluation) work on every platform today.

We're actively looking for design-partner input on the AGT provider interface, the Cedar schema for agent actions, and the macOS/Windows backend priorities — see CONTRIBUTING.md or reply to microsoft/agent-governance-toolkit#748.

Quick start

cargo install --path crates/sb-cli    # or: cargo run -p sb-cli --
sb exec \
  --policy examples/basic/policy.cedar \
  --receipts .receipts \
  -- /usr/bin/cat /etc/hosts

sb verify .receipts/
# ✓ 1 receipts verified (2026-04-17T...Z → 2026-04-17T...Z)

The signed receipt format is compatible with @veritasacta/verify — your auditor can verify a chain offline with npx @veritasacta/verify .receipts/ without installing the sb binary.
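
What hash-chaining gives the auditor can be shown in a few lines of Python. This is a sketch under assumptions, not the sb-receipt wire format: the field names (`seq`, `decision`, `command`, `prev`) and the all-zero genesis value are hypothetical, `json.dumps` with sorted keys only approximates JCS (RFC 8785) for simple string and integer values, and the Ed25519 signature over each receipt is left out because the Python standard library has no Ed25519 implementation.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder "previous hash" for the first receipt


def canonical(obj):
    """Deterministic JSON bytes (a stand-in for JCS canonicalization)."""
    return json.dumps(obj, sort_keys=True, separators=(",", ":"),
                      ensure_ascii=False).encode("utf-8")


def digest(receipt):
    """SHA-256 over the canonical bytes of one receipt."""
    return hashlib.sha256(canonical(receipt)).hexdigest()


def append(chain, decision, command):
    """Append a receipt that commits to the hash of its predecessor."""
    prev = digest(chain[-1]) if chain else GENESIS
    chain.append({"seq": len(chain), "decision": decision,
                  "command": command, "prev": prev})


def verify(chain):
    """Return the index of the first broken link, or None if intact."""
    expected = GENESIS
    for i, receipt in enumerate(chain):
        if receipt["prev"] != expected:
            return i
        expected = digest(receipt)
    return None


chain = []
append(chain, "allow", ["/usr/bin/cat", "/etc/hosts"])
append(chain, "deny", ["/usr/bin/curl", "example.com"])
append(chain, "allow", ["/usr/bin/ls"])
print(verify(chain))            # None

chain[1]["decision"] = "allow"  # rewrite a past decision
print(verify(chain))            # 2
```

Rewriting receipt 1 changes its hash, so receipt 2 no longer points at it and the break surfaces at index 2. In this sketch an edit to the final receipt would pass the chain check on its own, which is the gap a per-receipt signature closes.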

Architecture

sb-cli            — the `sb` binary
├── sb-policy     — Cedar policy evaluator
├── sb-sandbox    — OS sandbox (Landlock + seccomp on Linux; macOS/Windows WIP)
└── sb-receipt    — Ed25519-signed, JCS-canonical, hash-chained receipts

Each sub-crate is usable independently. sb-receipt is deliberately minimal (zero I/O, pure crypto) so it can be dropped into other Rust agent frameworks.

Commands


Command	What it does
`sb exec --policy P --sandbox S -- CMD ARGS…`	evaluate policy P, apply sandbox S, emit a signed receipt, then exec CMD
`sb verify DIR`	verify a chain of receipts offline
`sb keys generate`	mint a fresh Ed25519 keypair

--allow-unsandboxed skips the sandbox step (Cedar + receipts still fire). Useful on macOS / Windows until the native backends ship; do not use in production.
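
The order of operations in `sb exec`, and what `--allow-unsandboxed` changes, can be summarised as a small decision function. This is a sketch only: `plan_exec`, its parameters, and the step labels are hypothetical names. Two behaviours are assumptions inferred from this README rather than taken from the sb source: that a denied request still produces a receipt (from "a receipt for every decision"), and that a missing sandbox backend causes a refusal unless the flag is passed.

```python
def plan_exec(decision, backend_available, allow_unsandboxed=False):
    """Return the ordered steps a wrapper would take for one request."""
    steps = ["evaluate-policy"]
    if decision != "allow":
        # Assumed: denials are receipted too, but the command never runs.
        return steps + ["emit-receipt"]
    if backend_available:
        steps.append("apply-sandbox")
    elif not allow_unsandboxed:
        # Assumed: without a backend, refuse rather than run unconfined.
        raise RuntimeError(
            "no sandbox backend; pass --allow-unsandboxed to run anyway")
    return steps + ["emit-receipt", "exec"]


print(plan_exec("allow", backend_available=True))
# ['evaluate-policy', 'apply-sandbox', 'emit-receipt', 'exec']
print(plan_exec("allow", backend_available=False, allow_unsandboxed=True))
# ['evaluate-policy', 'emit-receipt', 'exec']
print(plan_exec("deny", backend_available=True))
# ['evaluate-policy', 'emit-receipt']
```

The point of the second case is that the flag removes only the confinement step; the policy check and the receipt are unchanged.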

Why not just…

…use Docker? Docker is great but heavy for CI, edge, and dev-laptop agents. sb-runtime is 8 MB and starts instantly.
…use OpenShell? OpenShell is the right design, but it expects Docker/OCI/k3s/gateway infrastructure. sb-runtime is the local-first version of the same idea. AGT's agent-os-kernel can talk to either; swap via config.
…use firejail / bubblewrap? Those confine a process with Linux namespaces and seccomp filters, but they don't evaluate Cedar policy before the exec, and they don't emit signed receipts. Combine them with sb-runtime if you want: sb does Cedar + receipts + Landlock + seccomp, and they add namespace-based isolation on top.
…just use Cedar? Cedar decides. It doesn't enforce or audit. sb-runtime is the enforcement and audit layer around it.

Integrating with Microsoft's Agent Governance Toolkit

See examples/agt-integration/ for a Python drop-in shim (SbRuntimeSkill) that replaces openshell_agentmesh.skill.GovernanceSkill field-for-field. Swap via config, no agent code changes required.

Licensing

Apache-2.0. No runtime dependencies on ScopeBlind services; no telemetry. The optional managed tier (hosted receipt archival, team dashboards, compliance exports) is available at scopeblind.com/pricing but the sandbox runs local-only forever with the free binary.

Design-partner program

We're looking for 3–5 engineers to co-design the AGT provider interface, the Cedar schema for agent actions, and the macOS / Windows backend priorities. If you're building in this space (agent governance, policy-as-code, secure-element attestation, transparency-log anchoring), open an issue or reach out. Early partners get direct input on the API surface before v0.1 stabilises.

Related

Agent Governance Toolkit (Microsoft) — decision layer
Cedar (AWS) — policy engine
Sigstore — transparency-log anchoring for receipt chains
Veritas Acta — open protocol for contestable public records
IETF draft-farley-acta-signed-receipts — standardisation of the receipt format

Contributing

Small repo, fast iteration. PRs welcome. See DESIGN.md for the current roadmap.
