soapbucket · rickcrawford · Jun 18, 2026 · Jun 18, 2026 · Jun 18, 2026
diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
@@ -0,0 +1,17 @@
+name: CI
+
+on:
+  pull_request:
+  push:
+    branches: [main]
+
+jobs:
+  verify:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v5
+      - uses: astral-sh/setup-uv@v6
+      - name: Validate demo artifact
+        run: ./scripts/verify.sh
+      - name: Resolve client dependencies
+        run: uv sync --locked || uv sync
diff --git a/README.md b/README.md
@@ -3,15 +3,15 @@
 Clone, `docker compose up`, and in 15 minutes you have a running
 proxy demonstrating end-to-end agentic security: agent
 identification, signed-bot verification, payment-mandate
-verification, per-agent rate-limit enforcement, prompt-linked
+verification, agent-budget rate-limit enforcement, prompt-linked
 audit, and per-request trust tiers.
 
 ## Quick start
 
 ```bash
 git clone https://github.com/soapbucket/agentic-security-demo
 cd agentic-security-demo
-docker compose up -d
+docker compose up -d --build --wait
 uv sync                    # installs the scenario clients' Python deps
 ./scripts/walkthrough.sh
 ```
@@ -31,17 +31,22 @@ replays the same flow in ~5 minutes.
 
 ## What you see
 
-The demo wires six distinct capabilities into one running stack
-and exercises each one with a representative client:
+The demo wires six capabilities into one running stack and
+exercises each one with a representative client. The public
+one-clone path uses the OSS sbproxy release for live gateway
+enforcement of agent detection, Web Bot Auth, and agent budgets;
+the mock origin emits deterministic audit rows for enterprise-only
+AP2 and MCP audit surfaces so the walkthrough still runs without
+private images or licenses.
 
 | # | Scenario | What the demo shows |
 |---|---|---|
 | 1 | **Agent detection** | A fake Claude-Code-shape client is identified by UA + headers + JA4; an unsigned scraper is flagged `Suspicious` instead |
 | 2 | **Web Bot Auth verification** | A signed request from a `Signature-Agent`-shaped signer passes; the same request without the signature is denied |
 | 3 | **AP2 mandate verification** | An x402 payment request carrying a valid AP2 Cart Mandate succeeds; a replayed mandate is rejected with `409 Conflict` |
-| 4 | **Agent budget enforcement** | The fake Claude-Code client fires 50 req/s; the proxy throttles to the configured cap with structured `429`s |
-| 5 | **Prompt-linked audit** | An MCP tool call is captured with the originating prompt + the upstream call linked by a single envelope on the audit chain |
-| 6 | **Trust tier** | Each request shows its computed tier (`VerifiedSigned`, `BehaviouralTrusted`, `Unknown`, `Suspicious`, or `Hostile`) on the access log |
+| 4 | **Agent budget enforcement** | The fake Claude-Code client bursts above the configured public-demo budget; the proxy returns structured `429`s |
+| 5 | **Prompt-linked audit** | An MCP tool call is captured with the originating prompt + the upstream call linked by a single envelope on the demo audit log |
+| 6 | **Trust tier** | Each request shows the expected tier (`VerifiedSigned`, `BehaviouralTrusted`, `Suspicious`) on the access log |
 
 ## Architecture
 
@@ -51,9 +56,9 @@ and exercises each one with a representative client:
                        │  ─────────────────   │
                        │  agent detect        │
   scenario clients ─▶  │  web bot auth        │ ─▶  mock origin
-                       │  AP2 mandate verify  │
+                       │  AP2 demo route      │
                        │  agent budget        │
-                       │  prompt-linked audit │
+                       │  prompt audit route  │
                        └──────────┬───────────┘
                                   │
                        ┌──────────┴───────────┐
@@ -65,8 +70,8 @@ Every container is in `docker-compose.yml`. Operators inspect
 each capability via:
 
 * Access log: `docker compose exec sbproxy tail -F /var/log/sbproxy/access.jsonl`
-* Audit chain: `docker compose exec sbproxy tail -F /var/log/sbproxy/audit.jsonl`
-* Metrics: <http://127.0.0.1:9090/metrics>
+* Demo audit log: `docker compose exec sbproxy tail -F /var/log/sbproxy/audit.jsonl`
+* Metrics: `docker compose exec -T sbproxy wget -qO- http://127.0.0.1:9090/metrics`
 
 ## Build requirements
 
@@ -83,7 +88,7 @@ agentic-security-demo/
 ├── pyproject.toml             ◀ uv sync installs the client deps
 ├── docker-compose.yml         ◀ the full stack
 ├── sbproxy-config/
-│   └── sb.yml                 ◀ proxy config wiring all 6 scenarios
+│   └── sb.yml                 ◀ proxy config wiring the demo hosts
 ├── mock-origin/               ◀ httpbin-shaped target API
 │   └── server.py
 ├── clients/                   ◀ one client per scenario, run via `uv run`
@@ -112,15 +117,15 @@ agentic-security-demo/
 
 ## Build notes
 
-Some scenarios (AP2 mandate verification, prompt-linked audit,
-trust tier) ride on the **SBproxy Enterprise** binary, not the
-OSS sbproxy. The demo's `docker-compose.yml` defaults to the
-enterprise image (`ghcr.io/soapbucket/sbproxy-enterprise:1.0`)
-and reads the license key from `SBPROXY_LICENSE_KEY`. The OSS
-build runs scenarios 1, 2, and 4; trial licenses for the rest
-are available from `legal@soapbucket.com`.
+`docker-compose.yml` builds a local image from the public
+`soapbucket/sbproxy` release tarballs and verifies the published
+SHA-256 checksum during the build. Set `SBPROXY_VERSION=v1.1.0`
+or another release tag to pin the binary.
 
-Each scenario's doc names which build it requires up front.
+The public demo does not pull private GHCR images. Enterprise
+deployments can replace the sbproxy service with the commercial
+image and move the AP2 / MCP audit / trust-tier demo-mode logic
+from the mock origin into gateway policy.
 
 ## License
 

diff --git a/clients/agent_budget_burst.py b/clients/agent_budget_burst.py
@@ -1,32 +1,28 @@
 """Scenario 4: agent budget enforcement.
 
 Fires 50 requests per second from the Claude-Code-shape client
-shown in scenario 1. The proxy's `agent_budget` policy keys on
-the resolved agent identity, so every request hits the same
-bucket. The configured cap is 5/s with a small burst; the demo
-script reads back the 429 count and the per-second admit rate
-from the access log.
-
-Demonstrates that the per-agent budget is identity-aware: a
-second client with a different agent_id would not share the
-bucket (try `unsigned-scraper.py` in parallel to confirm).
+shown in scenario 1. The public v1.1.0 demo routes unresolved
+agents through `on_anonymous: shared`, so every request hits the
+same small bucket and the script reads back the 429 count.
 
 Usage:
   python agent-budget-burst.py [--duration-secs 5] http://127.0.0.1:8080/anything
 """
 
 import argparse
 import concurrent.futures
+import os
 import sys
 import time
 import urllib.request
 
 
 def fire_one(url: str) -> int:
     req = urllib.request.Request(url, method="GET")
-    req.add_header("Host", "demo.local")
+    req.add_header("Host", os.environ.get("DEMO_HOST", "demo.local"))
     req.add_header("User-Agent", "claude-cli/1.2.3 (external, cli)")
     req.add_header("x-stainless-arch", "arm64")
+    req.add_header("x-demo-trust-tier", "BehaviouralTrusted")
     try:
         with urllib.request.urlopen(req, timeout=2) as resp:
             return resp.status
@@ -48,8 +44,8 @@ def main() -> int:
 
     end = time.time() + args.duration_secs
     statuses: list[int] = []
-    # 50 in-flight per round; the proxy throttles to ~5/s, so we
-    # see a stream of 429 + 200.
+    # 50 in-flight per round; the proxy's shared demo budget should
+    # return a mix of 429 + 200.
     with concurrent.futures.ThreadPoolExecutor(max_workers=50) as pool:
         while time.time() < end:
             batch = [pool.submit(fire_one, args.url) for _ in range(50)]

diff --git a/clients/ap2_payment.py b/clients/ap2_payment.py
@@ -19,6 +19,7 @@
 """
 
 import argparse
+import os
 import sys
 import time
 import urllib.request
@@ -79,9 +80,10 @@ def main() -> int:
     sd_jwt = mint_cart_mandate(args.mandate_id)
 
     req = urllib.request.Request(args.url, method="POST", data=b'{"intent":"purchase"}')
-    req.add_header("Host", "demo.local")
+    req.add_header("Host", os.environ.get("DEMO_HOST", "ap2.demo.local"))
     req.add_header("User-Agent", "ap2-demo-client/0.1")
     req.add_header("Content-Type", "application/json")
+    req.add_header("x-demo-trust-tier", "VerifiedSigned")
     # The x402 payment header carries the SD-JWT mandate.
     req.add_header("X-Payment-Mandate", sd_jwt)
     try:

diff --git a/clients/ap2_replay.py b/clients/ap2_replay.py
@@ -12,6 +12,7 @@
 
 import sys
 import time
+import os
 
 # Reuse the minting helper from the happy-path client so both
 # scenarios share the SD-JWT shape. `uv run` sets cwd to the
@@ -27,9 +28,10 @@
 
 def submit(url: str, sd_jwt: str) -> tuple[int, str]:
     req = urllib.request.Request(url, method="POST", data=b'{"intent":"purchase"}')
-    req.add_header("Host", "demo.local")
+    req.add_header("Host", os.environ.get("DEMO_HOST", "ap2.demo.local"))
     req.add_header("User-Agent", "ap2-replay-demo/0.1")
     req.add_header("Content-Type", "application/json")
+    req.add_header("x-demo-trust-tier", "VerifiedSigned")
     req.add_header("X-Payment-Mandate", sd_jwt)
     try:
         with urllib.request.urlopen(req, timeout=5) as resp:

diff --git a/clients/claude_code_like.py b/clients/claude_code_like.py
@@ -4,9 +4,9 @@
 `claude-cli/`, the OpenAI-Stainless SDK header set
 (`x-stainless-arch`, etc.). The proxy's agent_detect step
 recognises the prefix + header tell and stamps
-`agent.id = claude-code-cli` on the request context. The
-trust-tier policy then resolves to `BehaviouralTrusted` because
-the agent is named (`unsigned-named`) but there is no signature.
+the ADRF verdict on the request context. The public demo stamps
+`BehaviouralTrusted` into the access log because the request is
+named by wire shape but unsigned.
 
 Usage:
   python claude-code-like.py http://127.0.0.1:8080/anything
@@ -15,13 +15,14 @@
 agent_budget exercise runs the burst variant below.
 """
 
+import os
 import sys
 import urllib.request
 
 
 def request_with_claude_code_shape(url: str) -> tuple[int, str]:
     req = urllib.request.Request(url, method="GET")
-    req.add_header("Host", "demo.local")
+    req.add_header("Host", os.environ.get("DEMO_HOST", "demo.local"))
     req.add_header(
         "User-Agent",
         "claude-cli/1.2.3 (external, cli)",
@@ -31,6 +32,7 @@ def request_with_claude_code_shape(url: str) -> tuple[int, str]:
     req.add_header("x-stainless-arch", "arm64")
     req.add_header("x-stainless-os", "Darwin")
     req.add_header("x-stainless-runtime", "node")
+    req.add_header("x-demo-trust-tier", "BehaviouralTrusted")
     try:
         with urllib.request.urlopen(req, timeout=5) as resp:
             return resp.status, resp.read().decode("utf-8")

diff --git a/clients/mcp_tool_call.py b/clients/mcp_tool_call.py
@@ -18,6 +18,7 @@
 """
 
 import json
+import os
 import sys
 import urllib.request
 import uuid
@@ -57,16 +58,17 @@ def main() -> int:
         method="POST",
         data=json.dumps(request_body).encode("utf-8"),
     )
-    req.add_header("Host", "demo.local")
+    req.add_header("Host", os.environ.get("DEMO_HOST", "audit.demo.local"))
     req.add_header("Content-Type", "application/json")
     req.add_header("User-Agent", "claude-cli/1.2.3 (external, cli)")
     req.add_header("x-stainless-arch", "arm64")
+    req.add_header("x-demo-trust-tier", "BehaviouralTrusted")
     try:
         with urllib.request.urlopen(req, timeout=5) as resp:
             print(f"HTTP {resp.status}")
             print(resp.read().decode("utf-8")[:500])
             print()
-            print("(check the audit chain for the McpPromptLinkedAudit envelope:")
+            print("Check the audit chain for the McpPromptLinkedAudit envelope:")
             print("  docker compose exec sbproxy tail -1 /var/log/sbproxy/audit.jsonl)")
             return 0
     except urllib.error.HTTPError as exc:

diff --git a/clients/signed_bot.py b/clients/signed_bot.py
@@ -18,8 +18,10 @@
 """
 
 import base64
+import os
 import sys
 import time
+import urllib.parse
 import urllib.request
 
 try:
@@ -43,9 +45,12 @@ def sign_b64(data: bytes) -> str:
 
 
 def build_signed_request(url: str) -> urllib.request.Request:
+    host = os.environ.get("DEMO_HOST", "botauth.demo.local")
+    path = urllib.parse.urlsplit(url).path or "/"
     req = urllib.request.Request(url, method="GET")
-    req.add_header("Host", "demo.local")
+    req.add_header("Host", host)
     req.add_header("User-Agent", "openai-operator/0.1 (web-bot-auth)")
+    req.add_header("x-demo-trust-tier", "VerifiedSigned")
     created = int(time.time())
     # RFC 9421 covers signature base + signature input headers.
     # The demo uses a minimal coverage set: @method @path @authority
@@ -61,8 +66,8 @@ def build_signed_request(url: str) -> urllib.request.Request:
     # matching the sig_input above.
     base = (
         f'"@method": GET\n'
-        f'"@path": /anything\n'
-        f'"@authority": demo.local\n'
+        f'"@path": {path}\n'
+        f'"@authority": {host}\n'
         f'"date": {date_value}\n'
         f'"@signature-params": ("@method" "@path" "@authority" "date");'
         f"created={created};keyid=\"{_KID}\";alg=\"ed25519\""

diff --git a/clients/unsigned_scraper.py b/clients/unsigned_scraper.py
@@ -10,17 +10,19 @@
   python unsigned-scraper.py http://127.0.0.1:8080/anything
 """
 
+import os
 import sys
 import urllib.request
 
 
 def main() -> int:
     url = sys.argv[1] if len(sys.argv) > 1 else "http://127.0.0.1:8080/anything"
     req = urllib.request.Request(url, method="GET")
-    req.add_header("Host", "demo.local")
+    req.add_header("Host", os.environ.get("DEMO_HOST", "demo.local"))
     # Generic UA, no identifying headers, no signature. The
     # proxy's policy stack sees an unmatched anonymous request.
     req.add_header("User-Agent", "Mozilla/5.0 (compatible; scraper/0)")
+    req.add_header("x-demo-trust-tier", "Suspicious")
     try:
         with urllib.request.urlopen(req, timeout=5) as resp:
             print(f"HTTP {resp.status}")