From b0bf9f3fd2bf6037e18e918dce43f7e51805ae9a Mon Sep 17 00:00:00 2001
From: Sergey Enin <sergeyenin@gmail.com>
Date: Wed, 3 Jun 2026 09:06:18 +0200
Subject: [PATCH] docs: publish threat model & trust boundaries (#118)

Add a STRIDE-style threat model so a security reviewer can map Talon's
boundaries without contacting us:

- docs/reference/threat-model.md: data-flow diagram of the gateway path,
  six trust boundaries (caller, provider, MCP tool, host/process, admin
  plane, at-rest stores), an assets table, per-boundary controls, a STRIDE
  threats/mitigations table, key-management assumptions (signing/secrets/
  admin keys: location, set-explicitly-in-prod, rotation, blast radius),
  and the integrity-not-correctness statement for the HMAC signature.

Cross-linked from SECURITY.md (expands the existing snapshot), the docs
index (Reference + Proof Pack), and LIMITATIONS.md (no longer
"forthcoming"). Claims are grounded in internal/{gateway,evidence,secrets,
policy,classifier}.

Closes #118
---
 LIMITATIONS.md                 |   3 +-
 SECURITY.md                    |   2 +
 docs/README.md                 |   2 +
 docs/reference/threat-model.md | 163 +++++++++++++++++++++++++++++++++
 4 files changed, 169 insertions(+), 1 deletion(-)
 create mode 100644 docs/reference/threat-model.md

diff --git a/LIMITATIONS.md b/LIMITATIONS.md
index 648f2104..4207bfe6 100644
--- a/LIMITATIONS.md
+++ b/LIMITATIONS.md
@@ -52,4 +52,5 @@ A valid signature proves that this evidence record was signed with the deploymen
 - [SECURITY.md](SECURITY.md) — security boundaries and threat-model snapshot
 - [Evidence store](docs/explanation/evidence-store.md) — how records are created, signed, and verified
 - [Evidence integrity specification](docs/reference/evidence-integrity-spec.md) — byte-exact fields, serialization, signing, and independent verification
-- A formal threat model and reproducible benchmarks are forthcoming.
+- [Threat model](docs/reference/threat-model.md) — attack surface, trust boundaries, and key-management assumptions
+- Reproducible benchmarks are forthcoming.
diff --git a/SECURITY.md b/SECURITY.md
index 518ff25c..da35a326 100644
--- a/SECURITY.md
+++ b/SECURITY.md
@@ -35,6 +35,8 @@ Talon helps enforce and evidence policy decisions in the request path. It does n
 - Does not prevent: compromised upstream provider, stolen operator keys, or host-level compromise outside Talon.
 - Operator responsibility: secure deployment, rotate keys, protect evidence/signing secrets, and monitor incidents.
 
+For the full STRIDE-style threat model — data-flow diagram, trust boundaries, threats and mitigations, and key-management assumptions — see [docs/reference/threat-model.md](docs/reference/threat-model.md).
+
 ## Security Architecture
 
 - **Secrets:** AES-256-GCM encrypted at rest, per-agent/tenant ACL, every access logged
diff --git a/docs/README.md b/docs/README.md
index 054ef617..13235d8a 100644
--- a/docs/README.md
+++ b/docs/README.md
@@ -68,6 +68,7 @@ Choose the shortest path for your situation:
 |-----|-------------|
 | [Configuration and environment](reference/configuration.md) | Environment variables, crypto keys, and config reference. |
 | [Evidence integrity specification](reference/evidence-integrity-spec.md) | Normative signed-record spec: fields, canonical serialization, HMAC-SHA256 signing, and the independent verification procedure. |
+| [Threat model](reference/threat-model.md) | STRIDE-style attack surface, trust boundaries, threats/mitigations, and key-management assumptions for the gateway path. |
 | [Authentication and key scopes](reference/authentication-and-key-scopes.md) | Which keys authenticate which endpoint families (gateway vs control plane vs dashboard). |
 | [Gateway dashboard](reference/gateway-dashboard.md) | Dashboard endpoints, metrics API schema, snapshot fields, and authentication. |
 | [Operational control plane](reference/operational-control-plane.md) | Run management (list/kill/pause/resume), tenant lockdown, runtime overrides, tool approval gates. |
@@ -96,6 +97,7 @@ Choose the shortest path for your situation:
 | [Evidence store](explanation/evidence-store.md) | HMAC integrity model and verification flow. |
 | [Evidence integrity specification](reference/evidence-integrity-spec.md) | Byte-exact spec so a third party can independently verify a record. |
 | [Evidence integrity 5-minute proof](tutorials/evidence-integrity-demo.md) | Fast proof moment for auditors/operators, including offline signed-export verification. |
+| [Threat model](reference/threat-model.md) | Attack surface, trust boundaries, and what the HMAC signature does and does not prove. |
 | [Security policy](../SECURITY.md) | Vulnerability reporting process and security scope. |
 | [Docker Compose demo](../examples/docker-compose/README.md) | Fastest no-key proof loop. |
 
diff --git a/docs/reference/threat-model.md b/docs/reference/threat-model.md
new file mode 100644
index 00000000..ff24115a
--- /dev/null
+++ b/docs/reference/threat-model.md
@@ -0,0 +1,163 @@
+# Threat Model
+
+**Status:** stable · **Scope:** the Talon gateway/proxy request path and its evidence,
+secrets, and policy components.
+
+This document lets a security reviewer map Talon's attack surface, trust boundaries, and
+key-management assumptions without contacting the maintainers. It uses a STRIDE-style
+framing. For the boundaries of what Talon claims, see [LIMITATIONS.md](../../LIMITATIONS.md);
+for the signed-record format, see the
+[Evidence integrity specification](evidence-integrity-spec.md).
+
+> **One-line summary.** Talon is a self-hosted network gateway that enforces policy and
+> emits signed evidence on the request path. It reduces and records risk on that path; it
+> does not harden the host, secure upstream providers, or make the operator's compliance
+> determination.
+
+## 1. System overview and data flow
+
+```mermaid
+flowchart LR
+  caller["Caller / SDK app"]
+
+  subgraph host [Operator-controlled host]
+    subgraph talon [Talon single binary]
+      gw["Gateway: auth, rate limit, PII scan, OPA policy, tool filter, redact/block"]
+      vault["Secrets vault (AES-256-GCM, SQLite)"]
+      evid["Evidence store (HMAC-signed, SQLite)"]
+      admin["Admin / dashboard plane"]
+    end
+  end
+
+  provider["Upstream LLM provider"]
+  tool["MCP tool / server"]
+  operator["Operator / auditor"]
+
+  caller -->|"Bearer key, request body"| gw
+  gw -->|"reads upstream key"| vault
+  gw -->|"forwarded request"| provider
+  gw -->|"filtered tool calls"| tool
+  gw -->|"writes signed record"| evid
+  operator -->|"X-Talon-Admin-Key"| admin
+  admin -->|"reads"| evid
+```
+
+Trust-boundary crossings (each is a point where data changes trust domain):
+
+1. **Caller → Gateway** — the caller is untrusted until authenticated by API key.
+2. **Gateway → Upstream provider** — a separate trust domain, possibly outside the EU.
+3. **Gateway → MCP tool/server** — tool execution happens outside Talon.
+4. **Operator → Host/Process** — everything inside the host is operator-controlled.
+5. **Admin/auditor → Admin plane** — privileged read/management access.
+6. **At rest** — the SQLite vault and evidence database files on the host disk.
+
+## 2. Assets
+
+| Asset | Why it matters | Primary protection |
+|-------|----------------|--------------------|
+| Upstream provider API keys | Spend and data-egress authority | AES-256-GCM vault, per-agent/tenant ACL, audited access ([`internal/secrets/vault.go`](../../internal/secrets/vault.go)) |
+| Evidence records | The audit/compliance value proposition | HMAC-SHA256 signature ([`internal/evidence/signature.go`](../../internal/evidence/signature.go)) |
+| PII in request/response | GDPR exposure | Pre-call input scan + output scan, redact/block ([`internal/classifier/pii.go`](../../internal/classifier/pii.go)) |
+| Policy configuration | Defines allow/deny, budgets, routing | Operator-controlled config; embedded OPA ([`internal/policy/engine.go`](../../internal/policy/engine.go)) |
+| Signing & encryption keys | Root of evidence integrity and vault confidentiality | Operator-managed secrets (see [§5](#5-key-management)) |
+
+## 3. Trust boundaries and controls
+
+### 3.1 Caller ↔ Gateway
+
+- Callers authenticate with `Authorization: Bearer <key>`; comparison is constant-time
+  (`crypto/subtle.ConstantTimeCompare`) to resist timing attacks.
+- The caller identity selects tenant, team, and policy overrides; rate limits apply
+  per-caller (token bucket).
+- Request bodies are **untrusted input**: parsed defensively, scanned for PII, and (for
+  attachments) sandboxed and scanned for prompt-injection patterns.
+
+### 3.2 Gateway ↔ Upstream provider
+
+- The upstream provider is a distinct trust domain. Talon forwards the request using the
+  real key resolved from the vault; the caller never needs the upstream key.
+- Sovereignty/routing policy can **deny** a non-EU destination with signed evidence; in
+  the proxy path this is allow/deny, not a silent reroute (see [LIMITATIONS.md](../../LIMITATIONS.md)).
+- A compromised or misbehaving provider is **out of scope** — Talon cannot vouch for what
+  happens after the request leaves it.
+
+### 3.3 Gateway ↔ MCP tool / server
+
+- Tool governance today is **request-body filtering**: forbidden tools are stripped before
+  forwarding ([`internal/gateway/tool_filter.go`](../../internal/gateway/tool_filter.go)).
+- Talon does **not** intercept tool *execution* in another runtime, and does not prevent a
+  tool from being invoked on a path that does not pass through Talon.
+
+### 3.4 Operator ↔ Host / Process
+
+- Talon is a single binary applying process-level controls. It is **not** an OS/kernel
+  sandbox. Host compromise, container escape, and lateral movement are out of scope and
+  remain the operator's responsibility.
+
+### 3.5 At-rest stores and admin plane
+
+- Secrets are encrypted at rest (AES-256-GCM); evidence is signed (HMAC-SHA256). The
+  SQLite files themselves rely on host filesystem permissions.
+- Admin/control-plane and dashboard/metrics endpoints are gated by `TALON_ADMIN_KEY`
+  (`X-Talon-Admin-Key`). If unset, those endpoints are unrestricted — set it in production.
+
+## 4. STRIDE threats and mitigations
+
+| Threat | Example | Mitigation | Residual risk / out of scope |
+|--------|---------|------------|------------------------------|
+| **Spoofing** | Caller impersonates another tenant | Bearer-key auth, constant-time compare, per-caller identity | Stolen caller key (operator key hygiene) |
+| **Tampering** | Edit a stored evidence row | HMAC-SHA256 over canonical JSON; `talon audit verify` fails on any change | Attacker with the signing key can forge new records |
+| **Repudiation** | "That request never happened" | Evidence-by-default: every decision (incl. denials/failures) is recorded and signed | Records created only for traffic that passes through Talon |
+| **Information disclosure** | PII leaks to provider or logs | Pre-call PII scan + redact/block; output scan; vault encryption; PII redaction in evidence | Regex/heuristic PII detection has imperfect recall |
+| **Denial of service** | Caller floods the gateway | Per-caller + global token-bucket rate limiting; context timeouts | Host/network-level DoS is out of scope |
+| **Elevation of privilege** | Unauthorized secret/admin access | Per-agent/tenant secret ACLs with audit logging; `TALON_ADMIN_KEY` on admin plane | Host compromise or leaked admin key |
+
+## 5. Key management
+
+Talon uses three operator-managed secrets (see
+[Configuration reference](configuration.md)):
+
+| Key | Purpose | Format | Default |
+|-----|---------|--------|---------|
+| `TALON_SIGNING_KEY` | HMAC-SHA256 evidence signing | >= 32 raw bytes or 64+ hex chars | Auto-derived per machine |
+| `TALON_SECRETS_KEY` | AES-256-GCM vault encryption | 32 raw bytes or 64 hex chars | Auto-derived per machine |
+| `TALON_ADMIN_KEY` | Admin/control-plane + dashboard auth | operator-chosen string | unset (endpoints unrestricted) |
+
+Assumptions and guidance:
+
+- **Location.** Keys are read from the environment/configuration of the Talon process.
+  They are never sent to callers or providers, and the signing key never leaves the host.
+- **Set them explicitly in production.** By default the signing and secrets keys are
+  derived per machine; explicit, backed-up keys are required for reproducible verification
+  across machines and for disaster recovery. The signing and secrets keys must differ.
+- **Rotation.** Rotating `TALON_SIGNING_KEY` means new records are signed with the new
+  key; records signed with a previous key verify only under that previous key. Retain
+  prior signing keys (or re-export) to keep historical evidence verifiable. Rotating
+  `TALON_SECRETS_KEY` requires re-encrypting stored secrets.
+- **Blast radius.** A leaked signing key lets an attacker forge or alter records that
+  still verify — evidence integrity depends entirely on its secrecy. A leaked secrets key
+  exposes the vaulted provider credentials. A leaked admin key exposes the control plane.
+
+## 6. What the HMAC signature does and does not prove
+
+A valid signature proves that an evidence record was signed with the deployment's
+configured key and — assuming that key remains protected — has not been modified since
+signing. It does **not** prove that the policy decision, model response, tool result, or
+operator configuration was correct, and it does not attest anything about upstream or
+downstream systems. HMAC is symmetric: anyone holding the key can produce valid
+signatures, so this is integrity under the operator's key custody, not third-party
+non-repudiation.
+
+## 7. Residual risks (operator responsibilities)
+
+- Host and OS hardening, network security, and container isolation.
+- Custody, rotation, and backup of signing/encryption/admin keys.
+- Filesystem access control on the SQLite vault and evidence databases.
+- Trust decisions about upstream providers and external tools.
+- The legal/compliance determination itself (Talon supplies supporting controls and
+  evidence only).
+
+## 8. Reporting
+
+Report suspected vulnerabilities privately per [SECURITY.md](../../SECURITY.md). Do not
+open a public issue for security reports.