talon serve --proxy-quickstart enables a dev/local OpenAI-compatible host-root proxy.
Supported host-root endpoints:
POST /v1/chat/completionsPOST /v1/responses
Unsupported in quickstart host-root mode:
POST /v1/embeddingsGET /v1/responses/{id}DELETE /v1/responses/{id}
Unsupported paths return 404 with a partial-compatibility message.
| Flag | Meaning |
|---|---|
--proxy-quickstart |
Enable quickstart mode. |
--unsafe-listen |
Allow non-loopback bind with quickstart mode. |
--host |
Host/IP for bind. Loopback only unless --unsafe-listen. |
--port |
Listen port. |
| Variable | Meaning |
|---|---|
OPENAI_API_KEY |
Upstream fallback when caller bearer is absent. |
TALON_QUICKSTART_OPENAI_BASE_URL |
Upstream OpenAI-compatible base URL. |
TALON_QUICKSTART_MODE |
Set to shadow to opt into shadow mode; any other value uses default enforce. |
TALON_QUICKSTART_ALLOW_ALL_MODELS |
1/true clears quickstart model allowlist. |
Quickstart uses upstream BYOK as a scoped exception:
- Caller
Authorization: Bearer <key>is forwarded to upstream. - If missing, Talon tries
OPENAI_API_KEY. - If neither exists, request fails with
401.
- Enforcement mode:
enforce. - PII default action:
redact. - Default model allowlist:
gpt-4o-mini,gpt-4o(useTALON_QUICKSTART_ALLOW_ALL_MODELS=1to disable for local-only experiments). - Evidence includes
upstream_auth_mode,upstream_key_source,upstream_key_fingerprint, and optionalgateway_annotations(e.g.quickstart_mode,quickstart_shadow_mode,quickstart_model_allowlist_disabled,quickstart_unsafe_listen).
Quickstart traffic is projected into the same operational feed used by the dashboard and CLI:
GET /api/v1/events/recent?limit=50GET /api/v1/events/stream(SSE, supportsLast-Event-ID)
Each event includes an evidence_id pointer to the signed evidence record.
401withno upstream credential: set OPENAI_API_KEY or send Authorization: Bearer ...means Talon received neither a client bearer key nor a usableOPENAI_API_KEY.404withpartial OpenAI compatibility in quickstart mode; see docsmeans the requested/v1/*path is outside quickstart scope.
Quickstart is strictly a host-root OpenAI-compatibility facade backed by a synthetic in-process caller. It does not register a synthetic tenant key and does not unlock Talon's tenant-auth surface.
The relocated tenant agent chat route POST /v1/agents/chat/completions is only mounted when the operator has configured real tenant keys (for example through a full gateway config). In default quickstart (no tenant keys), this route is not mounted at all and returns 404 Not Found, preserving a clean facade-only boundary and avoiding any dev-mode-open backdoor to tenant APIs. When tenant keys are configured, the relocated route sits behind standard tenant-auth middleware and returns 401 Unauthorized without a valid key.
--hostomitted: binds to127.0.0.1:<port>.- Loopback hosts (
127.0.0.1,::1,localhost): allowed. - Non-loopback host without
--unsafe-listen: startup error.
--proxy-quickstart cannot be used with:
--gateway--gateway-config
Use --gateway for production-style caller mapping and vaulted provider auth.