Use this path when you already have an app using an OpenAI-compatible SDK and want Talon governance without writing gateway YAML.
talon serve --proxy-quickstart --port 8080By default Talon binds 127.0.0.1 and enables:
POST /v1/chat/completionsPOST /v1/responses
export OPENAI_BASE_URL=http://127.0.0.1:8080/v1
export OPENAI_API_KEY=sk-your-keyIf your SDK already appends /v1, set OPENAI_BASE_URL=http://127.0.0.1:8080 instead.
Your app keeps using the OpenAI SDK. Talon is now in the request path.
curl -sS http://127.0.0.1:8080/v1/chat/completions \
-H "Authorization: Bearer sk-test" \
-H "Content-Type: application/json" \
-d '{"model":"gpt-4o-mini","messages":[{"role":"user","content":"hello"}]}'Responses API:
curl -sS http://127.0.0.1:8080/v1/responses \
-H "Authorization: Bearer sk-test" \
-H "Content-Type: application/json" \
-d '{"model":"gpt-4o-mini","input":"hello"}'talon audit list --tenant quickstart --limit 5Look for:
tenant_id=quickstartagent_id=quickstart-localupstream_auth_mode=client_bearerupstream_key_source=openai_api_key_envwhen no bearer was sent and env fallback was used
- Enforcement mode defaults to
enforce(shadow optional viaTALON_QUICKSTART_MODE=shadow). - PII default action is
redact. - Key source precedence: client bearer >
OPENAI_API_KEY> 401. - Partial OpenAI compatibility: only chat completions and responses create endpoints are supported at host root.
401withno upstream credential: set OPENAI_API_KEY or send Authorization: Bearer ...means Talon did not receive any usable upstream key. Send a bearer token from the client or setOPENAI_API_KEYin the Talon process.- Model denied in policy means quickstart's default allowlist blocked it. Default models are
gpt-4o-miniandgpt-4o. For local-only testing, setTALON_QUICKSTART_ALLOW_ALL_MODELS=1. 404for/v1/embeddingsorGET /v1/responses/{id}is expected in v1 quickstart scope. See Reference: proxy quickstart.- Startup bind error on non-loopback host requires
--unsafe-listenin quickstart mode. This is recorded in evidence via thequickstart_unsafe_listenannotation and is intended for local/dev exceptions.
For production gateway rollout, use --gateway and gateway guides.