Pokrov.AI

 ____   ___  _  ______   _____     __   _    ___ 
|  _ \ / _ \| |/ /  _ \ / _ \ \   / /  / \  |_ _|
| |_) | | | | ' /| |_) | | | \ \ / /  / _ \  | | 
|  __/| |_| | . \|  _ <| |_| |\ V /_ / ___ \ | | 
|_|    \___/|_|\_\_| \_\\___/  \_/(_)_/   \_\___|

Rust 1.85+ | Apache 2.0 | Docker |

Pokrov.AI is a self-hosted, security-first proxy that sits between AI coding agents and external LLM/MCP providers. It sanitizes prompts, tool arguments, and model responses in real time — preventing secrets, PII, and corporate markers from leaving your infrastructure.

Built in Rust for low-latency inline processing, Pokrov works transparently with OpenCode, Codex, Cursor, Cline, and any OpenAI-compatible agent or autonomous AI system.

Quick Start

export POKROV_API_KEY='dev-runtime-key'
export OPENAI_API_KEY='provider-dev-key'
cargo run -p pokrov-runtime -- --config ./config/pokrov.example.yaml

Pokrov is now running at http://127.0.0.1:8080. Point your agent's OpenAI-compatible base URL there. See Quick Start (Local) for full setup, or Container Run for Docker.

How It Works

  AI Agent (Cursor / Codex / OpenCode / Cline / any OpenAI-compatible)
       |
       |  OpenAI-compatible request
       v
  +-------------------------------------------+
  |             Pokrov.AI Gateway             |
  |                                           |
  |  1. Authenticate (API key / mTLS)         |
  |  2. Select policy profile (per-key)       |
  |  3. Sanitize input (regex + NER + rules)  |
  |  4. Enforce tool allowlist (MCP)          |
  |  5. Proxy to upstream LLM / MCP server    |
  |  6. Sanitize response (optional)          |
  |  7. Emit metadata-only audit event        |
  +-------------------------------------------+
       |
       |  Safe response
       v
  AI Agent

Who Is This For

Segment	Why Pokrov
AI / Developer Platform Teams	Single reusable layer for agent-to-LLM and agent-to-MCP traffic; no per-agent guardrails needed
Security / AppSec	Centralized enforcement point, metadata-only audit trail, Prometheus metrics, dry-run mode for safe policy rollout
Teams using coding agents	Stop API keys, credentials, and internal URLs from leaking into external LLMs via prompts or tool outputs
Compliance-driven organizations	On-prem deployment, no raw payloads in logs, supports 152-FZ / GDPR alignment

Why Pokrov

Pokrov occupies a focused niche: a sanitization-first interaction layer, not a generic AI gateway.

Compared to	Pokrov difference
LiteLLM, Portkey, OpenRouter	Sanitization is the core feature, not an add-on plugin
Kong AI/MCP Gateway	Lighter, developer-first; focused on content hygiene, not enterprise control-plane breadth
Nightfall MCP Security	Open-source, self-hosted, no vendor lock-in
HelloVeil / Microsoft Presidio	Proxy-aware runtime with MCP tool control and policy engine, not just a client-side SDK
Solo / Agentgateway	Sanitization-first, not identity/auth-first

Project Status

Current scope: Pokrov.AI v1 runtime focused on sanitization-first LLM/MCP proxying.
Maturity: actively developed; interfaces and config are stable within documented v1 boundaries.
Roadmap and verification artifacts live in specs/ and docs/verification/.

Purpose

Pokrov is a single control point for coding/AI agents to:

prevent leakage of secrets, PII, and corporate markers to LLM/MCP systems;
enforce centralized allow/mask/redact/block policies;
validate MCP tool calls against per-server and per-tool allowlists;
separate enforce and dry_run modes for safe rollout;
keep metadata-only audit records (no raw payloads in logs).

What Pokrov Detects

All detection is deterministic (regex + NER), runs in-memory, and never stores raw content.

Category	Examples	Default Action
Secrets & credentials	`sk-...`, `AKIA...`, `ghp_...`, private keys, passwords, tokens	block
PII	Email addresses, phone numbers, credit card numbers (Luhn), passport numbers, IBAN	redact
Corporate markers	Internal URLs (`*.internal.corp`), project codes, custom keywords from config	mask / redact
Named entities (NER, optional)	Person names, organization names (English + Russian)	redact

Before Pokrov:

{
  "messages": [
    {"role": "user", "content": "Connect to db using sk-prod-abc123, contact admin@corp.internal"}
  ]
}

After Pokrov (strict profile, enforce mode):

{
  "messages": [
    {"role": "user", "content": "Connect to db using [REDACTED], contact [REDACTED]"}
  ]
}

The decision (allow / mask / redact / block) is configurable per policy profile. Use mode: "dry_run" to preview what would be changed without enforcing.

Key Features

Sanitization core with recursive JSON processing (POST /v1/sanitize/evaluate).
LLM proxy path (OpenAI-compatible chat/completions) with:
- input sanitization,
- optional output sanitization,
- stream and non-stream handling.
MCP mediation path with:
- server/tool allowlists,
- blocking and argument validation,
- tool output sanitization.
API key auth with policy-profile binding.
BYOK upstream auth mode (static and passthrough) with support for OpenAI-compatible single-bearer passthrough on LLM endpoints.
Rate limiting (requests + token units) with enforce and dry_run modes.
Health/readiness/metrics endpoints.
Metadata-only audit and structured logging.
Release evidence artifact generation.
Optional NER-based PII detection (persons, organizations) for English and Russian with per-profile entity type filtering and ONNX inference.

Runtime Endpoints

GET /health
GET /ready
GET /metrics
POST /v1/sanitize/evaluate
GET /v1/models
POST /v1/chat/completions
POST /v1/responses
POST /v1/mcp/tool-call
POST /v1/mcp/tools/{toolName}/invoke

Architecture (Workspace Crates)

pokrov-core: detection, traversal, transformation, policy evaluation, dry-run decisions.
pokrov-api: HTTP routes, middleware, auth/rate-limit entrypoint wiring.
pokrov-proxy-llm: OpenAI-compatible LLM mediation and upstream routing.
pokrov-proxy-mcp: MCP tool mediation, allowlist/validation, output sanitization.
pokrov-config: YAML model/loading/validation and environment secret resolution.
pokrov-metrics: runtime metrics hooks and Prometheus registry integration.
pokrov-runtime: process bootstrap, lifecycle, readiness/shutdown, release evidence.
pokrov-ner (optional feature): ONNX-backed multilingual NER adapter.

Out of Scope for v1

Per docs/PRD.md, v1 intentionally excludes: A2A proxy, RBAC/IAM, SIEM export, web UI/admin panel, heavy ML NER (beyond lightweight ONNX models), response caching, and a full control plane.

Prerequisites

Rust stable toolchain (workspace targets Rust 2021 edition).
Minimum baseline from project specs: rustc 1.85+.
Newer stable compilers are expected to work (current local environment example: rustc 1.94.1).
cargo in PATH.
jq for quick response inspection in the curl examples.
rg (ripgrep) for metrics probe examples.
Provider credential for upstream LLM calls (for example OPENAI_API_KEY in static mode).
Optional NER setup only if you enable --features ner:
- Python 3 + pip.
- torch, transformers, optimum to export/download ONNX model assets.

Quick Start (Local)

export POKROV_API_KEY='dev-runtime-key'
export OPENAI_API_KEY='provider-dev-key'
cargo run -p pokrov-runtime -- --config ./config/pokrov.example.yaml

With NER enabled:

cargo run -p pokrov-runtime --features ner -- --config ./config/pokrov.example.yaml

WARNING: Debug Payload Trace

observability.llm_payload_trace is a sensitive debug mechanism that writes outbound LLM payloads to a local file. Treat this as development-only instrumentation.

Must be explicitly enabled in config (enabled: true).
Requires build feature llm_payload_trace.
Runtime refuses startup when enabled in release builds.
Never enable it in production environments or with real secrets.
Do not build production artifacts with --features llm_payload_trace.

Probe checks:

curl -sS http://127.0.0.1:8080/health
curl -sS -i http://127.0.0.1:8080/ready
curl -sS http://127.0.0.1:8080/metrics | rg 'pokrov_'

Sanitization check:

curl -sS -X POST http://127.0.0.1:8080/v1/sanitize/evaluate \
  -H "Authorization: Bearer $POKROV_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "profile_id": "strict",
    "mode": "enforce",
    "payload": {
      "messages": [
        {"role": "user", "content": "token sk-test-12345678"}
      ]
    }
  }' | jq

LLM proxy check:

curl -sS -X POST http://127.0.0.1:8080/v1/chat/completions \
  -H "Authorization: Bearer $POKROV_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "gpt-4o-mini",
    "stream": false,
    "messages": [{"role": "user", "content": "hello"}]
  }' | jq

Model discovery check:

curl -sS http://127.0.0.1:8080/v1/models \
  -H "Authorization: Bearer $POKROV_API_KEY" | jq

BYOK passthrough check:

curl -sS -X POST http://127.0.0.1:8080/v1/chat/completions \
  -H "X-Pokrov-Api-Key: $POKROV_API_KEY" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "gpt-4o-mini",
    "stream": false,
    "messages": [{"role": "user", "content": "hello"}]
  }' | jq

NER (Named Entity Recognition)

Pokrov includes an optional NER module that detects person names and organization names in English and Russian text using lightweight ONNX models. NER hits are merged with builtin/deterministic detectors and go through the same overlap resolution and policy pipeline.

How It Works

The sanitization engine extracts string leaves from the JSON payload.
Candidate strings are filtered (dates, numbers, short lowercase tokens are skipped).
Unique strings are batched and sent to the NER engine for inference.
Detected entities are mapped to pii (person) or corporate_markers (organization) categories.
Per-profile entity_types controls which entity kinds each profile detects.

Prerequisites

pip install torch transformers optimum

Download Models

Download both recommended models:

./scripts/download-ner-model.sh --all

Or download a single model:

./scripts/download-ner-model.sh dslim/bert-base-NER models/bert-base-NER
./scripts/download-ner-model.sh r1char9/ner-rubert-tiny-news models/ner-rubert-tiny-news

Default model locations:

Language	Model	Directory
EN	`dslim/bert-base-NER`	`models/bert-base-NER/`
RU	`r1char9/ner-rubert-tiny-news`	`models/ner-rubert-tiny-news/`

Configuration

Enable NER in the runtime config under the top-level ner section and set ner_enabled: true on each sanitization profile that should use NER detection.

ner:
  enabled: true
  default_language: ""
  skip_fields: []
  models:
    - language: en
      model_path: "./models/bert-base-NER/model.onnx"
      tokenizer_path: "./models/bert-base-NER/tokenizer.json"
      priority: 100
    - language: ru
      model_path: "./models/ner-rubert-tiny-news/model.onnx"
      tokenizer_path: "./models/ner-rubert-tiny-news/tokenizer.json"
      priority: 100
  fallback_language: en
  timeout_ms: 80
  confidence_threshold: 0.7
  max_seq_length: 512
  profiles:
    strict:
      fail_mode: fail_closed
      entity_types: [person, organization]
    minimal:
      fail_mode: fail_open
      entity_types: [person]

sanitization:
  profiles:
    strict:
      ner_enabled: true
      # ...
    minimal:
      ner_enabled: true
      # ...

Key settings:

Field	Description
`ner.enabled`	Global NER on/off switch
`ner.models[].language`	Language tag (`en`, `ru`, etc.) used for auto-detection
`ner.models[].priority`	Model priority when multiple models match the same language
`ner.profiles.<name>.entity_types`	Which entity types to detect for this profile: `person`, `organization`
`ner.profiles.<name>.fail_mode`	`fail_open` (skip NER on error) or `fail_closed` (block request on error)
`sanitization.profiles.<name>.ner_enabled`	Per-profile toggle — must be `true` for NER to run on that profile

Running with NER

export POKROV_API_KEY='dev-runtime-key'
cargo run -p pokrov-runtime --features ner -- --config ./config/pokrov.example.yaml

NER Examples

English — person and organization detection:

curl -sS -X POST http://127.0.0.1:8080/v1/sanitize/evaluate \
  -H "Authorization: Bearer $POKROV_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "profile_id": "strict",
    "mode": "enforce",
    "effective_language": "en",
    "payload": {
      "text": "John Smith works at Google and Alice Johnson visited Microsoft"
    }
  }' | jq '{action: .final_action, hits: .explain.rule_hits_total, ner: .explain.family_counts.ner, sanitized: .sanitized_payload}'

Result:

{
  "action": "redact",
  "hits": 4,
  "ner": 4,
  "sanitized": {
    "text": "[REDACTED] works at [REDACTED] and [REDACTED] visited [REDACTED]"
  }
}

Russian — person and organization detection:

curl -sS -X POST http://127.0.0.1:8080/v1/sanitize/evaluate \
  -H "Authorization: Bearer $POKROV_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "profile_id": "strict",
    "mode": "enforce",
    "effective_language": "ru",
    "payload": {
      "text": "Иван Петров работает в Газпром и Мария Иванова посетила офис Яндекса"
    }
  }' | jq '{action: .final_action, hits: .explain.rule_hits_total, ner: .explain.family_counts.ner, sanitized: .sanitized_payload}'

Result:

{
  "action": "redact",
  "hits": 4,
  "ner": 4,
  "sanitized": {
    "text": "[REDACTED] работает в [REDACTED] и [REDACTED] посетила офис [REDACTED]"
  }
}

Mixed payload — NER combined with builtin secret detection:

curl -sS -X POST http://127.0.0.1:8080/v1/sanitize/evaluate \
  -H "Authorization: Bearer $POKROV_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "profile_id": "strict",
    "mode": "enforce",
    "effective_language": "en",
    "payload": {
      "user_name": "John Smith",
      "company": "Google",
      "secret_key": "sk-test-abc12345",
      "message": "Alice Johnson from Microsoft confirmed the payment"
    }
  }' | jq '{action: .final_action, hits: .explain.rule_hits_total, families: .explain.family_counts}'

Result — NER hits merged with builtin secret/card detection:

{
  "action": "block",
  "hits": 7,
  "families": {
    "builtin": 3,
    "ner": 4,
    "normalized_hit": 7,
    "resolved_hit": 7
  }
}

Per-Profile Entity Type Filtering

The ner.profiles section controls which entity types each sanitization profile detects:

strict profile with entity_types: [person, organization] — detects both persons and organizations.
minimal profile with entity_types: [person] — detects only person names; organization names pass through.
Profiles not listed in ner.profiles use the adapter default (person + organization).

Performance

EN model (dslim/bert-base-NER): ~20-50ms per batch, PER F1=0.91.
RU model (r1char9/ner-rubert-tiny-news): ~15-30ms per batch.
Batch inference with automatic text deduplication — identical strings across JSON fields are inferred only once.
P95 sanitization + NER overhead stays within the 50ms latency budget.

Container Run

docker build -t pokrov-runtime:latest .
docker run --rm -p 8080:8080 \
  -e POKROV_API_KEY='dev-runtime-key' \
  -e OPENAI_API_KEY='provider-dev-key' \
  pokrov-runtime:latest

Container build args for NER models:

Use local pre-converted models only (skip network download in Docker build):

docker build -t pokrov-runtime:latest --build-arg SKIP_NER_DOWNLOAD=true .

Override RU model source used when models/ner-rubert-tiny-news/ is missing:

docker build -t pokrov-runtime:latest \
  --build-arg RU_NER_MODEL_ID=r1char9/ner-rubert-tiny-news .

OpenCode Setup with Pokrov

You can connect Pokrov in OpenCode as a custom OpenAI-compatible provider.

Add credentials for your provider id (for example, pokrov) via /connect (or opencode auth login).
Add or update opencode.json in your project:

{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "pokrov": {
      "npm": "@ai-sdk/openai-compatible",
      "name": "Pokrov Runtime",
      "options": {
        "baseURL": "http://127.0.0.1:8080/v1"
      },
      "models": {
        "gpt-4o-mini": {
          "name": "gpt-4o-mini via Pokrov"
        }
      }
    }
  }
}

Select the gpt-4o-mini model from provider pokrov.
Ensure requests use your bearer key (POKROV_API_KEY).

Codex Setup with Pokrov

You can configure a custom provider in ~/.codex/config.toml:

model_provider = "pokrov"
model = "gpt-4o-mini"

[model_providers.pokrov]
name = "Pokrov Runtime"
base_url = "http://127.0.0.1:8080/v1"
env_key = "POKROV_API_KEY"
wire_api = "responses"

Important:

Pokrov exposes both POST /v1/chat/completions and Codex-compatible POST /v1/responses.
Codex custom providers use the responses wire protocol.
/v1/responses is a minimal compatibility layer mapped to the existing sanitization-first proxy pipeline.

Authentication Modes for Codex

auth.gateway_auth_mode: api_key (default):
- Gateway auth via X-Pokrov-Api-Key or Authorization: Bearer ... against security.api_keys.
auth.gateway_auth_mode: mesh_mtls:
- Gateway auth via trusted mesh identity header (default x-forwarded-client-cert).
- Useful for Istio deployments where mTLS is enforced in the service mesh.
auth.gateway_auth_mode: internal_mtls:
- Gateway auth is bound to transport-verified client certificates (mTLS), not request headers.
- Runtime config requires server.tls.* client-certificate validation settings (enabled=true, require_client_cert=true, client_ca_file set).
- The configured identity header name (default x-pokrov-client-cert-subject) is compatibility metadata only and is not treated as proof of client identity.
- Example runtime config:

server:
  host: 0.0.0.0
  port: 8443
  tls:
    enabled: true
    cert_file: /etc/pokrov/tls/server.crt
    key_file: /etc/pokrov/tls/server.key
    client_ca_file: /etc/pokrov/tls/clients-ca.crt
    require_client_cert: true

auth:
  gateway_auth_mode: internal_mtls
  internal_mtls:
    identity_header: x-pokrov-client-cert-subject
    require_header: true
  upstream_auth_mode: static

auth.upstream_auth_mode: static:
- Use POKROV_API_KEY as the gateway credential.
- Pokrov uses provider credentials from runtime config for upstream calls.
auth.upstream_auth_mode: passthrough:
- Send gateway credential via X-Pokrov-Api-Key.
- Send provider credential via Authorization: Bearer ....
- For OpenAI-compatible LLM endpoints (/v1/chat/completions, /v1/responses), single-bearer mode is supported: one Authorization: Bearer ... can be used for both gateway auth and upstream passthrough credential.

Upstream TLS Trust (Custom CA)

When upstream LLM endpoint uses a private/corporate CA, set SSL_CERT_FILE for the Pokrov runtime process so outbound upstream TLS validation can trust that CA.

export SSL_CERT_FILE=/etc/pokrov/certs/corp-root-ca.pem
cargo run -p pokrov-runtime -- --config ./config/pokrov.example.yaml

SSL_CERT_FILE must point to a PEM file.
The file may contain one or more BEGIN CERTIFICATE blocks (CA chain).
If the CA is not trusted, upstream calls fail with connect errors such as invalid peer certificate: UnknownIssuer.

Routing UX Notes

llm.providers[].upstream_path lets you override the provider upstream endpoint path.
llm.providers[].auth.api_key is optional for local/no-auth upstreams in auth.upstream_auth_mode: static.
- Set it to env:VAR/file:/path when upstream requires bearer auth.
- Leave it empty to call upstream without Authorization header.
llm.routes[].aliases lets you expose additional model ids that map to the same canonical route.
response_envelope.pokrov_metadata.mode supports:
- enabled (default): include pokrov metadata in successful LLM responses
- suppressed: omit pokrov metadata for strict-client compatibility

OpenCode Setup with Pokrov (Split Auth)

OpenCode custom providers support explicit header configuration, so you can keep gateway and upstream credentials separated in passthrough mode:

{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "pokrov": {
      "npm": "@ai-sdk/openai-compatible",
      "name": "Pokrov Runtime",
      "options": {
        "baseURL": "http://127.0.0.1:8080/v1",
        "headers": {
          "X-Pokrov-Api-Key": "{env:POKROV_GATEWAY_KEY}",
          "Authorization": "Bearer {env:OPENAI_API_KEY}"
        }
      },
      "models": {
        "gpt-4o-mini": {
          "name": "gpt-4o-mini"
        }
      }
    }
  }
}

Quick Verification for `/v1/responses`

Static mode:

curl -sS -X POST http://127.0.0.1:8080/v1/responses \
  -H "Authorization: Bearer $POKROV_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "gpt-4o-mini",
    "stream": false,
    "input": "print hello world in rust"
  }' | jq

Passthrough mode:

curl -sS -X POST http://127.0.0.1:8080/v1/responses \
  -H "X-Pokrov-Api-Key: $POKROV_API_KEY" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "gpt-4o-mini",
    "stream": false,
    "input": "print hello world in rust"
  }' | jq

Expected:

HTTP 200 for valid requests.
request_id in headers/body and pokrov metadata in JSON response.
Metadata-only structured errors on auth/policy/upstream failures.

Quality Checks

See docs/configuration.md for the complete configuration reference with all fields, defaults, and examples. See config/README.md for config directory usage notes.

cargo check --workspace
cargo fmt --check
cargo test --workspace
cargo clippy --all-targets --all-features

Release evidence generation:

cargo run -p pokrov-runtime -- \
  --release-evidence-output ./config/release/release-evidence.json \
  --release-id hardening-v1 \
  --artifact ./config/pokrov.example.yaml \
  --artifact ./config/release/verification-checklist.md

Security Policy

See SECURITY.md for vulnerability reporting and response guidelines.

Contributing

See CONTRIBUTING.md for contribution workflow, coding expectations, and pull request guidance.

License

This project is licensed under the Apache License 2.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
.agents/skills		.agents/skills
.github/workflows		.github/workflows
.specify		.specify
config		config
crates		crates
docs		docs
scripts		scripts
specs		specs
src		src
target-clippy		target-clippy
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
rustfmt.toml		rustfmt.toml
skills-lock.json		skills-lock.json

Folders and files

Latest commit

History

Repository files navigation

Pokrov.AI

Quick Start

How It Works

Who Is This For

Why Pokrov

Table of Contents

Project Status

Purpose

What Pokrov Detects

Key Features

Runtime Endpoints

Architecture (Workspace Crates)

Out of Scope for v1

Prerequisites

Quick Start (Local)

WARNING: Debug Payload Trace

NER (Named Entity Recognition)

How It Works

Prerequisites

Download Models

Configuration

Running with NER

NER Examples

Per-Profile Entity Type Filtering

Performance

Container Run

OpenCode Setup with Pokrov

Codex Setup with Pokrov

Authentication Modes for Codex

Upstream TLS Trust (Custom CA)

Routing UX Notes

OpenCode Setup with Pokrov (Split Auth)

Quick Verification for /v1/responses

Quality Checks

Security Policy

Contributing

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Quick Verification for `/v1/responses`

Packages