-
-
Notifications
You must be signed in to change notification settings - Fork 0
Roadmap
Planned features and improvements for Nenya. Items are grouped by domain — implementation order depends on user demand and technical feasibility.
Synchronous non-streaming responses (stream: false) — buffers upstream SSE into complete JSON response before returning.
Full lifecycle (/v1/responses) with GET/POST/DELETE support.
Token counting, rate limiting, and usage tracking for embeddings requests.
File CRUD (/v1/files): create, list, get, delete, content download.
Batch API (/v1/batches): submit, check status, cancel, retrieve results.
/proxy/{provider}/* — arbitrary HTTP method passthrough with auth injection, SSE streaming auto-detect.
Auto-generated spec served at /openapi.json.
Dynamic fetch of /v1/models from providers at startup and on reload.
Vector-based caching using local embeddings and cosine similarity (zero-dep, in-memory).
Elo rank-based fallback with capability overlap scoring.
External model list with pricing, categories, rankings for cost tracking.
Detailed per-agent/model/provider usage breakdowns with time-series data.
CRUD via API with internal hot-reload — manage agents, providers, and keys without editing JSON.
Inspect and manually control circuit breaker state per target.
These features are explicitly out of scope for Nenya's single-user, local-first design:
- Multi-tenancy — Designed for single-user, local deployment
- Per-key budgets — No multi-user isolation needed
- Cluster mode — Single-node by design
- Admin UI — Admin APIs provided; UI is a separate project
- Semantic search — Not relevant for gateway use case
- Workflow engine — Agents serve a similar purpose
- Zero-dependency: All features maintain Go stdlib-only policy
- Backward compatibility: All new features preserve existing streaming behavior
-
Security: Admin APIs require
client_tokenauth - Testing: Each feature includes unit, integration, and fuzz tests
- Home — Project overview and features
- Architecture — Current architecture
- Providers — Provider reference
Getting Started
- Home — Project overview
- Quick Start — Install and run in 5 minutes
- Client Setup — OpenCode, Cursor, and other clients
- Deployment — Bare metal, container, Kubernetes
Core Concepts
- Configuration — Config reference and examples
- Providers — 22 providers, capabilities, special behaviors
- Routing — Latency-aware routing and fallback chains
- Architecture — Package overview and request lifecycle
- MCP Integration — MCP server integration
Reference
- Passthrough Proxy — Raw provider endpoint proxying
- Secrets — Systemd credentials and container secrets
- Model Discovery — Dynamic model catalog fetching
- API Endpoints — Endpoint reference
Operations
- Demo — Test all pipeline tiers
- Troubleshooting — Common issues and solutions
- FAQ — Frequently asked questions
- Security — Security policy and vulnerability reporting
Project
- Roadmap — Planned features
- Disclaimer — Legal disclaimer