feat(gateway): serve embedding models from OpenRouter #3109
Open
kilo-code-bot[bot] wants to merge 2 commits into main from
Conversation
Make /api/gateway/embedding-models proxy the live OpenRouter catalog (output_modalities=embeddings) so clients see all supported embedding models, mirroring the existing /api/gateway/models endpoint for language models.
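As a rough sketch of the proxying described above: the helper builds the filtered catalog URL and validates the response shape before returning it. The OpenRouter base URL, the helper name, and the minimal structural check below are assumptions standing in for the PR's actual code and `OpenRouterModelsResponseSchema`:

```typescript
// Assumed OpenRouter API base; the PR references it as ${OPENROUTER}.
const OPENROUTER = "https://openrouter.ai/api/v1";

interface OpenRouterModel {
  id: string;
  name?: string;
}

interface OpenRouterModelsResponse {
  data: OpenRouterModel[];
}

// Build the catalog URL with the embeddings filter from the PR description.
function embeddingModelsUrl(base: string = OPENROUTER): string {
  const url = new URL(`${base}/models`);
  url.searchParams.set("output_modalities", "embeddings");
  return url.toString();
}

// Minimal structural check standing in for OpenRouterModelsResponseSchema:
// the response must be an object with a `data` array of { id: string } entries.
function isModelsResponse(value: unknown): value is OpenRouterModelsResponse {
  if (typeof value !== "object" || value === null) return false;
  const data = (value as { data?: unknown }).data;
  return (
    Array.isArray(data) &&
    data.every(
      (m) =>
        typeof m === "object" &&
        m !== null &&
        typeof (m as { id?: unknown }).id === "string",
    )
  );
}
```

A route handler would then `fetch(embeddingModelsUrl())`, run the parsed JSON through the schema, and return `{ data: [...] }` as-is, or a 500 with a Sentry capture when either step fails.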
Code Review Summary — Status: No Issues Found | Recommendation: Merge | Files Reviewed: 3
Reviewed by gpt-5.5-2026-04-23 · 159,355 tokens
Importing jest from @jest/globals interferes with module auto-mocking under SWC, leaving the mocked function as the real implementation.
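The review comment above can be illustrated with a hypothetical test-file fragment (module path and mocked helper name are assumptions, not code from this PR); the point is to rely on the `jest` global injected by the test runner rather than importing `jest` from `@jest/globals` when SWC-based auto-mocking is in play:

```typescript
// Hypothetical test file. Under the SWC jest transform, uncommenting the import
// below can defeat auto-mocking, so the real implementation would run:
// import { jest } from "@jest/globals";

// Using the runner-injected `jest` global keeps the auto-mock in effect.
jest.mock("../providers/openrouter");

import { getOpenRouterEmbeddingModels } from "../providers/openrouter";

test("returns the mocked catalog, not the real implementation", async () => {
  (getOpenRouterEmbeddingModels as jest.Mock).mockResolvedValue({ data: [] });
  await expect(getOpenRouterEmbeddingModels()).resolves.toEqual({ data: [] });
});
```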
Summary
Follow-up to #3099. The `/api/gateway/embedding-models` endpoint now proxies the live OpenRouter catalog (`output_modalities=embeddings`) instead of returning a hardcoded list, mirroring the existing `/api/gateway/models` endpoint for language models. Clients now see every embedding model OpenRouter exposes (Gemini, Perplexity, Qwen, BAAI, Sentence Transformers, etc.) without us shipping a new release each time the catalog changes.

- Added `getOpenRouterEmbeddingModels()` in `apps/web/src/lib/ai-gateway/providers/openrouter/index.ts`; it fetches `${OPENROUTER}/models?output_modalities=embeddings`, validates the response with `OpenRouterModelsResponseSchema`, and reuses the existing attribution headers and Sentry plumbing.
- Updated `apps/web/src/app/api/gateway/embedding-models/route.ts` to call that helper and return the OpenRouter response shape (`{ data: [...] }`), with a 500 + Sentry capture on failure.

Verification
`curl http://localhost:3000/api/gateway/embedding-models` returns the live OpenRouter embeddings catalog.

Visual Changes
N/A
Reviewer Notes
- Breaking shape change: clients move from the old `{ defaultModel, models, aliases }` catalog to the standard `OpenRouterModelsResponse` (`{ data: OpenRouterModel[] }`) used by `/api/gateway/models`. Any client still relying on the old shape (e.g. the bundled fallback in the kilocode indexing PR) should be updated in lockstep.
- The `next.revalidate` caching that the language-models endpoint uses.
- `apps/web/src/lib/ai-gateway/embeddings/kilo-embedding-models.ts` is untouched; it is still consumed by `apps/web/src/app/(app)/claw/components/embeddingModels.ts` and other internal call sites, and can be retired in a separate change once those move to the live endpoint.
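To make the shape change concrete, here is a hypothetical client-side adapter for the lockstep migration. The legacy field names come from the note above; the adapter itself is an assumption for illustration, not code from this PR:

```typescript
// Old hardcoded catalog shape served before this PR (field names per the reviewer note).
interface LegacyEmbeddingCatalog {
  defaultModel: string;
  models: string[];
  aliases: Record<string, string>;
}

// New response shape: the standard OpenRouterModelsResponse.
interface OpenRouterModel {
  id: string;
}
interface OpenRouterModelsResponse {
  data: OpenRouterModel[];
}

// Hypothetical bridge: project the new shape onto the old one while
// downstream call sites are migrated to the live endpoint.
function toLegacyCatalog(
  res: OpenRouterModelsResponse,
  defaultModel: string,
): LegacyEmbeddingCatalog {
  return {
    defaultModel,
    models: res.data.map((m) => m.id),
    aliases: {}, // no alias information exists in the new shape
  };
}
```

A client holding such an adapter can switch to the live endpoint immediately and delete the bridge once nothing reads the old fields.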