Missing model
The model gemini-3.1-flash-lite (the stable/GA release) is available on Google AI but absent from the model catalog (packages/proxy/schema/model_list.json). The catalog currently has only the preview variant gemini-3.1-flash-lite-preview.
Google's models page lists gemini-3.1-flash-lite with a "Stable" badge, and the Google AI pricing page lists it as a distinct entry from the preview version with identical pricing.
Model details
| Field |
Value |
| Model ID |
gemini-3.1-flash-lite |
| Format |
google |
| Flavor |
chat |
| Multimodal |
true (text, image, video, audio, PDF input) |
| Reasoning |
true (thinking supported) |
| Reasoning budget |
true |
| Input cost (per 1M tokens) |
$0.25 |
| Output cost (per 1M tokens) |
$1.50 |
| Cache read cost (per 1M tokens) |
$0.025 |
| Max input tokens |
1048576 |
| Max output tokens |
65536 |
| Available providers |
["google"] |
| Display name |
Gemini 3.1 Flash-Lite |
Verification checklist
Verification notes
| Field |
Source |
| Model ID, token limits, capabilities |
Model detail page |
| "Stable" status |
Models overview page |
| Standard pricing ($0.25/$1.50) |
Pricing page |
| Cache read pricing ($0.025) |
Pricing page — listed under context caching section |
reasoning, reasoning_budget |
Inferred from "Thinking: Supported" on model detail page, consistent with preview variant |
Fields NOT verified from official sources:
supported_regions — not published on Google AI docs; Vertex AI pricing page lists only the preview variant, so Vertex availability for the stable version is unconfirmed. Only google is listed as an available provider.
max_output_tokens discrepancy note — the model detail page reports 65,536; the preview variant in the catalog uses 65,536 for the Google entry and 65,535 for the Vertex entry.
{
"kind": "missing_model",
"provider": "google",
"models": ["gemini-3.1-flash-lite"],
"status": "active",
"model_specs": {
"gemini-3.1-flash-lite": {
"format": "google",
"flavor": "chat",
"multimodal": true,
"input_cost_per_mil_tokens": 0.25,
"output_cost_per_mil_tokens": 1.5,
"input_cache_read_cost_per_mil_tokens": 0.025,
"displayName": "Gemini 3.1 Flash-Lite",
"reasoning": true,
"reasoning_budget": true,
"max_input_tokens": 1048576,
"max_output_tokens": 65536,
"available_providers": ["google"]
}
},
"source_urls": [
"https://ai.google.dev/gemini-api/docs/models/gemini-3.1-flash-lite",
"https://ai.google.dev/gemini-api/docs/models/gemini",
"https://ai.google.dev/pricing"
]
}
Missing model
The model
gemini-3.1-flash-lite(the stable/GA release) is available on Google AI but absent from the model catalog (packages/proxy/schema/model_list.json). The catalog currently has only the preview variantgemini-3.1-flash-lite-preview.Google's models page lists
gemini-3.1-flash-litewith a "Stable" badge, and the Google AI pricing page lists it as a distinct entry from the preview version with identical pricing.Model details
gemini-3.1-flash-litegooglechattrue(text, image, video, audio, PDF input)true(thinking supported)true$0.25$1.50$0.025104857665536["google"]Gemini 3.1 Flash-LiteVerification checklist
model_list.json. Recent commits cover Bedrock Nova, Mistral Magistral, and Together models only.gemini-2.5-flash-lite(stable) andgemini-3.1-flash-lite-previewboth exist in the catalog, sogemini-3.1-flash-liteis the natural stable counterpart.Verification notes
reasoning,reasoning_budgetFields NOT verified from official sources:
supported_regions— not published on Google AI docs; Vertex AI pricing page lists only the preview variant, so Vertex availability for the stable version is unconfirmed. Onlygoogleis listed as an available provider.max_output_tokensdiscrepancy note — the model detail page reports 65,536; the preview variant in the catalog uses 65,536 for the Google entry and 65,535 for the Vertex entry.{ "kind": "missing_model", "provider": "google", "models": ["gemini-3.1-flash-lite"], "status": "active", "model_specs": { "gemini-3.1-flash-lite": { "format": "google", "flavor": "chat", "multimodal": true, "input_cost_per_mil_tokens": 0.25, "output_cost_per_mil_tokens": 1.5, "input_cache_read_cost_per_mil_tokens": 0.025, "displayName": "Gemini 3.1 Flash-Lite", "reasoning": true, "reasoning_budget": true, "max_input_tokens": 1048576, "max_output_tokens": 65536, "available_providers": ["google"] } }, "source_urls": [ "https://ai.google.dev/gemini-api/docs/models/gemini-3.1-flash-lite", "https://ai.google.dev/gemini-api/docs/models/gemini", "https://ai.google.dev/pricing" ] }