Add NVIDIA Nemotron 3 Super deployment guide example page by wbrennan899 · Pull Request #85 · vast-ai/docs

wbrennan899 · 2026-03-13T17:24:55Z

Summary

Add end-to-end guide for deploying NVIDIA Nemotron 3 Super (120B/12B active MoE) on Vast.ai using SGLang
Covers instance search, deployment with FP8 quantization, and querying via OpenAI-compatible API
Documents all three reasoning modes (on, off, low-effort) with Python and cURL examples
Includes note about SGLang's nano_v3 parser behavior where reasoning-off responses are returned in reasoning_content instead of content

The search command filtered for disk_space>=150 but the hardware requirements and instance creation both specify 200GB.

wbrennan899 added 2 commits March 13, 2026 10:12

Remove unsupported claim that SGLang is NVIDIA recommended framework

92120f6

Fix disk_space search filter to match 200GB requirement

6a9a998

The search command filtered for disk_space>=150 but the hardware requirements and instance creation both specify 200GB.

mintlify bot deployed to staging March 13, 2026 17:25 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NVIDIA Nemotron 3 Super deployment guide example page#85

Add NVIDIA Nemotron 3 Super deployment guide example page#85
wbrennan899 wants to merge 2 commits intomainfrom
examples/nemotron-3

wbrennan899 commented Mar 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

wbrennan899 commented Mar 13, 2026

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant