Nemotron 3 Super Improvements and Fixes #124

chrisalexiuk-nvidia · 2026-03-25T17:06:25Z

chrisalexiuk-nvidia
Mar 25, 2026
Maintainer

We are pleased to announce several improvements since the initial Nemotron 3 Super launch on Mar 11, 2026.

What you need to know:

force_nonempty_content now works in streaming mode. Please note that the content field is only not empty in the last response from the server, where it duplicates all the content from the reasoning field (TensorRT-LLM: details).
Fixed the support for tool calling when using the qwen3coder tool parser in vLLM and TRT-LLM. With the fix it should return an object instead of a string when using the anyOf mode (vLLM: details; TensorRT-LLM: details).

The fixes are merged into releases: