Nemotron 3 Super Improvements and Fixes #124
chrisalexiuk-nvidia
announced in
Announcements
We are pleased to announce several improvements since the initial Nemotron 3 Super launch on Mar 11, 2026.
What you need to know:
- force_nonempty_content now works in streaming mode. Note that the content field is non-empty only in the last response from the server, where it duplicates all the content from the reasoning field (TensorRT-LLM: details).
- Fixed tool calling support when using the qwen3coder tool parser in vLLM and TRT-LLM. With the fix, it should return an object instead of a string when using the anyOf mode (vLLM: details; TensorRT-LLM: details).

The fixes are merged into releases:
- build.nvidia.com: https://build.nvidia.com/nvidia/nemotron-3-super-120b-a12b
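
For client code, the two behaviours described above could be handled roughly as follows. This is only a sketch under stated assumptions, not the server's or any library's API: the chunk shape (plain dicts with a `reasoning` and a `content` key), the helper names `collect_stream` and `coerce_object_args`, and the sample tool arguments are all hypothetical. The first helper accumulates streamed deltas and flags the case where the final content simply repeats the reasoning text. The second is a defensive fallback for builds without the tool-calling fix, where an object-valued argument might still arrive as a JSON string.

```python
import json
from typing import Any, Iterable, Mapping


def collect_stream(deltas: Iterable[Mapping[str, Any]],
                   reasoning_key: str = "reasoning") -> dict:
    """Accumulate streamed deltas into full reasoning and content strings.

    Assumption: each delta is a plain dict that may carry a reasoning
    fragment and/or a content fragment. The reasoning key name may differ
    between servers, so it is a parameter here.
    """
    reasoning_parts, content_parts = [], []
    for delta in deltas:
        reasoning_parts.append(delta.get(reasoning_key) or "")
        content_parts.append(delta.get("content") or "")
    reasoning = "".join(reasoning_parts)
    content = "".join(content_parts)
    return {
        "reasoning": reasoning,
        "content": content,
        # True when the final content merely repeats the reasoning text,
        # so a UI can avoid rendering the same text twice.
        "content_duplicates_reasoning": bool(content) and content == reasoning,
    }


def coerce_object_args(arguments: Mapping[str, Any]) -> dict:
    """Decode argument values that arrive as JSON-encoded object strings."""
    fixed = {}
    for name, value in arguments.items():
        if isinstance(value, str):
            try:
                decoded = json.loads(value)
            except json.JSONDecodeError:
                decoded = value
            # Only replace the string when it decodes to an object;
            # ordinary strings such as "Paris" or "42" stay untouched.
            value = decoded if isinstance(decoded, dict) else value
        fixed[name] = value
    return fixed


# Hypothetical stream: content stays empty until the last chunk.
sample_deltas = [
    {"reasoning": "Let me think. "},
    {"reasoning": "The answer is 4."},
    {"content": "Let me think. The answer is 4."},
]
stream_result = collect_stream(sample_deltas)

# Hypothetical tool arguments as an unfixed build might return them.
sample_args = {"filter": '{"city": "Paris"}', "limit": "42", "name": "Paris"}
fixed_args = coerce_object_args(sample_args)
```

With a fixed build the second helper should be a no-op, since object-valued arguments would already be dicts and pass through unchanged.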