Skip to content

Releases: NVIDIA/NeMo-Agent-Toolkit

v1.4.3

14 Mar 00:29
v1.4.3
7d69086

Choose a tag to compare

What's Changed

🐛 Bug Fixes

  • fix(release/1.4): specify explicit versions for the nvidia-nat-all package by @dagardner-nv in #1801

Full Changelog: v1.4.2...v1.4.3

v1.4.2

13 Mar 22:31
v1.4.2
6854b1a

Choose a tag to compare

What's Changed

🐛 Bug Fixes

Full Changelog: v1.4.1...v1.4.2

v1.5.0

12 Mar 20:04

Choose a tag to compare

🚀 Notable Features and Improvements

  • Dynamo Runtime Intelligence: Automatically infer per-request latency sensitivity from agent profiles and apply runtime hints for cache control, load-aware routing, and priority-aware serving.
  • Agent Performance Primitives (APP): Introduce framework-agnostic performance primitives that accelerate graph-based agent frameworks such as LangChain, CrewAI, and Agno with parallel execution, speculative branching, and node-level priority routing.
  • LangSmith Native Integration: Observe end-to-end agent execution with native LangSmith tracing, run evaluation experiments, compare outcomes, and manage prompt versions across development and production workflows.
  • FastMCP Workflow Publishing: Publish NeMo Agent Toolkit workflows as MCP servers using the FastMCP runtime to simplify MCP-native deployment and integration.

Migration notice: 1.5.0 includes packaging and compatibility refactors (meta-package restructure, eval/profiling package split, and import-path updates). See the Migration Guide.

🚨 Breaking Changes

✨ New Features

🔧 Improvements

Read more

v1.4.1

09 Feb 17:51
v1.4.1
9f65fb7

Choose a tag to compare

What's Changed

🐛 Bug Fixes

  • fix(serve): ensure a single event loop for python 3.11 by @willkill07 in #1558

📚 Documentation

Full Changelog: v1.4.0...v1.4.1

v1.4.0

03 Feb 02:18
v1.4.0
34f3894

Choose a tag to compare

🚀 Notable Features and Improvements

  • LangGraph Agent Automatic Wrapper: Easily onboard existing LangGraph agents to NeMo Agent Toolkit. Use the automatic wrapper to access NeMo Agent Toolkit advanced features with very little modification of LangGraph agents.
  • Automatic Reinforcement Learning (RL): Improve your agent quality by fine-tuning open LLMs to better understand your agent's workflows, tools, and prompts. Perform GRPO with OpenPipe ART or DPO with NeMo Customizer using NeMo Agent Toolkit built-in evaluation system as a verifier.
  • Initial NVIDIA Dynamo Integration: Accelerate end-to-end deployment of agentic workflows with initial Dynamo support. Utilize the new agent-aware router to improve worker latency by predicting future agent behavior.
  • A2A Support: Build teams of distributed agents using the A2A protocol.
  • Safety and Security Engine: Strengthen safety and security workflows by simulating scenario-based attacks, profiling risk, running guardrail-ready evaluations, and applying defenses with red teaming. Validate defenses, profile risk, monitor behavior, and harden agents across any framework.
  • Amazon Bedrock AgentCore and Strands Agents Support: Build agents using Strands Agents framework and deploy them securely on Amazon Bedrock AgentCore runtime.
  • Microsoft AutoGen Support: Build agents using the Microsoft AutoGen framework.
  • Per-User Functions: Use per-user functions for deferred instantiation, enabling per-user stateful functions, per-user resources, and other features.

🚨 Breaking Changes

✨ New Features

🔧 Improvements

Read more

v1.3.1

07 Nov 20:21
v1.3.1
78ed10c

Choose a tag to compare

What's Changed

✨ New Features

🐛 Bug Fixes

📝 Documentation Updates

Other Changes

New Contributors

Full Changelog: v1.3.0...v1.3.1

v1.3.0

24 Oct 21:26
v1.3.0
ee4bd73

Choose a tag to compare

What's Changed

🚀 Notable Features and Improvements

  • ADK Support: Supports Google Agent Development Kit (ADK). Adds tool calling, core observability, and LLM integration in this release.
  • Control-Flow Agents: Sequential Executor (Linear Agent) and Router Agent now control flow patterns of tool calls and sub-agents.
  • Function Groups: Packages multiple related functions together so they share configuration, context, and resources.
  • Hyperparameter Agent Optimizer: Automates hyperparameter tuning and prompt engineering for workflows.
  • Introductory Notebook Improvements: Reorganizes getting started notebooks and adds Open in Colab links.
  • LLM Improvements
    • Adds LiteLLM Provider
    • Supports GPT-5 (/chat/completions endpoint only)
    • Adds Nemotron thinking configuration
  • MCP Improvements
    • Supports streamable-http - sse is no longer the default transport type.
    • Supports initial authorization - Enables connecting to MCP servers that require authentication.
    • Supports multiple MCP tools from a single configuration - Pulls in entire tool sets published by MCP servers or filters them based on user configuration.
    • Enhances CLI utilities for MCP servers and clients - Improves the nat mcp sub command for querying, calling, and listing tools.
  • Python 3.13 support

🚨 Breaking Changes

✨ New Features

🔧 Improvements

Read more

v1.2.1

20 Aug 21:25
c8a7f31

Choose a tag to compare

📦 Overview

This is a documentation only release, there are no code changes in this release.

📜 Full Change Log

Full Changelog: v1.2.0...v1.2.1

v1.2.0

19 Aug 23:17
2af034c

Choose a tag to compare

📦 Overview

The NeMo Agent toolkit, formerly known as Agent Intelligence (AIQ) toolkit, has been renamed to align with the NVIDIA NeMo family of products. This release brings significant new capabilities and improvements across authentication, resource management, observability, and developer experience. The toolkit continues to offer backwards compatibility, making the transition seamless for existing users.

While NeMo Agent Toolkit is designed to be compatible with the previous version, users are encouraged to update their code to follow the latest conventions and best practices. Migration instructions are provided in the migration guide.

🚨 Breaking Changes

🚀 Notable Features and Improvements

  • Authentication for Tool Calling: Implement robust authentication mechanisms that enable secure and configurable access management for tool invocation within agent workflows.
  • Test Time Compute: Dynamically reallocate compute resources after model training, allowing agents to optimize reasoning, factual accuracy, and system robustness without retraining the base model.
  • Sizing Calculator: Estimate GPU cluster requirements to support your target number of users and desired response times, simplifying deployment planning and scaling.
  • Object Store Integration: Connect and manage data through supported object stores, improving agent extensibility and enabling advanced data workflows.
  • Enhanced Cursor Rules: Build new workflows or extend existing ones by leveraging cursor rules, making agent development faster and more flexible.
  • Interactive Notebooks: Access a suite of onboarding and example notebooks to accelerate agent workflow development, testing, and experimentation.
  • Observability Refactor: Onboard new observability and monitoring platforms more easily, and take advantage of improved plug-in architecture for workflow inspection and analysis.
  • Examples Reorganization: Organize examples by functionality, making it easier to find and use the examples.

📜 Full Change Log

Read more

v1.1.0

16 May 17:35
4e39dcf

Choose a tag to compare

Key Features

  • Full Model Context Protocol (MCP) support. Workflows/tools can now be exposed as MCP servers.
  • Deep integration with Weights and Biases’ Weave for logging and tracing support.
  • Addition of the Agno LLM framework.
  • A new ReWOO agent which improves on ReAct by removing the tool output from the LLM context, reducing token counts.
  • A new Alert Triage Agent example which demonstrates how to build a full application with AIQ toolkit to automatically analyze system monitoring alerts, performs diagnostic checks using various tools, and generates structured triage reports with root cause categorization.
  • Support for Python 3.11.
  • Various quality of life improvements.

What's Changed

Read more