Build your own benchmark feature integration #174
Open
anushaknvidia wants to merge 7 commits into
Open
Conversation
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
rkalaniNV
reviewed
Apr 30, 2026
| |---------|-------------| | ||
| | [RAG Agent with Nemotron RAG Models](RAG Agent with Nemotron RAG Models/README.md) | End-to-end example of a Retrieval-Augmented Generation (RAG) agent workflow using Nemotron RAG models through Hugging Face and Nemotron 9B hosted through build.nvidia.com models | | ||
| | [Data Science ML Agent](Data Science ML Agent/README.md) | End-to-end example of a natural language-driven data science and machine learning agent powered by NVIDIA GPUs. The agent allows users to perform data exploration, model training, and hyperparameter optimization interactively using RAPIDS cuDF and cuML for GPU acceleration.| | ||
| | [Build Your Own Benchmark (BYOB)](build-your-own-benchmark/build_mcq_benchmark.ipynb) | End-to-end **Build Your Own Benchmark** flow for MMLU-style multiple-choice: prepare seed data, run a staged Data Designer pipeline (draft, judge, dedupe, distractor checks), and export a `benchmark.parquet`. See `build-your-own-benchmark/download_wikipedia_data.ipynb` to build source text from Wikipedia. | |
There was a problem hiding this comment.
Compilation fails with invalid nested f-strings.
Please add test cases for the byob pipeline.
There is no readme for the review.
Dependencies are not put in pyproject.toml and uv.lock
Please run Nemotron's pre-commit hooks
What is assets folder in the usecase README?
Please take a look at the translation branch for a checklist of agentic skills.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.