Skip to content

Default CPU benchmark estimators to max parallelism#8025

Draft
dantegd wants to merge 12 commits intorapidsai:mainfrom
dantegd:fea-bench-yaml-njobs
Draft

Default CPU benchmark estimators to max parallelism#8025
dantegd wants to merge 12 commits intorapidsai:mainfrom
dantegd:fea-bench-yaml-njobs

Conversation

@dantegd
Copy link
Copy Markdown
Member

@dantegd dantegd commented Apr 28, 2026

This PR updates the cuML benchmark harness so CPU-side estimators use maximum available parallelism by default when the underlying estimator supports it. In practice, sklearn/UMAP-style estimators now default to n_jobs=-1 unless the benchmark config or CLI explicitly provides a different value. This should avoid users or anyone in general to get a false picture of the speed of execution of libraries.

HDBSCAN is handled explicitly through its CPU-specific parallelism parameter, core_dist_n_jobs=-1. Explicit user-provided values are preserved, so benchmarks can still constrain CPU parallelism when needed.

Builds on top of #7980

@dantegd dantegd added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Apr 28, 2026
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented Apr 28, 2026

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@github-actions github-actions Bot added conda conda issue Cython / Python Cython or Python issue labels Apr 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

conda conda issue Cython / Python Cython or Python issue improvement Improvement / enhancement to an existing function non-breaking Non-breaking change

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants