BLD: Refine cuda build by codingl2k1 · Pull Request #156 · xorbitsai/xllamacpp

codingl2k1 · 2026-06-11T03:33:41Z

Removed the detect_cuda_architectures function and updated CUDA architecture handling to default to 'all' if not set in the environment.

Updated delvewheel repair command to include CUDA path adjustments for Windows.

gemini-code-assist

Code Review

This pull request simplifies the build script by removing the custom detect_cuda_architectures function and defaulting to CMake's "all" keyword when CUDA_ARCHITECTURES is not set. The reviewer suggests using "native" instead of "all" as a fallback for local builds to avoid excessively long compilation times and resource usage by targeting only the host's GPU architecture.
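The fallback discussed in this review can be sketched as follows. This is a minimal illustration, not the code from `scripts/build.py`: the helper names `cuda_architectures` and `cmake_cuda_args` are hypothetical, and the sketch assumes the value is forwarded to CMake through the standard `CMAKE_CUDA_ARCHITECTURES` variable.

```python
import os


def cuda_architectures(env=None, fallback="native"):
    """Pick the CUDA architecture list for the build.

    Hypothetical helper: uses CUDA_ARCHITECTURES from the environment
    when it is set and non-empty, otherwise the given fallback keyword.
    """
    env = os.environ if env is None else env
    value = env.get("CUDA_ARCHITECTURES", "").strip()
    return value or fallback


def cmake_cuda_args(env=None):
    """Build the CMake argument that carries the architecture choice."""
    return [f"-DCMAKE_CUDA_ARCHITECTURES={cuda_architectures(env)}"]
```

With `"native"`, CMake (3.24 or newer) targets only the GPUs it finds on the build host, which is why it suits local builds. A release pipeline that has no GPU at configure time would presumably still set `CUDA_ARCHITECTURES` explicitly.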



iwr-redmond · 2026-06-11T11:08:30Z

Great idea 👍

codingl2k1 added 3 commits June 10, 2026 15:55

Update build-wheel-cuda-hip.yaml

390e624

Refactor CUDA architecture detection in build.py

ebc61b7

Removed the detect_cuda_architectures function and updated CUDA architecture handling to default to 'all' if not set in the environment.

Enhance delvewheel repair for CUDA on Windows

e51717f

Updated delvewheel repair command to include CUDA path adjustments for Windows.
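The exact path adjustments are not shown on this page. As a rough sketch of the idea, the command below adds the CUDA toolkit's `bin` directory to delvewheel's DLL search path. The helper name `delvewheel_repair_cmd` is hypothetical, and it assumes the toolkit location comes from the `CUDA_PATH` environment variable that the Windows CUDA installer normally sets.

```python
import ntpath
import os


def delvewheel_repair_cmd(wheel, out_dir, env=None):
    """Assemble a `delvewheel repair` command line (hypothetical helper).

    When CUDA_PATH is set, its bin directory is passed via --add-path so
    delvewheel can locate the CUDA runtime DLLs the wheel depends on.
    """
    env = os.environ if env is None else env
    cmd = ["delvewheel", "repair", "-w", out_dir]
    cuda_path = env.get("CUDA_PATH")
    if cuda_path:
        # ntpath keeps Windows separators even when run on another OS.
        cmd += ["--add-path", ntpath.join(cuda_path, "bin")]
    cmd.append(wheel)
    return cmd
```

The returned list can be passed to `subprocess.run(cmd, check=True)` on a Windows runner where delvewheel is installed.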

gemini-code-assist Bot reviewed Jun 11, 2026


Comment thread scripts/build.py Outdated

codingl2k1 and others added 4 commits June 11, 2026 11:37

Update scripts/build.py

1eaad4e

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Change CUDA_ARCHITECTURES fallback to 'native'

5ed7a96

Update fallback behavior for CUDA architectures in build script.

Revise CUDA requirements and add architecture coverage

4968bf9

Updated CUDA requirements for Linux and Windows, added CUDA GPU architecture coverage section with details on supported architectures and compatibility.

Improve retry logic for transient network errors

5816c04

Enhanced error handling for network issues during file downloads.
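The commit's actual retry code is not shown here. A common shape for this kind of handling is a bounded retry with exponential backoff, sketched below under that assumption. The names `retry` and `TRANSIENT`, the attempt count, and the delays are all illustrative.

```python
import time
import urllib.error

# Errors treated as transient in this sketch (an assumption, not the PR's list).
TRANSIENT = (urllib.error.URLError, ConnectionError, TimeoutError)


def retry(fetch, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call fetch() until it succeeds or the attempts run out.

    HTTP responses below 500 (for example 404) are re-raised at once,
    since repeating the request is unlikely to help. Other network
    errors wait base_delay * 2**n seconds before the next attempt.
    """
    for attempt in range(1, attempts + 1):
        try:
            return fetch()
        except urllib.error.HTTPError as exc:
            if exc.code < 500 or attempt == attempts:
                raise
        except TRANSIENT:
            if attempt == attempts:
                raise
        sleep(base_delay * 2 ** (attempt - 1))
```

A download step could then be wrapped as `retry(lambda: urllib.request.urlretrieve(url, dest))`, with `url` and `dest` supplied by the caller.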

codingl2k1 merged commit 0d523bd into main Jun 11, 2026
13 of 14 checks passed

codingl2k1 merged 7 commits into main from bld/refine_cuda_build
