
Updated to new CUDA version #103

Open
ljn7 wants to merge 15 commits into HawkAaron:master from ljn7:master

ljn7 commented Jul 9, 2025

This PR updates the PyTorch binding and CMake build setup to support newer CUDA versions (e.g., CUDA 12.9).

Changes:

  • Updated pytorch_binding/setup.py to work with recent CUDA installations.

  • Added support for setting CFLAGS, LDFLAGS, and CUDA_HOME manually when using newer CUDA directory structures.

  • Updated README with troubleshooting instructions.

  • Ensured compatibility with Python 3.12 and PyTorch compiled for newer environments.
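As a rough illustration of the manual `CFLAGS`, `LDFLAGS`, and `CUDA_HOME` override described in the list above, a build script could resolve its search paths along these lines. This is a minimal sketch, not the code in `pytorch_binding/setup.py`: the helper name `resolve_cuda_paths` and the candidate directory layouts (the classic `include`/`lib64` pair and the per-target `targets/x86_64-linux` tree) are assumptions made for the example.

```python
import os
import shlex
from pathlib import Path


def resolve_cuda_paths(environ=None):
    """Hypothetical helper: map CUDA_HOME/CFLAGS/LDFLAGS to extension build options.

    Returns None when CUDA_HOME is unset so the caller can choose a CPU-only build.
    """
    environ = os.environ if environ is None else environ
    cuda_home = environ.get("CUDA_HOME")
    if not cuda_home:
        return None

    root = Path(cuda_home)
    # Assumed layouts: <root>/include and <root>/lib64, plus the per-target
    # tree <root>/targets/x86_64-linux/{include,lib}.
    include_candidates = [root / "include", root / "targets" / "x86_64-linux" / "include"]
    lib_candidates = [root / "lib64", root / "targets" / "x86_64-linux" / "lib"]

    return {
        # Keep only directories that actually exist on this machine.
        "include_dirs": [str(p) for p in include_candidates if p.is_dir()],
        "library_dirs": [str(p) for p in lib_candidates if p.is_dir()],
        # Pass user-supplied flags through unchanged, split shell-style.
        "extra_compile_args": shlex.split(environ.get("CFLAGS", "")),
        "extra_link_args": shlex.split(environ.get("LDFLAGS", "")),
    }
```

The returned keys match the keyword arguments accepted by a setuptools `Extension`, so the dictionary could be unpacked into one; whether the actual `setup.py` is organized this way is not stated in the PR.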

ljn7 and others added 13 commits July 8, 2025 11:00
Pytorch gpu allocator
…rp-transducer)

- Adapted and updated for compatibility and enhancements on my fork
- Updated relevant documentation and options
- Updated CMakeLists.txt to support CUDA and CMake version selection
Updated CMakeLists.txt to support CUDA and CMake version detection/selection
Detect CUDA to turn WITH_GPU on or off by default
- Check CUDA availability before declaring it as project language
- Fall back to CPU-only build when CUDA toolkit not found
- Prevents "Failed to find nvcc" error on systems without CUDA
- Maintains GPU support when CUDA is properly installed
…atform checks

- Enabled robust detection of CUDA and platform compatibility
- Improved build logic to support CUDA builds on Windows
- Added fallback mechanisms and clearer error handling
- Ensured smoother CPU-only builds without redundant checks
- Reduced platform-specific pain points for easier maintenance
- Added robust detection of CUDA toolkit using CUDAToolkit and CUDA packages
- Introduced WITH_GPU option to allow user-controlled GPU builds
- Automatically fall back to CPU-only build if CUDA is unavailable or WITH_GPU=OFF
- Improved status messages to reflect user intent and system CUDA availability
- Added support for CUDA architecture detection and override via CMAKE_CUDA_ARCHITECTURES
- Improved build summary output for clarity
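
The fallback rule described in these commit notes (GPU only when `WITH_GPU` is requested and CUDA is present, with an optional `CMAKE_CUDA_ARCHITECTURES` override) can be mirrored in a small wrapper that assembles the configure command. This is a sketch under stated assumptions: the function name `cmake_configure_args` is hypothetical, "CUDA available" is approximated by `nvcc` being on `PATH`, and the real decision is made inside `CMakeLists.txt`, which is not shown here.

```python
import shutil


def cmake_configure_args(with_gpu=True, cuda_architectures=None, cuda_available=None):
    """Hypothetical helper: build a CMake configure command with CPU fallback."""
    # Assumption: treat CUDA as available when nvcc can be found on PATH.
    if cuda_available is None:
        cuda_available = shutil.which("nvcc") is not None

    # GPU is enabled only if the user asked for it AND CUDA was found.
    use_gpu = bool(with_gpu and cuda_available)

    args = ["cmake", "..", f"-DWITH_GPU={'ON' if use_gpu else 'OFF'}"]
    # The architecture override only makes sense for GPU builds.
    if use_gpu and cuda_architectures:
        args.append(f"-DCMAKE_CUDA_ARCHITECTURES={cuda_architectures}")
    return args
```

For example, `cmake_configure_args(True, "80", cuda_available=False)` yields a CPU-only configure line, matching the "fall back when CUDA is unavailable" behaviour listed above.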
ljn7 force-pushed the master branch 2 times, most recently from 1f14512 to a6f8c22 on July 26, 2025 15:32
ljn7 and others added 2 commits July 26, 2025 21:09
- Uses cmake and make to build and run test_cpu
- Targets only CPU environments for faster and simpler CI
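
The CPU-only CI flow in this commit (configure with cmake, build with make, run `test_cpu`) might look roughly like the sequence below. It is only a sketch: the function names, the `build` directory, the explicit `-DWITH_GPU=OFF` flag, and the location of the `test_cpu` binary inside the build directory are assumptions, not details taken from the workflow file.

```python
import subprocess


def cpu_ci_commands(build_dir="build"):
    """Hypothetical helper: list the commands for a CPU-only build and test run."""
    return [
        # Configure out-of-source with GPU support switched off (assumed flag usage).
        ["cmake", "-S", ".", "-B", build_dir, "-DWITH_GPU=OFF"],
        # Build inside the chosen build directory.
        ["make", "-C", build_dir],
        # Run the CPU test binary (assumed to be emitted into the build directory).
        [f"./{build_dir}/test_cpu"],
    ]


def run_cpu_ci(build_dir="build"):
    """Run each step in order and stop at the first failure."""
    for cmd in cpu_ci_commands(build_dir):
        subprocess.run(cmd, check=True)
```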
- Updated for compatibility and performance similar to b-flo's implementation
- Removed tensorflow and mxnet bindings (might add as git-modules)
