Skip to content

[release/3.4][CUDA] Support Paddle on CUDA 13.2#79154

Merged
sneaxiy merged 1 commit into
PaddlePaddle:release/3.4from
ShigureNyako:cherry-pick/78720/release-3.4
May 27, 2026
Merged

[release/3.4][CUDA] Support Paddle on CUDA 13.2#79154
sneaxiy merged 1 commit into
PaddlePaddle:release/3.4from
ShigureNyako:cherry-pick/78720/release-3.4

Conversation

@ShigureNyako
Copy link
Copy Markdown
Contributor

@ShigureNyako ShigureNyako commented May 26, 2026

PR Category

Environment Adaptation

PR Types

Improvements

Description

devPR:#78720

Backport #78720 to release/3.4 as a single squashed commit.

Source head: aa3920f0c43660700e1ba68e63a4ddb38fbc908b.
Backport commit: 363ad9abe2d0aac09f73132862990171032594f1.

Changes included:

  • Add CUDA 13.x build branch/arch configuration and CUDA 13.x nvcc flags.
  • Restrict bundled flash-attn enablement to CUDA 12.3-12.9 and update third_party/flashattn to the official PaddlePaddle commit required by CUDA 13.2.
  • Add CUDA 13.2 Python package requirements, TensorRT cu13 handling, manylinux CUDA 13.2 Dockerfile/stage, and install_cuda.sh 13.2 support.
  • Add nvcc 13.x workarounds for kernel registration and range/arange explicit instantiations.

Validation:

  • git diff HEAD --check
  • bash -n tools/dockerfile/manylinux/common/install_cuda.sh
  • python3 -m py_compile setup.py
  • Targeted consistency check for CUDA 13.2 markers across the touched CMake/Python/Dockerfile/shell files
  • Commit hook checks passed: ruff, clang-format, cmake-format, CMake lint, whitespace/conflict/private-key checks, and related repository pre-commit hooks

Not run:

  • Full Paddle/CUDA build: not feasible locally in this environment.
  • Docker BuildKit --check: attempted, but Docker daemon is not running (Cannot connect to the Docker daemon at unix:///var/run/docker.sock).

是否引起精度变化

Backport PaddlePaddle#78720 to release/3.4 as a single squashed commit.

Source head: aa3920f
@ShigureNyako ShigureNyako requested a review from zhangbo9674 as a code owner May 26, 2026 17:14
@paddle-bot
Copy link
Copy Markdown

paddle-bot Bot commented May 26, 2026

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@paddle-bot paddle-bot Bot added the contributor External developers label May 26, 2026
@ShigureNyako
Copy link
Copy Markdown
Contributor Author

自查完成,backport diff 已核对。

  • GitHub PR diff 与本地 release/3.4 backport commit 的 stable patch-id 一致:db9e43689b5334b198557a15265035efdf67bbbf
  • 本地验证通过:git diff HEAD --checkbash -n tools/dockerfile/manylinux/common/install_cuda.shpython3 -m py_compile setup.py、CUDA 13.2 相关 CMake/Python/Dockerfile/shell marker 一致性检查。
  • Commit hook 已通过 ruff、clang-format、cmake-format、CMake lint 等仓库检查。
  • 未跑完整 Paddle/CUDA 构建;本机 Docker daemon 未运行,Docker BuildKit --check 无法连接 daemon。

@SigureMo 请 review 这个 release/3.4 backport PR。

@ShigureNyako
Copy link
Copy Markdown
Contributor Author

/re-run all-failed

risemeup1111

This comment was marked as resolved.

@SigureMo
Copy link
Copy Markdown
Member

@ShigureNyako 这个 PR 是 cherry-pick,要和 develop 完全一致,你给我把刚刚的 commit revert 了!!!

@ShigureNyako ShigureNyako force-pushed the cherry-pick/78720/release-3.4 branch from fe73f86 to 363ad9a Compare May 26, 2026 18:12
@ShigureNyako
Copy link
Copy Markdown
Contributor Author

@SigureMo 已撤销刚刚的 review-fix commit,并强推回原来的单 commit cherry-pick 状态:363ad9abe2d0aac09f73132862990171032594f1

我重新核对了 patch-id,当前 backport diff 与 develop 源 PR #78720 的 diff 一致:db9e43689b5334b198557a15265035efdf67bbbf。PR 描述也已同步改回只描述原始 cherry-pick/backport。

@ShigureNyako
Copy link
Copy Markdown
Contributor Author

/re-run ixuca mac

@ShigureNyako
Copy link
Copy Markdown
Contributor Author

/re-run approval

@ShigureNyako
Copy link
Copy Markdown
Contributor Author

Check approval 已在 SigureMo approve 后重跑,当前仍显示缺少 required RD approval:

  • setup.py / python/setup.py.in:需要 risemeup1zhangbo9674 任一 approval(当前已有 zhangbo9674 review request)。
  • paddle/phi/*:需要 wanghuancoder / zrr1999 / DanielSun11 任一 approval。

@zhangbo9674 @wanghuancoder 麻烦有空帮忙看下这个 release/3.4 cherry-pick 的 approval gate。当前 PR 仍保持与源 PR #78720 diff 一致。

Copy link
Copy Markdown
Contributor

@risemeup1 risemeup1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Copy Markdown
Contributor

@wanghuancoder wanghuancoder left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Copy Markdown
Collaborator

@sneaxiy sneaxiy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@sneaxiy sneaxiy merged commit 8b49e40 into PaddlePaddle:release/3.4 May 27, 2026
208 of 256 checks passed
@SigureMo SigureMo deleted the cherry-pick/78720/release-3.4 branch May 27, 2026 06:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

contributor External developers

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants