
[pull] main from NVIDIA:main #592

Merged: pull[bot] merged 1 commit into phu0ngng:main from NVIDIA:main on May 2, 2026

pull[bot] commented May 2, 2026

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.4)

* Handle empty tensors in dequantize for CUDA graph compatibility

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* dequant with swizzled scales

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* pass nvfp4 dequant tests

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* cleanup unit tests

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* remove allocation in set amax

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* Drop disabling `optimize_for_gemm` introduced in PR 2644

Signed-off-by: Ziang Li <ziangli@umich.edu>
Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* Drop `optimize_for_gemm` in basic linear

Signed-off-by: Ziang Li <ziangli@umich.edu>
Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Apply suggestions from code review

Co-authored-by: Tim Moon <4406448+timmoon10@users.noreply.github.com>
Signed-off-by: Tim Moon <4406448+timmoon10@users.noreply.github.com>

* remove redundant set scale

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* rebase on nvfp4 test fix

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* remove redundant line

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* add missing from_cpu() for scale

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>

* Remove unnecessary scale from NVFP4 C++ tests

Signed-off-by: Tim Moon <tmoon@nvidia.com>

---------

Signed-off-by: YigongQin <qqqyyy1233@outlook.com>
Signed-off-by: Ziang Li <ziangli@umich.edu>
Signed-off-by: Tim Moon <4406448+timmoon10@users.noreply.github.com>
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Ziang Li <ziangli@umich.edu>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Tim Moon <4406448+timmoon10@users.noreply.github.com>
Co-authored-by: Tim Moon <tmoon@nvidia.com>

pull[bot] locked and limited conversation to collaborators on May 2, 2026
pull[bot] added the ⤵️ pull label on May 2, 2026
pull[bot] merged commit 0803102 into phu0ngng:main on May 2, 2026
pull[bot] had a problem deploying to github-pages on May 2, 2026 at 10:33 (Failure)