Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[JAX] HLO FFI tests
#2593 opened Jan 13, 2026 by jberchtold-nvidia Draft
7 of 13 tasks
Revert adding pytorch-triton as a build requirement 2.12.0
#2592 opened Jan 13, 2026 by tdophung Loading…
5 of 13 tasks
[Common] MXFP8 kernel for grouped tensors
#2586 opened Jan 12, 2026 by Oleg-Goncharov Draft
13 tasks
[Common] Enable determinism for cuDNN >= 9.18 on Blackwell 2.12.0
#2584 opened Jan 12, 2026 by cyanguwa Loading…
8 of 13 tasks
docs: Update README Latest News section
#2583 opened Jan 9, 2026 by sbhavani Loading…
3 of 13 tasks
fix(build): Handle namespace packages for PyPI CUDA detection
#2580 opened Jan 9, 2026 by sbhavani Loading…
6 of 13 tasks
fix(examples): te_llama compatibility with transformers >= 4.57
#2572 opened Jan 7, 2026 by sbhavani Loading…
6 of 13 tasks
CPU Optimizations for FP8 cpu_overhead
#2559 opened Jan 5, 2026 by vthumbe1503 Loading…
13 tasks
[PyTorch] Remove unnecessary save of weights
#2549 opened Dec 30, 2025 by pggPL Loading…
8 of 13 tasks
[PyTorch]Add Casting-Free FP8-Flow-MoE Blockwise Optimizations community-contribution PRs from external contributor outside the core maintainers, representing community-driven work.
#2544 opened Dec 26, 2025 by xiaoxi-wangfj Loading…
4 of 13 tasks
[PyT] Plumbing correct bias dims from TE to cudnn attention bug Something isn't working pytorch
#2537 opened Dec 20, 2025 by KshitijLakhani Loading…
5 of 11 tasks
Documentation for cpu offloading documentation Improvements or additions to documentation
#2520 opened Dec 16, 2025 by pggPL Loading…
8 of 13 tasks
Cpu optimizations v2 cpu_overhead
#2514 opened Dec 12, 2025 by vthumbe1503 Draft
13 tasks
[Common] Optimize fused RoPE kernel performance performance Performance issues
#2508 opened Dec 11, 2025 by yaox12 Draft
13 tasks
[common] Add support for cuBLASLt GEMM for GroupedTensor MoE
#2502 opened Dec 10, 2025 by pggPL Loading…
8 tasks done
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.