Skip to content

perf(prefill): MoE prefill CUDA-graph capture — +9-27% pp512 on NVFP4

5119ddc
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Merged

perf(prefill): MoE prefill CUDA-graph capture — +9-27% pp512 on NVFP4 #179

perf(prefill): MoE prefill CUDA-graph capture — +9-27% pp512 on NVFP4
5119ddc
Select commit
Loading
Failed to load commit list.
enable-auto-merge
succeeded May 15, 2026 in 4s