Skip to content

Pull requests: Dao-AILab/quack

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Rotary] Fuse copy into kernel
#144 opened May 24, 2026 by simveit Contributor Loading…
[WIP] Blockscaled SM90
#141 opened May 17, 2026 by KareemMusleh Contributor Loading…
Add LayerNorm Backward
#136 opened May 9, 2026 by ighoshsubho Loading…
Fix the SwiGLU/dSwiGLU bug for the (2, 2) epilogue layout
#133 opened May 7, 2026 by GarlGuo Member Loading…
fix the SM100 2CTA issue for SwiGLU/dSwiGLU
#131 opened May 6, 2026 by GarlGuo Member Loading…
Add Sm120 blockscaled FP4 GEMM path
#127 opened Apr 30, 2026 by alecco Contributor Draft
Refactor GEMM for flexible work, operand routing, and epilogue policies
#111 opened Apr 19, 2026 by santoshmo Contributor Loading…
Rmsnorm backward fusing sum
#101 opened Apr 8, 2026 by AaronWang04 Loading…
Fused Add + RMSNorm pattern
#55 opened Nov 9, 2025 by AndreSlavescu Loading…
ProTip! Follow long discussions with comments:>50.