Skip to content

decode bf16 smallm: support arbitrary 1<heads_per_group<=8 via direct Q/Y GMEM when TMA unsuitable#37

Open
Religious-J wants to merge 1 commit intoTencent:mainfrom
Religious-J:feat/decode_push
Open

decode bf16 smallm: support arbitrary 1<heads_per_group<=8 via direct Q/Y GMEM when TMA unsuitable#37
Religious-J wants to merge 1 commit intoTencent:mainfrom
Religious-J:feat/decode_push

Commits

Commits on Apr 2, 2026