transformers: in-place KV cache for decode via a fused InPlaceKvSdpa op#2321
Open
czoli1976 wants to merge 1 commit into
Open
transformers: in-place KV cache for decode via a fused InPlaceKvSdpa op#2321czoli1976 wants to merge 1 commit into
czoli1976 wants to merge 1 commit into