Skip to content

[cuda backend] skip fully-masked KV blocks calculation in SDPA #13925

[cuda backend] skip fully-masked KV blocks calculation in SDPA

[cuda backend] skip fully-masked KV blocks calculation in SDPA #13925

Job Run time
3s
30s
46m 19s
45m 33s
16m 10s
18m 55s
31m 17s
31m 33s
24m 18s
33m 28s
21m 35s
29m 41s
34m 22s
1d 0h 0m 0s
21m 15s
21m 38s
18m 5s
27m 4s
28m 54s
29m 40s
33m 56s
1d 0h 0m 0s
22m 29s
3s
0s
-1s
2d 8h 56m 47s