[cuda backend] skip fully-masked KV blocks calculation in SDPA #13925
| Job | Run time |
|---|---|
| 3s | |
| 30s | |
| 46m 19s | |
| 45m 33s | |
| 16m 10s | |
| 18m 55s | |
| 31m 17s | |
| 31m 33s | |
| 24m 18s | |
| 33m 28s | |
| 21m 35s | |
| 29m 41s | |
| 34m 22s | |
| 1d 0h 0m 0s | |
| 21m 15s | |
| 21m 38s | |
| 18m 5s | |
| 27m 4s | |
| 28m 54s | |
| 29m 40s | |
| 33m 56s | |
| 1d 0h 0m 0s | |
| 22m 29s | |
| 3s | |
| 0s | |
| -1s | |
| 2d 8h 56m 47s |