[ET-VK][matmul] Re-implement fp32/fp16 matmul and linear with tiled compute and blocked weight packing #11466
| Job | Run time |
|---|---|
| 27m 43s | |
| 19m 15s | |
| 19m 22s | |
| 18m 54s | |
| 22m 39s | |
| 21m 9s | |
| 28m 38s | |
| 22m 1s | |
| 28m 9s | |
| 21m 20s | |
| 21m 16s | |
| 21m 24s | |
| 24m 39s | |
| 21m 13s | |
| 33m 45s | |
| 26m 48s | |
| 25m 0s | |
| 33m 3s | |
| 29m 26s | |
| 24m 37s | |
| 36m 42s | |
| 28m 5s | |
| 25m 10s | |
| 24m 44s | |
| 35m 13s | |
| 28m 36s | |
| 23m 16s | |
| 42m 7s | |
| 24m 20s | |
| 35m 12s | |
| 23m 29s | |
| 34m 6s | |
| 3s | |
| 22m 19s | |
| 24m 27s | |
| 22m 10s | |
| 25m 22s | |
| 31m 23s | |
| 17m 45s | |
| 17m 8s | |
| 18m 19s | |
| 29m 54s | |
| 31m 18s | |
| 29m 52s | |
| 16m 30s | |
| 31m 8s | |
| 35m 36s | |
| 17m 51s | |
| 16m 25s | |
| 16m 20s | |
| 16m 44s | |
| 30m 53s | |
| 16m 35s | |
| 21h 59m 23s |