[ET-VK][matmul] Re-implement fp32/fp16 matmul and linear with tiled compute and blocked weight packing #11476
| Job | Run time |
|---|---|
| 28m 38s | |
| 19m 51s | |
| 21m 48s | |
| 19m 2s | |
| 18m 53s | |
| 28m 3s | |
| 21m 13s | |
| 21m 19s | |
| 21m 13s | |
| 28m 25s | |
| 21m 22s | |
| 21m 21s | |
| 22m 33s | |
| 23m 10s | |
| 35m 11s | |
| 32m 45s | |
| 41m 6s | |
| 24m 35s | |
| 36m 39s | |
| 35m 26s | |
| 24m 13s | |
| 29m 54s | |
| 33m 14s | |
| 24m 28s | |
| 24m 31s | |
| 28m 26s | |
| 33m 38s | |
| 28m 10s | |
| 23m 28s | |
| 24m 33s | |
| 24m 55s | |
| 25m 11s | |
| 2s | |
| 24m 12s | |
| 25m 11s | |
| 22m 10s | |
| 31m 3s | |
| 30m 38s | |
| 22m 8s | |
| 18m 12s | |
| 17m 44s | |
| 16m 32s | |
| 30m 50s | |
| 35m 45s | |
| 17m 3s | |
| 16m 26s | |
| 16m 24s | |
| 18m 20s | |
| 16m 26s | |
| 16m 40s | |
| 30m 55s | |
| 30m 14s | |
| 31m 19s | |
| 21h 55m 28s |