[ET-VK][matmul] Re-implement fp32/fp16 matmul and linear with tiled compute and blocked weight packing #27550
| Job | Run time |
|---|---|
| 11m 37s | |
| 12m 1s | |
| 9m 16s | |
| 9m 16s | |
| 7m 4s | |
| 6m 38s | |
| 5m 32s | |
| 6m 27s | |
| 7m 39s | |
| 6m 52s | |
| 6m 27s | |
| 6m 24s | |
| 9m 6s | |
| 7m 10s | |
| 9m 2s | |
| 2h 0m 31s |
| Job | Run time |
|---|---|
| 11m 37s | |
| 12m 1s | |
| 9m 16s | |
| 9m 16s | |
| 7m 4s | |
| 6m 38s | |
| 5m 32s | |
| 6m 27s | |
| 7m 39s | |
| 6m 52s | |
| 6m 27s | |
| 6m 24s | |
| 9m 6s | |
| 7m 10s | |
| 9m 2s | |
| 2h 0m 31s |