[ET-VK][matmul] Re-implement fp32/fp16 matmul and linear with tiled compute and blocked weight packing #27534
| Job | Run time |
|---|---|
| 40s | |
| 11m 44s | |
| 40s | |
| 7m 9s | |
| 6m 58s | |
| 6m 13s | |
| 6m 55s | |
| 6m 57s | |
| 9m 6s | |
| 9m 14s | |
| 6m 31s | |
| 6m 42s | |
| 6m 58s | |
| 5m 47s | |
| 9m 2s | |
| 1h 40m 36s |
| Job | Run time |
|---|---|
| 40s | |
| 11m 44s | |
| 40s | |
| 7m 9s | |
| 6m 58s | |
| 6m 13s | |
| 6m 55s | |
| 6m 57s | |
| 9m 6s | |
| 9m 14s | |
| 6m 31s | |
| 6m 42s | |
| 6m 58s | |
| 5m 47s | |
| 9m 2s | |
| 1h 40m 36s |