[ET-VK][matmul] Re-implement fp32/fp16 matmul and linear with tiled compute and blocked weight packing #27507
| Job | Run time |
|---|---|
| 11m 54s | |
| 1m 19s | |
| 5m 58s | |
| 7m 4s | |
| 7m 24s | |
| 41s | |
| 7m 22s | |
| 6m 13s | |
| 6m 42s | |
| 6m 32s | |
| 9m 1s | |
| 6m 58s | |
| 9m 14s | |
| 9m 4s | |
| 8m 41s | |
| 1h 44m 7s |
| Job | Run time |
|---|---|
| 11m 54s | |
| 1m 19s | |
| 5m 58s | |
| 7m 4s | |
| 7m 24s | |
| 41s | |
| 7m 22s | |
| 6m 13s | |
| 6m 42s | |
| 6m 32s | |
| 9m 1s | |
| 6m 58s | |
| 9m 14s | |
| 9m 4s | |
| 8m 41s | |
| 1h 44m 7s |