[ET-VK][matmul] Re-implement fp32/fp16 matmul and linear with tiled compute and blocked weight packing #4611
| Job | Run time |
|---|---|
| 28m 50s | |
| 29m 10s | |
| 40m 1s | |
| 22m 51s | |
| 23m 23s | |
| 21m 37s | |
| 32m 21s | |
| 34m 16s | |
| 33m 3s | |
| 34m 33s | |
| 34m 50s | |
| 32m 42s | |
| 6h 7m 37s |
| Job | Run time |
|---|---|
| 28m 50s | |
| 29m 10s | |
| 40m 1s | |
| 22m 51s | |
| 23m 23s | |
| 21m 37s | |
| 32m 21s | |
| 34m 16s | |
| 33m 3s | |
| 34m 33s | |
| 34m 50s | |
| 32m 42s | |
| 6h 7m 37s |