[ET-VK][matmul] Re-implement fp32/fp16 matmul and linear with tiled compute and blocked weight packing #4568
| Job | Run time |
|---|---|
| | 23m 21s |
| | 28m 20s |
| | 28m 44s |
| | 23m 37s |
| | 21m 29s |
| | 40m 4s |
| | 1m 39s |
| | 2m 24s |
| | 41s |
| | 51s |
| | 41s |
| | 41s |
| Total | 2h 52m 32s |