[ET-VK][ez] Make q8ta_conv2d use 4C1W layout #1956
cuda-windows.yml
on: pull_request
Matrix: export-model-cuda-windows-artifact
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
mistralai-Voxtral-Mini-3B-2507-cuda-windows-non-quantized
Expired
|
6.82 GB |
sha256:66b5ba32699dd20b3ea5703e621aebe5e632160ce6cd0c753240623162ecc588
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-windows-quantized-int4-weight-only
Expired
|
6.15 GB |
sha256:075f8c0e680c78722daf11fcc76080df6c9772a0e9d35f5db3d518c4fd38e1d7
|
|
|
nvidia-parakeet-tdt-cuda-windows-non-quantized
Expired
|
954 MB |
sha256:e8c66bc9e80a13fad817c85fa7b1b3c9efe72851520141be1d35e11c821abd48
|
|
|
nvidia-parakeet-tdt-cuda-windows-quantized-int4-weight-only
Expired
|
432 MB |
sha256:93287f75e8dfa32db1d3cacfcccc8fce47afaf08807de5db50512f79e56c15ee
|
|