[ET-VK] Layout-flexible impl of quantized binary #1911
cuda-windows.yml
on: pull_request
Matrix: export-model-cuda-windows-artifact
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
mistralai-Voxtral-Mini-3B-2507-cuda-windows-non-quantized
Expired
|
6.82 GB |
sha256:583a87335a4a89b2407365fb52a29d4ce32e91bf0ebe014feb1f32e68e9a07f0
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-windows-quantized-int4-weight-only
Expired
|
6.15 GB |
sha256:eb4e1263492bdbb3df97e4bea8659976c234b54b4e9af1ed8e6a239efb5dfbe8
|
|
|
nvidia-parakeet-tdt-cuda-windows-non-quantized
Expired
|
954 MB |
sha256:793b662bc95f20ce5b171f560b843e45d3980cdd9959694587e4b187a811dc55
|
|
|
nvidia-parakeet-tdt-cuda-windows-quantized-int4-weight-only
Expired
|
432 MB |
sha256:c3053709af92a3482bbf4c28cebc29e363c84000ede15f416841a20cf8edc014
|
|