[ET-VK][ez] Make q8ta_conv2d use 4C1W layout #8815
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda
/
linux-job
24m 20s
Matrix: test-models-cuda
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
google-gemma-3-4b-it-cuda-non-quantized
Expired
|
7.22 GB |
sha256:efac0bebc9554eb169c7e3ba93c806d4bef623f47c498ecb847c1fa6a02ac1d3
|
|
|
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
Expired
|
3.36 GB |
sha256:0e7d6f542f12a72fa8f00ec0d951890cb3038e015c356c38d7b9535e5d07293f
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
Expired
|
6.82 GB |
sha256:ebe08697f1b1fbd06d92c311dcb46ad3371fad84a6f03814608a4963b3426efd
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
Expired
|
2.8 GB |
sha256:37ef40d677580243869bb107348fefff551be1d008c27d6d0928a8365c0bbeb5
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
Expired
|
6.14 GB |
sha256:052d39afba0be805dff69df8b9c18fb918c3faf3398326a61890a2340ec7eeb2
|
|
|
nvidia-parakeet-tdt-cuda-non-quantized
Expired
|
952 MB |
sha256:a83bc97be662cd92d30df80b4494c89678d4310468eb4bc287cce67cae354afa
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
Expired
|
443 MB |
sha256:862e3e3fce3a0c597b123a16e2e759044f07063cd75d63fc5b9d7e3c4c1d3f54
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
Expired
|
430 MB |
sha256:f153e6b5193fab8c7220398f67e8587d446e50b022bd8a1e22abeb6e71d54868
|
|
|
openai-whisper-large-v3-turbo-cuda-non-quantized
Expired
|
1.18 GB |
sha256:bfda48251c8666d0ff5a3f806c6529c2c36ad7d1cd9098ef8cec6486a2d246eb
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
Expired
|
491 MB |
sha256:eeef78b40df85fb235ce36b96f2f6bdbd84685ee00d7e9acaa8cf3dd2bb746cf
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
Expired
|
485 MB |
sha256:6f4114eb658fa03b4bf234ec181e4a8be42dc97a2247ffd13c68ba4d95757515
|
|
|
openai-whisper-small-cuda-non-quantized
Expired
|
361 MB |
sha256:5f5a4710971e3c60186361a9eb9b2133e64e882b817e78a70fe15f6dbafac42f
|
|
|
openai-whisper-small-cuda-quantized-int4-tile-packed
Expired
|
172 MB |
sha256:2fd147604431dd4ac871288582785f7bd9e66204bc3d6a4882e91674a924d42d
|
|
|
openai-whisper-small-cuda-quantized-int4-weight-only
Expired
|
270 MB |
sha256:960197284e3b17bbacb38bf5c7b1726c0323a5954bfad19cde7074b4e6e5fa57
|
|