Skip to content

[ET-VK][ez] Make q8ta_conv2d use 4C1W layout #8815

[ET-VK][ez] Make q8ta_conv2d use 4C1W layout

[ET-VK][ez] Make q8ta_conv2d use 4C1W layout #8815

Triggered via pull request February 13, 2026 03:57
Status Success
Total duration 1h 13m 9s
Artifacts 14

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
24m 20s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
4s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
google-gemma-3-4b-it-cuda-non-quantized Expired
7.22 GB
sha256:efac0bebc9554eb169c7e3ba93c806d4bef623f47c498ecb847c1fa6a02ac1d3
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired
3.36 GB
sha256:0e7d6f542f12a72fa8f00ec0d951890cb3038e015c356c38d7b9535e5d07293f
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired
6.82 GB
sha256:ebe08697f1b1fbd06d92c311dcb46ad3371fad84a6f03814608a4963b3426efd
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired
2.8 GB
sha256:37ef40d677580243869bb107348fefff551be1d008c27d6d0928a8365c0bbeb5
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired
6.14 GB
sha256:052d39afba0be805dff69df8b9c18fb918c3faf3398326a61890a2340ec7eeb2
nvidia-parakeet-tdt-cuda-non-quantized Expired
952 MB
sha256:a83bc97be662cd92d30df80b4494c89678d4310468eb4bc287cce67cae354afa
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed Expired
443 MB
sha256:862e3e3fce3a0c597b123a16e2e759044f07063cd75d63fc5b9d7e3c4c1d3f54
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only Expired
430 MB
sha256:f153e6b5193fab8c7220398f67e8587d446e50b022bd8a1e22abeb6e71d54868
openai-whisper-large-v3-turbo-cuda-non-quantized Expired
1.18 GB
sha256:bfda48251c8666d0ff5a3f806c6529c2c36ad7d1cd9098ef8cec6486a2d246eb
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed Expired
491 MB
sha256:eeef78b40df85fb235ce36b96f2f6bdbd84685ee00d7e9acaa8cf3dd2bb746cf
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only Expired
485 MB
sha256:6f4114eb658fa03b4bf234ec181e4a8be42dc97a2247ffd13c68ba4d95757515
openai-whisper-small-cuda-non-quantized Expired
361 MB
sha256:5f5a4710971e3c60186361a9eb9b2133e64e882b817e78a70fe15f6dbafac42f
openai-whisper-small-cuda-quantized-int4-tile-packed Expired
172 MB
sha256:2fd147604431dd4ac871288582785f7bd9e66204bc3d6a4882e91674a924d42d
openai-whisper-small-cuda-quantized-int4-weight-only Expired
270 MB
sha256:960197284e3b17bbacb38bf5c7b1726c0323a5954bfad19cde7074b4e6e5fa57