Skip to content

[ET-VK][qconv] Read weight buffer as int in pack_q8_conv2d_weights shader #9803

[ET-VK][qconv] Read weight buffer as int in pack_q8_conv2d_weights shader

[ET-VK][qconv] Read weight buffer as int in pack_q8_conv2d_weights shader #9803

Triggered via pull request February 25, 2026 03:16
Status Success
Total duration 1h 9m 11s
Artifacts 14

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
25m 3s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
2s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
google-gemma-3-4b-it-cuda-non-quantized Expired
7.22 GB
sha256:29f2931ff0b0f9229eb7b861fc98da20fdd78eb3227c69a4273ea1f522ca7516
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired
3.36 GB
sha256:9a4fc0f6ee52e6b2df437d94d54826a43aa684e52f9cce894c3da24f11de0c9f
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired
6.82 GB
sha256:6605fe716faa88d7f4d1d8690d3060fb2c50cadb19212c0e9c7819dc35c0cf88
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired
2.8 GB
sha256:d9cbfa8b5d59cdeb6adb31441565d4d7b7b4b6a69e47dcfaddcbfc662fe9b536
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired
6.14 GB
sha256:c8fc5fb45883d4ee561d344f107860d2b4e411ee750acd81111b92ddab70ef1e
nvidia-parakeet-tdt-cuda-non-quantized Expired
952 MB
sha256:c11a791ee52eee6c3b53857012edb1ee5ae75d85b51be687e162ed31389ed2ff
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed Expired
443 MB
sha256:6942d297621188c5935f245f5eb9d4b8f10143ad25628daf23893d35ac02353e
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only Expired
430 MB
sha256:b32f29c9362deb91b5056d3271b71adc9e2bd855bec4cd4fca84737f978ec2f3
openai-whisper-large-v3-turbo-cuda-non-quantized Expired
1.18 GB
sha256:5a4be2ae80212cd663bf58a4d1c4d9897bb672b9ee786bc8f98609d517cc6eca
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed Expired
491 MB
sha256:46bf09e95edf69ce3952488699e6acac217d98e259a7d4dc0d01ded80175994f
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only Expired
485 MB
sha256:a8462cd6694fd7192144a89c218ea3ef713dffb28b40425da0ec83e5f989a75b
openai-whisper-small-cuda-non-quantized Expired
361 MB
sha256:33dcb7750d7f57660faa5cac4eb6ed13943cb05fb9d3aa6b666cd3ee83810351
openai-whisper-small-cuda-quantized-int4-tile-packed Expired
172 MB
sha256:a7d979ea2597aa1ee7180dc1bb41c1eb314b9086dc206d7c6af8f6e19cfed578
openai-whisper-small-cuda-quantized-int4-weight-only Expired
270 MB
sha256:79de1efa2cc88d24dfd6db1ee65b5ce339c1b2bbce4a2f2f98738b5c9060b35e