Skip to content

[ET-VK][qconv] Add dynamic PACKED_INT8_CONV2D memory layout for device-adaptive conv2d #10245

[ET-VK][qconv] Add dynamic PACKED_INT8_CONV2D memory layout for device-adaptive conv2d

[ET-VK][qconv] Add dynamic PACKED_INT8_CONV2D memory layout for device-adaptive conv2d #10245

Triggered via pull request March 2, 2026 21:03
Status Success
Total duration 1h 15m 6s
Artifacts 17

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
25m 13s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
2s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
Qwen-Qwen3-0.6B-cuda-non-quantized Expired
1.1 GB
sha256:bf262bf1b5326708578d06b2f2cc5b9bd7a69bb4c590c5aae1c7494c9ebcbb2c
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed Expired
559 MB
sha256:21e023e3216c6a97e1ae8ec0ee0fcc2eb20fac444c48bbe0c2a8a47bd0147140
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only Expired
1.1 GB
sha256:3230d622cff198b7045aedce46adbabb07f5b8f3e65514e76ffd0cbdc20aee70
google-gemma-3-4b-it-cuda-non-quantized Expired
7.22 GB
sha256:54bd14bdfe43e5167fde7d1a6193c0adee12fe0b9cecd70ca64ac25ea155de85
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired
3.36 GB
sha256:ee1aa7de1fa5ccde3ffabaa6fd9195636d005812720bbf74d6ffec4de0ffdff6
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired
6.82 GB
sha256:2925d6cc8428df80ad480f897a9715665d4964ea25c49ddc1174a5c282249d6f
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired
2.8 GB
sha256:1812bb47fce884468c6c8858b5a04b10fd6705fc7d24183a5421dd5637fd1fe2
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired
6.14 GB
sha256:f0c64a2191e7995248fde90c3f5a17089689d59cc3d7dd930933903e8e318d4f
nvidia-parakeet-tdt-cuda-non-quantized Expired
952 MB
sha256:4a0e76f98d276ff705595b2b40cef5ad623548c9e50f5b1be1c460acb9eaa8bd
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed Expired
443 MB
sha256:6506a03e2c65ef64c7cc2da318d55f89e206d5baa60c4a750ce6c7c8cb45c57d
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only Expired
430 MB
sha256:b1a106f8f3feeef8f91c1136418a495260b03c0bc5963d67e8b5593ef8922fd2
openai-whisper-large-v3-turbo-cuda-non-quantized Expired
1.18 GB
sha256:6f31a748c217279c2b0b40d9b6c3afce2a8bbc6c0d8f3bb2810f129451768302
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed Expired
491 MB
sha256:128234e4e968cfc5e85f43b4a24fdc5e8c8d681f442e079c0b8abe792f825078
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only Expired
485 MB
sha256:c2c4aa619addcfc723510084153bd34e4aab8af0e0bb8d1bf5efd567152ce177
openai-whisper-small-cuda-non-quantized Expired
361 MB
sha256:5d6644d61fc7b644300f13aebf6d9aa861c773e7bc25e867e1e01ca84c21c812
openai-whisper-small-cuda-quantized-int4-tile-packed Expired
172 MB
sha256:1d1ae8712f0cc89c7036c1a63571583c98d988161f1872f33e301d653355877f
openai-whisper-small-cuda-quantized-int4-weight-only Expired
270 MB
sha256:7b2bda773770a61d59e76f17464ff4029443880a2cdbfb3f3d221df81faef7f9