[ET-VK] Layout-flexible impl of quantized binary #8864

Triggered via pull request February 13, 2026 17:38
Status Success
Total duration 1h 12m 22s
Artifacts 14

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job: 24m 19s
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds: 2s

Artifacts

Produced during runtime
Name | Status | Size | Digest
google-gemma-3-4b-it-cuda-non-quantized | Expired | 7.22 GB | sha256:92e3c5b8bbe3d6ba7833491cf001043b76e0af87cbd569f597e98df484b6c9b3
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | Expired | 3.36 GB | sha256:95d6d28c81fce51633e6d0d2db757ec314ef11aeb0d0072dde02eceb0dc73f6f
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | Expired | 6.82 GB | sha256:9b297b2a457f4a32a8f12df6f7561b13f971521a551d5e060361d1e3caaeb846
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | Expired | 2.8 GB | sha256:74d2ee157335df0c51b37ddf3176ed2c8c12724d60ad1dee86925d1c4f6e5fea
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | Expired | 6.14 GB | sha256:14edd391381ec1cb069f7521f84667af1bd3aff1e390ac414072758ff91fcaf7
nvidia-parakeet-tdt-cuda-non-quantized | Expired | 952 MB | sha256:36fce13ba617265bdcca0c5e9b5d8a3654af7be9252c684b3caeec3f079f9854
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | Expired | 443 MB | sha256:5452c11c49df1fc7db20ae7247d398769f4a24bd787f0acc56c8983b805e8f46
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | Expired | 430 MB | sha256:992365eded83843f91e8de19e6a4ea354b7ea0aacc5794abe503e9f8b8aac670
openai-whisper-large-v3-turbo-cuda-non-quantized | Expired | 1.18 GB | sha256:9672c0fa81f2772391c7e27aabda2708ed8b14dd9d80767fb9d796ff690ffd37
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | Expired | 491 MB | sha256:8eaf64e1cea7f81390ef12f65483fe4174a4b807a6cb4cf1395ba161cf12eef9
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | Expired | 485 MB | sha256:587a774bffc3ceaa26aba5ee1cf8ba0a931a161b1e6c82d3d55461c31b4ddab4
openai-whisper-small-cuda-non-quantized | Expired | 361 MB | sha256:c7e8971c6fd21e1c5fc57ab7132fc2cd7aa0db4c722c8b17cc66cfbf49d0ba1d
openai-whisper-small-cuda-quantized-int4-tile-packed | Expired | 172 MB | sha256:6b7da71ae63b3abbfb379d722c2749f3b3cc660cbe11835202aa90cf43b33cd4
openai-whisper-small-cuda-quantized-int4-weight-only | Expired | 270 MB | sha256:7b446556f1e63627e908ebb634488b0b5cf1bece44f242ecf03c6c68f3e0894d