[ET-VK] Layout-flexible impl of quantized binary #8864
Triggered via pull request
February 13, 2026 17:38
Status
Success
Total duration
1h 12m 22s
Artifacts
14
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda
/
linux-job
24m 19s
Matrix: test-models-cuda
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
google-gemma-3-4b-it-cuda-non-quantized
Expired
|
7.22 GB |
sha256:92e3c5b8bbe3d6ba7833491cf001043b76e0af87cbd569f597e98df484b6c9b3
|
|
|
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
Expired
|
3.36 GB |
sha256:95d6d28c81fce51633e6d0d2db757ec314ef11aeb0d0072dde02eceb0dc73f6f
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
Expired
|
6.82 GB |
sha256:9b297b2a457f4a32a8f12df6f7561b13f971521a551d5e060361d1e3caaeb846
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
Expired
|
2.8 GB |
sha256:74d2ee157335df0c51b37ddf3176ed2c8c12724d60ad1dee86925d1c4f6e5fea
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
Expired
|
6.14 GB |
sha256:14edd391381ec1cb069f7521f84667af1bd3aff1e390ac414072758ff91fcaf7
|
|
|
nvidia-parakeet-tdt-cuda-non-quantized
Expired
|
952 MB |
sha256:36fce13ba617265bdcca0c5e9b5d8a3654af7be9252c684b3caeec3f079f9854
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
Expired
|
443 MB |
sha256:5452c11c49df1fc7db20ae7247d398769f4a24bd787f0acc56c8983b805e8f46
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
Expired
|
430 MB |
sha256:992365eded83843f91e8de19e6a4ea354b7ea0aacc5794abe503e9f8b8aac670
|
|
|
openai-whisper-large-v3-turbo-cuda-non-quantized
Expired
|
1.18 GB |
sha256:9672c0fa81f2772391c7e27aabda2708ed8b14dd9d80767fb9d796ff690ffd37
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
Expired
|
491 MB |
sha256:8eaf64e1cea7f81390ef12f65483fe4174a4b807a6cb4cf1395ba161cf12eef9
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
Expired
|
485 MB |
sha256:587a774bffc3ceaa26aba5ee1cf8ba0a931a161b1e6c82d3d55461c31b4ddab4
|
|
|
openai-whisper-small-cuda-non-quantized
Expired
|
361 MB |
sha256:c7e8971c6fd21e1c5fc57ab7132fc2cd7aa0db4c722c8b17cc66cfbf49d0ba1d
|
|
|
openai-whisper-small-cuda-quantized-int4-tile-packed
Expired
|
172 MB |
sha256:6b7da71ae63b3abbfb379d722c2749f3b3cc660cbe11835202aa90cf43b33cd4
|
|
|
openai-whisper-small-cuda-quantized-int4-weight-only
Expired
|
270 MB |
sha256:7b446556f1e63627e908ebb634488b0b5cf1bece44f242ecf03c6c68f3e0894d
|
|