[ET-VK][ez] Add AOT support for PackedInt8_4C1W dtype #8817
Triggered via pull request on February 13, 2026 03:57
Status: Success
Total duration: 1h 20m 21s
Artifacts: 14
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job: 24m 15s
Matrix: test-models-cuda
Artifacts
Produced during runtime
| Name | Status | Size | Digest |
|---|---|---|---|
| google-gemma-3-4b-it-cuda-non-quantized | Expired | 7.22 GB | sha256:023ab4b1634149eea663fa693d63175ec70c1514b35a4dd508797d6a5d815dda |
| google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | Expired | 3.36 GB | sha256:75961e0c250bf18f0410809e048e0ceea81195c7e70c77765f251922dd9ebb73 |
| mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | Expired | 6.82 GB | sha256:06b86e04ec5183d275dd08f0d3f3c8e5e849adca6e94862750c12350826e33f1 |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | Expired | 2.8 GB | sha256:b8f4873c60f46553cf638528bfc2c304ccf3e68a72e0d01c8dfba4c08b93197f |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | Expired | 6.14 GB | sha256:5e8a0e20adbb1ba9b1da80ef24845a88bedc1b0598c0750dba757cce6b52588e |
| nvidia-parakeet-tdt-cuda-non-quantized | Expired | 952 MB | sha256:1e998f088e1cd767f4f2de8afa80cb694284a6e7818375cd7ed4704f6e15f750 |
| nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | Expired | 443 MB | sha256:54ad0fe55ec57af9b4a39ad28720b0a8c75306e027968b6045ac3eeb9251be03 |
| nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | Expired | 430 MB | sha256:707e707f860d229c2745527a3dc80d02127f8df3640381400dac9001f88bb01c |
| openai-whisper-large-v3-turbo-cuda-non-quantized | Expired | 1.18 GB | sha256:6b0040fe5955aeb8b95544ec55eee2f7d08f79d5118740169d621db7433b1696 |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | Expired | 491 MB | sha256:98315f967a9f308aea74782015e5eab9c9f64674aa9b6c090fe15326d626c82f |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | Expired | 485 MB | sha256:5a142217f98ac37ccb9f7dbf1a7c4a615808b0b9f2042a94ea836ecd30d1ebd8 |
| openai-whisper-small-cuda-non-quantized | Expired | 361 MB | sha256:b14468c3c6fd247594aca0de4aa148dca459fa2fdbda47d2b2fba5008f8d2ead |
| openai-whisper-small-cuda-quantized-int4-tile-packed | Expired | 172 MB | sha256:43fe16aac8aba9d9f6d9fcd79828036fb0c5a9a0d2ad92694706fc3a68824c4b |
| openai-whisper-small-cuda-quantized-int4-weight-only | Expired | 270 MB | sha256:9a0a173d15888a1c5e54f76e2bc9c98b074ccfd08d44b6ec3cecd9fb3dcc212c |