Skip to content

[ET-VK][ez] Add AOT support for PackedInt8_4C1W dtype #8817

[ET-VK][ez] Add AOT support for PackedInt8_4C1W dtype

[ET-VK][ez] Add AOT support for PackedInt8_4C1W dtype #8817

Triggered via pull request February 13, 2026 03:57
Status Success
Total duration 1h 20m 21s
Artifacts 14

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
24m 15s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
3s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
google-gemma-3-4b-it-cuda-non-quantized Expired
7.22 GB
sha256:023ab4b1634149eea663fa693d63175ec70c1514b35a4dd508797d6a5d815dda
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired
3.36 GB
sha256:75961e0c250bf18f0410809e048e0ceea81195c7e70c77765f251922dd9ebb73
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired
6.82 GB
sha256:06b86e04ec5183d275dd08f0d3f3c8e5e849adca6e94862750c12350826e33f1
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired
2.8 GB
sha256:b8f4873c60f46553cf638528bfc2c304ccf3e68a72e0d01c8dfba4c08b93197f
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired
6.14 GB
sha256:5e8a0e20adbb1ba9b1da80ef24845a88bedc1b0598c0750dba757cce6b52588e
nvidia-parakeet-tdt-cuda-non-quantized Expired
952 MB
sha256:1e998f088e1cd767f4f2de8afa80cb694284a6e7818375cd7ed4704f6e15f750
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed Expired
443 MB
sha256:54ad0fe55ec57af9b4a39ad28720b0a8c75306e027968b6045ac3eeb9251be03
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only Expired
430 MB
sha256:707e707f860d229c2745527a3dc80d02127f8df3640381400dac9001f88bb01c
openai-whisper-large-v3-turbo-cuda-non-quantized Expired
1.18 GB
sha256:6b0040fe5955aeb8b95544ec55eee2f7d08f79d5118740169d621db7433b1696
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed Expired
491 MB
sha256:98315f967a9f308aea74782015e5eab9c9f64674aa9b6c090fe15326d626c82f
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only Expired
485 MB
sha256:5a142217f98ac37ccb9f7dbf1a7c4a615808b0b9f2042a94ea836ecd30d1ebd8
openai-whisper-small-cuda-non-quantized Expired
361 MB
sha256:b14468c3c6fd247594aca0de4aa148dca459fa2fdbda47d2b2fba5008f8d2ead
openai-whisper-small-cuda-quantized-int4-tile-packed Expired
172 MB
sha256:43fe16aac8aba9d9f6d9fcd79828036fb0c5a9a0d2ad92694706fc3a68824c4b
openai-whisper-small-cuda-quantized-int4-weight-only Expired
270 MB
sha256:9a0a173d15888a1c5e54f76e2bc9c98b074ccfd08d44b6ec3cecd9fb3dcc212c