Test CUDA Builds

[ET-VK][ez] Add AOT support for PackedInt8_4C1W dtype #8817

Sign in to view logs

Triggered via pull request February 13, 2026 03:57

SS-JIA

synchronize #17389

gh/SS-JIA/420/head

Status Success

Total duration 1h 20m 21s

Artifacts 14

cuda.yml

on: pull_request

Matrix: export-model-cuda-artifact

Matrix: test-cuda-builds

unittest-cuda / linux-job

Matrix: test-models-cuda

Matrix: test-cuda-pybind

Matrix: test-model-cuda-e2e

check-all-cuda-builds

Artifacts

Produced during runtime

Name	Size	Digest
google-gemma-3-4b-it-cuda-non-quantized Expired	7.22 GB	`sha256:023ab4b1634149eea663fa693d63175ec70c1514b35a4dd508797d6a5d815dda`
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired	3.36 GB	`sha256:75961e0c250bf18f0410809e048e0ceea81195c7e70c77765f251922dd9ebb73`
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired	6.82 GB	`sha256:06b86e04ec5183d275dd08f0d3f3c8e5e849adca6e94862750c12350826e33f1`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired	2.8 GB	`sha256:b8f4873c60f46553cf638528bfc2c304ccf3e68a72e0d01c8dfba4c08b93197f`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired	6.14 GB	`sha256:5e8a0e20adbb1ba9b1da80ef24845a88bedc1b0598c0750dba757cce6b52588e`
nvidia-parakeet-tdt-cuda-non-quantized Expired	952 MB	`sha256:1e998f088e1cd767f4f2de8afa80cb694284a6e7818375cd7ed4704f6e15f750`
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed Expired	443 MB	`sha256:54ad0fe55ec57af9b4a39ad28720b0a8c75306e027968b6045ac3eeb9251be03`
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only Expired	430 MB	`sha256:707e707f860d229c2745527a3dc80d02127f8df3640381400dac9001f88bb01c`
openai-whisper-large-v3-turbo-cuda-non-quantized Expired	1.18 GB	`sha256:6b0040fe5955aeb8b95544ec55eee2f7d08f79d5118740169d621db7433b1696`
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed Expired	491 MB	`sha256:98315f967a9f308aea74782015e5eab9c9f64674aa9b6c090fe15326d626c82f`
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only Expired	485 MB	`sha256:5a142217f98ac37ccb9f7dbf1a7c4a615808b0b9f2042a94ea836ecd30d1ebd8`
openai-whisper-small-cuda-non-quantized Expired	361 MB	`sha256:b14468c3c6fd247594aca0de4aa148dca459fa2fdbda47d2b2fba5008f8d2ead`
openai-whisper-small-cuda-quantized-int4-tile-packed Expired	172 MB	`sha256:43fe16aac8aba9d9f6d9fcd79828036fb0c5a9a0d2ad92694706fc3a68824c4b`
openai-whisper-small-cuda-quantized-int4-weight-only Expired	270 MB	`sha256:9a0a173d15888a1c5e54f76e2bc9c98b074ccfd08d44b6ec3cecd9fb3dcc212c`