Skip to content

[Draft] iOS support #10864

[Draft] iOS support

[Draft] iOS support #10864

Triggered via pull request March 7, 2026 02:43
Status Success
Total duration 1h 22m 20s
Artifacts 18

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
27m 35s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
2s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
Qwen-Qwen3-0.6B-cuda-non-quantized Expired
1.1 GB
sha256:44d846a1ce966c05fa1296c0db64a18202b815cbfc567fe50e316d2aa21a0e9a
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed Expired
559 MB
sha256:110f19d674935bb3e51e153d1efdf68f48dc2163f9bd905fc7a6eb11734a8e06
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only Expired
1.1 GB
sha256:145d5a9f134b1b8ec7f967e8516b6f4a644f86a406e56b408314579bf2c88648
google-gemma-3-4b-it-cuda-non-quantized Expired
7.22 GB
sha256:de3ba5df5b246f632696b570c3d93ba22eef7cb2596c9d0489d52e705b8a5389
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired
3.36 GB
sha256:ce26049b57259fb632e96a3a16e8d82fdd5495a90ea2173ffe41396cfe813aeb
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired
6.82 GB
sha256:64773b55e8691b8284877bc1166057c07b47d38b47c7afe00ccaba0b83e14bdf
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired
2.8 GB
sha256:d708e467f75f454eda12a69281b12774774988c143a9e9a50c00d3594a2c24ce
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired
6.14 GB
sha256:09b155ca0419e726df366b836e4919ceba10a4208207c5f7c96ba70bcf75c6f7
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed Expired
15.6 GB
sha256:c1677bfa6deb19fa07cde12e1db01ca7c23908d1f50832f1af4613b38cdf6d27
nvidia-parakeet-tdt-cuda-non-quantized Expired
952 MB
sha256:76fa385b7aa94fe6e70512250daba6ee3f1afc63d9417de33b13c7f31061e723
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed Expired
443 MB
sha256:809ccb970f188ea23e88b23232659c282734c46b8b280c3b84d1ddc573127e8f
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only Expired
433 MB
sha256:ecaf5a8880b8c6d3761a5db9c2056a32e8316e4f4a9999f8b60c5c7e2c3c2f4a
openai-whisper-large-v3-turbo-cuda-non-quantized Expired
1.18 GB
sha256:83c3c4b9ac823c06c0b80b41420c3e8439ad9f2259609cf1873a292f434b6366
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed Expired
491 MB
sha256:0ac2e8a28c0f7543eb8fa3a75bdae3fa4eb42441c3ef9900707f06bc198773d8
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only Expired
485 MB
sha256:c735b3f876e8f59e4d19d81312d96a7543ea3e6533621eb4a4844c80a56d39d0
openai-whisper-small-cuda-non-quantized Expired
361 MB
sha256:7cf8625c2f7db8a7f309f74fbb4c34d9059cc4c7377d2779a8327e154ac7ca70
openai-whisper-small-cuda-quantized-int4-tile-packed Expired
172 MB
sha256:e58d9ea06c6d690807bdceb778d360712993e99d4e6af601d8f92a9cba42de9c
openai-whisper-small-cuda-quantized-int4-weight-only Expired
271 MB
sha256:023891f3cf1b5eab1809fb935f6f670a69fa500f2fb42080579017298c3cade1