Remove unnecessary cuda sync for better perf #9383

Triggered via pull request on February 20, 2026 at 07:42
Status: Success
Total duration: 1h 13m 43s
Artifacts: 14

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job (25m 3s)
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds (2s)

Artifacts

Produced during runtime
Name | Status | Size | Digest
google-gemma-3-4b-it-cuda-non-quantized | Expired | 7.22 GB | sha256:37e37758f58fe79d1363c12787effb0d88a620b0dc616a7051946ca381bb7835
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | Expired | 3.36 GB | sha256:ac5f244a05187814a82ebde73196766f0ad3997f837f829916a58adf791c5fc3
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | Expired | 6.82 GB | sha256:a91e49bb5369cb7568e827e4ba230215711f366e5be021e699b6f01b42dccc55
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | Expired | 2.8 GB | sha256:d39bd037baf14dec39ff58b9de74f74ccaab832fb42c1b6870ad3b69fb63013f
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | Expired | 6.14 GB | sha256:6e2e62ad5572415756f5032cc0ad559826304073f038dfd7179d4ff67f41f663
nvidia-parakeet-tdt-cuda-non-quantized | Expired | 952 MB | sha256:f6b6baa62746b9295dbd26b1143591eab1c497dc9091000014f0a12af1efbe1e
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | Expired | 443 MB | sha256:6dbe15b6b2d312266a19aeaa2cd74fb0166c706f7dc52510756fff742a5db5ea
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | Expired | 430 MB | sha256:ba3745cb5522709cf7bbc312df5869ffbb3d1d5d9849f0ef873a5d09ab75d3fb
openai-whisper-large-v3-turbo-cuda-non-quantized | Expired | 1.18 GB | sha256:2ba2774ff88ec9d936e9818b307ff743d71383a18b7d0389f35d4b85bfc55d73
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | Expired | 491 MB | sha256:d66ac76b57865e215788874fe3f3f0b04ef7bb13bc8a453664f83b9bc6c88e34
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | Expired | 485 MB | sha256:4349981594c696c95d205d31ee72137e03526646c0dfb7072273c51039067a97
openai-whisper-small-cuda-non-quantized | Expired | 361 MB | sha256:b865166c95cda3c53699726ce3dbcb7a6d097d31355c3073276529efdc1520f3
openai-whisper-small-cuda-quantized-int4-tile-packed | Expired | 172 MB | sha256:6768baa0b79677da0c27a1f910f7264c59d698b7f579f305aa8bcee5d340d3b8
openai-whisper-small-cuda-quantized-int4-weight-only | Expired | 270 MB | sha256:e2978cca6a5a810f98933cf7a382d82fdbee77691ab76b9623e1003c0cea6922