Remove unnecessary cuda sync for better perf #9383
Triggered via pull request, February 20, 2026 07:42
Status: Success
Total duration: 1h 13m 43s
Artifacts: 14
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job: 25m 3s
Matrix: test-models-cuda
Artifacts
Produced during runtime
| Name | Status | Size | Digest |
|---|---|---|---|
| google-gemma-3-4b-it-cuda-non-quantized | Expired | 7.22 GB | sha256:37e37758f58fe79d1363c12787effb0d88a620b0dc616a7051946ca381bb7835 |
| google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | Expired | 3.36 GB | sha256:ac5f244a05187814a82ebde73196766f0ad3997f837f829916a58adf791c5fc3 |
| mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | Expired | 6.82 GB | sha256:a91e49bb5369cb7568e827e4ba230215711f366e5be021e699b6f01b42dccc55 |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | Expired | 2.8 GB | sha256:d39bd037baf14dec39ff58b9de74f74ccaab832fb42c1b6870ad3b69fb63013f |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | Expired | 6.14 GB | sha256:6e2e62ad5572415756f5032cc0ad559826304073f038dfd7179d4ff67f41f663 |
| nvidia-parakeet-tdt-cuda-non-quantized | Expired | 952 MB | sha256:f6b6baa62746b9295dbd26b1143591eab1c497dc9091000014f0a12af1efbe1e |
| nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | Expired | 443 MB | sha256:6dbe15b6b2d312266a19aeaa2cd74fb0166c706f7dc52510756fff742a5db5ea |
| nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | Expired | 430 MB | sha256:ba3745cb5522709cf7bbc312df5869ffbb3d1d5d9849f0ef873a5d09ab75d3fb |
| openai-whisper-large-v3-turbo-cuda-non-quantized | Expired | 1.18 GB | sha256:2ba2774ff88ec9d936e9818b307ff743d71383a18b7d0389f35d4b85bfc55d73 |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | Expired | 491 MB | sha256:d66ac76b57865e215788874fe3f3f0b04ef7bb13bc8a453664f83b9bc6c88e34 |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | Expired | 485 MB | sha256:4349981594c696c95d205d31ee72137e03526646c0dfb7072273c51039067a97 |
| openai-whisper-small-cuda-non-quantized | Expired | 361 MB | sha256:b865166c95cda3c53699726ce3dbcb7a6d097d31355c3073276529efdc1520f3 |
| openai-whisper-small-cuda-quantized-int4-tile-packed | Expired | 172 MB | sha256:6768baa0b79677da0c27a1f910f7264c59d698b7f579f305aa8bcee5d340d3b8 |
| openai-whisper-small-cuda-quantized-int4-weight-only | Expired | 270 MB | sha256:e2978cca6a5a810f98933cf7a382d82fdbee77691ab76b9623e1003c0cea6922 |