Support multimethod in export_llama_lib #8895

Triggered via pull request on February 13, 2026 at 21:20
Status: Success
Total duration: 1h 30m 7s
Artifacts: 14

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job: 25m 12s
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds: 4s

Artifacts

Produced during runtime
Name | Status | Size | Digest
google-gemma-3-4b-it-cuda-non-quantized | Expired | 7.22 GB | sha256:5d8a2e0223c79d70534b6eeee3322c798aef533bf80e3aaf7d680be5b011d111
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | Expired | 3.36 GB | sha256:6600dcc9b9ff9a95c405140d1122af5b5cd1241c0b131bc0148171757cf337cd
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | Expired | 6.82 GB | sha256:bb465b6307d8ea8ecce7c11e88f36c3532e26f925f3bdf2dde7f9283a14a53d5
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | Expired | 2.8 GB | sha256:483d1b2864ba4fa3e2fe6ae7d8d181f5dd8049ed5ab1c6b937d62747ac2b49e7
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | Expired | 6.14 GB | sha256:f52bca7f36fa322c9cdb11780359f55a2ab7fced0769efa9b7c30c3b4c8ba2bf
nvidia-parakeet-tdt-cuda-non-quantized | Expired | 952 MB | sha256:5e7af23f7f131783a163c00d308c56395f14104ca57c836b95c82be8de0b73ad
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | Expired | 443 MB | sha256:2772ddc87bacf04d3f236e47ec39b8ae3ee9f7ddfc4ecbf3fef7726cc79ee7e6
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | Expired | 430 MB | sha256:de353801196de74d859549f7473bcfad6dd3070e813e6ebee050b80154da6776
openai-whisper-large-v3-turbo-cuda-non-quantized | Expired | 1.18 GB | sha256:d1dd47c2f4af251eff683fe36227408a11af14151e1669e1f53cd115365d5841
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | Expired | 491 MB | sha256:4de64cd777a529a0043e0df0334b71d9db939f5a92709c66e34bf372bb07a024
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | Expired | 485 MB | sha256:cddc861c0b86f328afd118d494e41a70d6e4e251eb0379f4d0a2d5f61b9160b4
openai-whisper-small-cuda-non-quantized | Expired | 361 MB | sha256:27a85bf764139a666637b20ebab40f31104f7b5655459c3fca424d6c0bb99c95
openai-whisper-small-cuda-quantized-int4-tile-packed | Expired | 172 MB | sha256:f3b4025af1e4d65f914b03ec8daf4a0657ae894f7aafe90508ea33aa273ccce0
openai-whisper-small-cuda-quantized-int4-weight-only | Expired | 270 MB | sha256:bf9b54afd36bf82a4c552d5b889bb8927dd5238866681cf8693d509a43310ef7