Support multimethod in export_llama_lib #8895
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda
/
linux-job
25m 12s
Matrix: test-models-cuda
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
google-gemma-3-4b-it-cuda-non-quantized
Expired
|
7.22 GB |
sha256:5d8a2e0223c79d70534b6eeee3322c798aef533bf80e3aaf7d680be5b011d111
|
|
|
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
Expired
|
3.36 GB |
sha256:6600dcc9b9ff9a95c405140d1122af5b5cd1241c0b131bc0148171757cf337cd
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
Expired
|
6.82 GB |
sha256:bb465b6307d8ea8ecce7c11e88f36c3532e26f925f3bdf2dde7f9283a14a53d5
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
Expired
|
2.8 GB |
sha256:483d1b2864ba4fa3e2fe6ae7d8d181f5dd8049ed5ab1c6b937d62747ac2b49e7
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
Expired
|
6.14 GB |
sha256:f52bca7f36fa322c9cdb11780359f55a2ab7fced0769efa9b7c30c3b4c8ba2bf
|
|
|
nvidia-parakeet-tdt-cuda-non-quantized
Expired
|
952 MB |
sha256:5e7af23f7f131783a163c00d308c56395f14104ca57c836b95c82be8de0b73ad
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
Expired
|
443 MB |
sha256:2772ddc87bacf04d3f236e47ec39b8ae3ee9f7ddfc4ecbf3fef7726cc79ee7e6
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
Expired
|
430 MB |
sha256:de353801196de74d859549f7473bcfad6dd3070e813e6ebee050b80154da6776
|
|
|
openai-whisper-large-v3-turbo-cuda-non-quantized
Expired
|
1.18 GB |
sha256:d1dd47c2f4af251eff683fe36227408a11af14151e1669e1f53cd115365d5841
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
Expired
|
491 MB |
sha256:4de64cd777a529a0043e0df0334b71d9db939f5a92709c66e34bf372bb07a024
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
Expired
|
485 MB |
sha256:cddc861c0b86f328afd118d494e41a70d6e4e251eb0379f4d0a2d5f61b9160b4
|
|
|
openai-whisper-small-cuda-non-quantized
Expired
|
361 MB |
sha256:27a85bf764139a666637b20ebab40f31104f7b5655459c3fca424d6c0bb99c95
|
|
|
openai-whisper-small-cuda-quantized-int4-tile-packed
Expired
|
172 MB |
sha256:f3b4025af1e4d65f914b03ec8daf4a0657ae894f7aafe90508ea33aa273ccce0
|
|
|
openai-whisper-small-cuda-quantized-int4-weight-only
Expired
|
270 MB |
sha256:bf9b54afd36bf82a4c552d5b889bb8927dd5238866681cf8693d509a43310ef7
|
|