[ET-VK][qconv] Add dynamic PACKED_INT8_CONV2D memory layout for device-adaptive conv2d #10245
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job (25m 13s)
Matrix: test-models-cuda
Artifacts
Produced during runtime
| Name | Status | Size | Digest |
|---|---|---|---|
| Qwen-Qwen3-0.6B-cuda-non-quantized | Expired | 1.1 GB | sha256:bf262bf1b5326708578d06b2f2cc5b9bd7a69bb4c590c5aae1c7494c9ebcbb2c |
| Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed | Expired | 559 MB | sha256:21e023e3216c6a97e1ae8ec0ee0fcc2eb20fac444c48bbe0c2a8a47bd0147140 |
| Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only | Expired | 1.1 GB | sha256:3230d622cff198b7045aedce46adbabb07f5b8f3e65514e76ffd0cbdc20aee70 |
| google-gemma-3-4b-it-cuda-non-quantized | Expired | 7.22 GB | sha256:54bd14bdfe43e5167fde7d1a6193c0adee12fe0b9cecd70ca64ac25ea155de85 |
| google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | Expired | 3.36 GB | sha256:ee1aa7de1fa5ccde3ffabaa6fd9195636d005812720bbf74d6ffec4de0ffdff6 |
| mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | Expired | 6.82 GB | sha256:2925d6cc8428df80ad480f897a9715665d4964ea25c49ddc1174a5c282249d6f |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | Expired | 2.8 GB | sha256:1812bb47fce884468c6c8858b5a04b10fd6705fc7d24183a5421dd5637fd1fe2 |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | Expired | 6.14 GB | sha256:f0c64a2191e7995248fde90c3f5a17089689d59cc3d7dd930933903e8e318d4f |
| nvidia-parakeet-tdt-cuda-non-quantized | Expired | 952 MB | sha256:4a0e76f98d276ff705595b2b40cef5ad623548c9e50f5b1be1c460acb9eaa8bd |
| nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | Expired | 443 MB | sha256:6506a03e2c65ef64c7cc2da318d55f89e206d5baa60c4a750ce6c7c8cb45c57d |
| nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | Expired | 430 MB | sha256:b1a106f8f3feeef8f91c1136418a495260b03c0bc5963d67e8b5593ef8922fd2 |
| openai-whisper-large-v3-turbo-cuda-non-quantized | Expired | 1.18 GB | sha256:6f31a748c217279c2b0b40d9b6c3afce2a8bbc6c0d8f3bb2810f129451768302 |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | Expired | 491 MB | sha256:128234e4e968cfc5e85f43b4a24fdc5e8c8d681f442e079c0b8abe792f825078 |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | Expired | 485 MB | sha256:c2c4aa619addcfc723510084153bd34e4aab8af0e0bb8d1bf5efd567152ce177 |
| openai-whisper-small-cuda-non-quantized | Expired | 361 MB | sha256:5d6644d61fc7b644300f13aebf6d9aa861c773e7bc25e867e1e01ca84c21c812 |
| openai-whisper-small-cuda-quantized-int4-tile-packed | Expired | 172 MB | sha256:1d1ae8712f0cc89c7036c1a63571583c98d988161f1872f33e301d653355877f |
| openai-whisper-small-cuda-quantized-int4-weight-only | Expired | 270 MB | sha256:7b2bda773770a61d59e76f17464ff4029443880a2cdbfb3f3d221df81faef7f9 |