[ET-VK][qlinear] Add bmm support to quantized linear pattern detector #11007
Triggered via pull request
March 10, 2026 08:54
Status
Success
Total duration
1h 28m 37s
Artifacts
19
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda
/
linux-job
28m 14s
Matrix: test-models-cuda
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
Qwen-Qwen3-0.6B-cuda-non-quantized
Expired
|
1.1 GB |
sha256:54369976ac53fa93d270c6a0be72c426a9dfcaa0f235ba2887985e6b76588891
|
|
|
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed
Expired
|
559 MB |
sha256:d61ae4ce9610fbfd2a2876328649f8261d1f03dd3a027401bbcf07525ceca0a3
|
|
|
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only
Expired
|
1.1 GB |
sha256:585d65d3044e9f31dd4f164aee2d982a56067f97907ad7de18b4aa201c673875
|
|
|
google-gemma-3-4b-it-cuda-non-quantized
Expired
|
7.22 GB |
sha256:74483fb40621dda700d160ca61c26f7558dc41cb6ffacdc94f4469ea2d5ca4f4
|
|
|
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
Expired
|
3.36 GB |
sha256:6d1f6e1673186c10bc1e542bf834369a5711157c6bbb7eab4e70ace523fddcc7
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
Expired
|
6.82 GB |
sha256:40e8e9f364894818dee9b0eba6d0c113ddaabd5ebe09bfeed3c25bcabe83d432
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
Expired
|
2.8 GB |
sha256:cb8c8f5112a7e42684413e3011e2201d0fcb5ced2ab5468874d92af9d9fda4e1
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
Expired
|
6.14 GB |
sha256:1c65ac8474395ec68f322ed25862ecebff139b4f5d5bb8cd8bc82f95db0727d2
|
|
|
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed
Expired
|
15.5 GB |
sha256:21f10863fd0ae4762a83f532e26a687e9b990ebe921e3e5c570bea2111a67b88
|
|
|
nvidia-diar_streaming_sortformer_4spk-v2-cuda-non-quantized
Expired
|
436 MB |
sha256:c07d38af195b7bc8764d0f0405899d5760c14da4b65f9c50e9d521001e600551
|
|
|
nvidia-parakeet-tdt-cuda-non-quantized
Expired
|
952 MB |
sha256:7f81852890332a8f69ca38603960a4504cffedb69399b86ca5562a21fb7a7dca
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
Expired
|
443 MB |
sha256:0a215cfa14e3d7a0e9ac61528a14cead765ebfc714d5d929972c8b77e4b3136d
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
Expired
|
430 MB |
sha256:7b91352bf40811fbe5cf9ad552947e85e3a920573790bfea7b722f7ac1c6da22
|
|
|
openai-whisper-large-v3-turbo-cuda-non-quantized
Expired
|
1.18 GB |
sha256:bfc517eb6484753b2f87037613b01fd5ce8d410338776169a3423c7c02fb424b
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
Expired
|
491 MB |
sha256:691e98154c7302e2e8256a3a8f09d02856cee7f14357d21eeff593f4e0522a55
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
Expired
|
485 MB |
sha256:ac24a99976fdeb51945fa8e408205f6673a1c04d15c91b7018fbe8caf119ae6a
|
|
|
openai-whisper-small-cuda-non-quantized
Expired
|
361 MB |
sha256:6698cb4ec01b554d845884a72271e9ee55b7bfb2827f863a4efceb58adcded23
|
|
|
openai-whisper-small-cuda-quantized-int4-tile-packed
Expired
|
172 MB |
sha256:0dd280c360099a01f063efc604f3ecd882e1938f63c9a7c48a63ce57a401f77d
|
|
|
openai-whisper-small-cuda-quantized-int4-weight-only
Expired
|
270 MB |
sha256:8845162f1056bc1be8c3a417606f93912b7f964e21caf0d127eeb910572691f0
|
|