[ET-VK][qlinear] Add bmm support to quantized linear pattern detector #11007

Triggered via pull request: March 10, 2026 08:54
Status: Success
Total duration: 1h 28m 37s
Artifacts: 19

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job: 28m 14s
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds: 2s

Artifacts

Produced during runtime
| Name | Status | Size | Digest |
| --- | --- | --- | --- |
| Qwen-Qwen3-0.6B-cuda-non-quantized | Expired | 1.1 GB | sha256:54369976ac53fa93d270c6a0be72c426a9dfcaa0f235ba2887985e6b76588891 |
| Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed | Expired | 559 MB | sha256:d61ae4ce9610fbfd2a2876328649f8261d1f03dd3a027401bbcf07525ceca0a3 |
| Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only | Expired | 1.1 GB | sha256:585d65d3044e9f31dd4f164aee2d982a56067f97907ad7de18b4aa201c673875 |
| google-gemma-3-4b-it-cuda-non-quantized | Expired | 7.22 GB | sha256:74483fb40621dda700d160ca61c26f7558dc41cb6ffacdc94f4469ea2d5ca4f4 |
| google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | Expired | 3.36 GB | sha256:6d1f6e1673186c10bc1e542bf834369a5711157c6bbb7eab4e70ace523fddcc7 |
| mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | Expired | 6.82 GB | sha256:40e8e9f364894818dee9b0eba6d0c113ddaabd5ebe09bfeed3c25bcabe83d432 |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | Expired | 2.8 GB | sha256:cb8c8f5112a7e42684413e3011e2201d0fcb5ced2ab5468874d92af9d9fda4e1 |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | Expired | 6.14 GB | sha256:1c65ac8474395ec68f322ed25862ecebff139b4f5d5bb8cd8bc82f95db0727d2 |
| mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed | Expired | 15.5 GB | sha256:21f10863fd0ae4762a83f532e26a687e9b990ebe921e3e5c570bea2111a67b88 |
| nvidia-diar_streaming_sortformer_4spk-v2-cuda-non-quantized | Expired | 436 MB | sha256:c07d38af195b7bc8764d0f0405899d5760c14da4b65f9c50e9d521001e600551 |
| nvidia-parakeet-tdt-cuda-non-quantized | Expired | 952 MB | sha256:7f81852890332a8f69ca38603960a4504cffedb69399b86ca5562a21fb7a7dca |
| nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | Expired | 443 MB | sha256:0a215cfa14e3d7a0e9ac61528a14cead765ebfc714d5d929972c8b77e4b3136d |
| nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | Expired | 430 MB | sha256:7b91352bf40811fbe5cf9ad552947e85e3a920573790bfea7b722f7ac1c6da22 |
| openai-whisper-large-v3-turbo-cuda-non-quantized | Expired | 1.18 GB | sha256:bfc517eb6484753b2f87037613b01fd5ce8d410338776169a3423c7c02fb424b |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | Expired | 491 MB | sha256:691e98154c7302e2e8256a3a8f09d02856cee7f14357d21eeff593f4e0522a55 |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | Expired | 485 MB | sha256:ac24a99976fdeb51945fa8e408205f6673a1c04d15c91b7018fbe8caf119ae6a |
| openai-whisper-small-cuda-non-quantized | Expired | 361 MB | sha256:6698cb4ec01b554d845884a72271e9ee55b7bfb2827f863a4efceb58adcded23 |
| openai-whisper-small-cuda-quantized-int4-tile-packed | Expired | 172 MB | sha256:0dd280c360099a01f063efc604f3ecd882e1938f63c9a7c48a63ce57a401f77d |
| openai-whisper-small-cuda-quantized-int4-weight-only | Expired | 270 MB | sha256:8845162f1056bc1be8c3a417606f93912b7f964e21caf0d127eeb910572691f0 |