[ET-VK] Insert prepack nodes for constant primary inputs of prepacking ops #10454

Triggered via pull request: March 4, 2026 16:29
Status: Success
Total duration: 1h 14m 43s
Artifacts: 17

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job (24m 44s)
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds (2s)
Artifacts

Produced during runtime
Name  Status  Size  Digest
Qwen-Qwen3-0.6B-cuda-non-quantized  Expired  1.1 GB  sha256:79be853610c3aec0b6d9ea147aa7225d16f54729216d458462b152f3882a50b4
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed  Expired  559 MB  sha256:33c2d8b888d6beb4bc01b05989a41e9f281ee4af795d922754d99b7023ea4bfa
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only  Expired  1.1 GB  sha256:d21e79441207e196efd5534dd1697551dc4e40f531a994cf34d7375963d7f5bc
google-gemma-3-4b-it-cuda-non-quantized  Expired  7.22 GB  sha256:31dd29ab0a54169892a4bdfc1a5f4dbc813b5677fc6195aead24f096cfc2e461
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed  Expired  3.36 GB  sha256:2de4c279c6eb9938853f63c761104e83deac790e0d94bf90224d9671f3315b93
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized  Expired  6.82 GB  sha256:a03e5fd6e75e2d1c19adeb032089af933724b47d861ee551afe7495938bed419
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed  Expired  2.8 GB  sha256:858d7505b058c1d6125a1e1345ce4c82d64f83be3add0087ac9ca15f09dff6dc
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only  Expired  6.14 GB  sha256:7cacdefcaf76b872be047120354b09dad449f677b6648895f6fa95b038e79c4f
nvidia-parakeet-tdt-cuda-non-quantized  Expired  952 MB  sha256:d6667fd922236b3a59b02fad3110da64f57ba32be800a0a31109249974bef835
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed  Expired  443 MB  sha256:25eb8c582572c48b9205284ed06e4b4750c8d3526641aad8359ab0dcac3e23cc
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only  Expired  430 MB  sha256:d8ebe89d7fa3caa56adcf5f49ef1577a5c581fde396dcb4b5a93b1d065a61ff9
openai-whisper-large-v3-turbo-cuda-non-quantized  Expired  1.18 GB  sha256:57dbdc9951aa585d65a87aa13ed164fdaf56094ea4b0a1366b4724f554f57d96
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed  Expired  491 MB  sha256:cc2c4dde25c6c323c6bacd801ade1d4a544584b75b61b15f2c3d431aec48cb15
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only  Expired  485 MB  sha256:c1e309017149b3bba2b337fd7db0a54f657599982e58d512249a873b14e47a6f
openai-whisper-small-cuda-non-quantized  Expired  361 MB  sha256:33f11dc015d55d38ae60dea5e067d1642b4fe80c9e275abd1656f56213a9a16c
openai-whisper-small-cuda-quantized-int4-tile-packed  Expired  172 MB  sha256:830a2a69fd399d4fb4b9ed4e8767b3461766f280f393f9732a7c83bf53f30d46
openai-whisper-small-cuda-quantized-int4-weight-only  Expired  270 MB  sha256:756cad8e23b328cadb51eab1cf5fda4b3c305e8c8e8bc9cba8de4ece3d9f10d6