Skip to content

[ET-VK] Insert prepack nodes for constant primary inputs of prepacking ops #10502

[ET-VK] Insert prepack nodes for constant primary inputs of prepacking ops

[ET-VK] Insert prepack nodes for constant primary inputs of prepacking ops #10502

Triggered via pull request March 4, 2026 23:44
Status Success
Total duration 1h 12m 4s
Artifacts 17

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
25m 7s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
2s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
Qwen-Qwen3-0.6B-cuda-non-quantized Expired
1.1 GB
sha256:c9fb1e789340c3d3235855804aef116be354214aa20117623c41e1cea46970e8
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed Expired
559 MB
sha256:8d501a77df645e135c0a2882e74126a24b31a1b872c9f5d3df468baa88b974d4
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only Expired
1.1 GB
sha256:d0fd60c2222ef4681159613482bb6a8cabf568bd99065ddc41e1573951366444
google-gemma-3-4b-it-cuda-non-quantized Expired
7.22 GB
sha256:238df64a44d8787c87a65715433b9540b5029f6217af9c7e19a99f6d64e6e924
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired
3.36 GB
sha256:1c1a146c256eb80bfe3abf90230813a65084d21a03921d524beeaa7cac496936
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired
6.82 GB
sha256:a48e3fa5250aa2220c9901348a62c3379bd190bfa524dbf61dc871bdcb6710d2
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired
2.8 GB
sha256:a7a0b6cb992f6923c416ac7fabee3ae67099852a133c1517ffe90eb27a0b05ce
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired
6.14 GB
sha256:46bb269a59cd103ab8fdde56f46232da8c20b7efdddbc9242ce0de3d2a08cd5b
nvidia-parakeet-tdt-cuda-non-quantized Expired
952 MB
sha256:ded309ac97a03fbf2e115b861cc109d840a99ad4133b4629988939c9c30871d5
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed Expired
443 MB
sha256:38014029a0c822619f1207d73121f7d9f039af2c32e5a2d5fb09dc152e92cc2c
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only Expired
430 MB
sha256:d670dc82c7c8ca0b7c569b33c7cd07a0103f3cfd649e28533dda35c848447dec
openai-whisper-large-v3-turbo-cuda-non-quantized Expired
1.18 GB
sha256:738177eaaa604057245d39a23eee2c3f1fa9c7fb6e472737a0fe01dc8f17d0a0
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed Expired
491 MB
sha256:06b1c279e161c895a233c27e8bb63ddfca14739b7e926e1ecafec1551155f70e
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only Expired
485 MB
sha256:61736065ee32a6a0e6649e8d44d01096ada9e8608e0c718e61d4dde88bb508af
openai-whisper-small-cuda-non-quantized Expired
361 MB
sha256:552d223926e7db157fbe514acbf393400714560255cd5d32de99f39556087a3e
openai-whisper-small-cuda-quantized-int4-tile-packed Expired
172 MB
sha256:71a9798c7f3600ea4127a303706ad9ae6ebf6686d2fab36d8c25e049ea9d0221
openai-whisper-small-cuda-quantized-int4-weight-only Expired
270 MB
sha256:0efdc8dbe5101e7f2af070fa20b3b202b4b567dee21bcb4b78be7f778268b311