Skip to content

[ET-VK][q8_ops] Add int8x4_buffer_to_nchw shader and refactor Int8x4Staging #9839

[ET-VK][q8_ops] Add int8x4_buffer_to_nchw shader and refactor Int8x4Staging

[ET-VK][q8_ops] Add int8x4_buffer_to_nchw shader and refactor Int8x4Staging #9839

Triggered via pull request February 25, 2026 15:27
Status Failure
Total duration 1h 21m 6s
Artifacts 14

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
25m 29s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
3s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Annotations

1 error and 4 warnings
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Process completed with exit code 1.
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Back off 21.635 seconds before retry.
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Failed to download action 'https://api.github.com/repos/aws-actions/amazon-ecr-login/tarball/062b18b96a7aff071d4dc91bc00c4c1a7945b076'. Error: Response status code does not indicate success: 502 (Bad Gateway). BEB4:15D778:728D86:1ECAE0D:699F2041
test-model-cuda-e2e (openai, whisper-large-v3-turbo, non-quantized) / linux-job
Back off 22.736 seconds before retry.
test-model-cuda-e2e (openai, whisper-large-v3-turbo, non-quantized) / linux-job
Failed to download action 'https://api.github.com/repos/actions/upload-artifact/tarball/ea165f8d65b6e75b540449e92b4886f43607fa02'. Error: Response status code does not indicate success: 502 (Bad Gateway). E560:6ADD1:6E6A36:1DABBC0:699F203F

Artifacts

Produced during runtime
Name Size Digest
google-gemma-3-4b-it-cuda-non-quantized Expired
7.22 GB
sha256:8091608c9ff6fcc7b24dcdc3e3700266af4d3385766e971b34e5119c4be74e1e
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired
3.36 GB
sha256:37981da54236309f7878be883e97285e7d1f097e8cb8621d4d3d3eec70c5106f
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired
6.82 GB
sha256:ea3a6024bef1b41c159e30b12e7ad9a7de21200c69da027b67bc38f1a8606141
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired
2.8 GB
sha256:786e2ca2ab2e8226cd5f8c5c3b3a93f772efcefd6c021be26b75fd1385acd0c9
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired
6.14 GB
sha256:e76132c00201fa138d58c8b6d2a128e13ef0136ae097f9d79895cef1280bb0fa
nvidia-parakeet-tdt-cuda-non-quantized Expired
952 MB
sha256:3b59354873b07c110c629997b00950a4c8e87a22f6163033719f31515ecd04a9
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed Expired
443 MB
sha256:4f951a739502187c46f28a86436eba52b869360661ea68f0e3774e802e07fd31
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only Expired
430 MB
sha256:2abaa48b8377a411bf200a4ef947fcae203b25cc78ab22daa380b42d709eb1e6
openai-whisper-large-v3-turbo-cuda-non-quantized Expired
1.18 GB
sha256:98bbd48511ce39aa6ca45a1be464a944081705a4510d41612d46aeb57b0b5418
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed Expired
491 MB
sha256:e8e5eca93f6608629f9d55c147f0645b329bd55ae093177de09964f68fd0adb2
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only Expired
485 MB
sha256:b67d061852cbcc6d2ea4cf95fc39bf8d74ec3b28c80a66e090786c971f34b83e
openai-whisper-small-cuda-non-quantized Expired
361 MB
sha256:54ea708f5b2fbe500eab8565238f98310101fe948426b080f0bc77dc582b264f
openai-whisper-small-cuda-quantized-int4-tile-packed Expired
172 MB
sha256:6b73212e9170e97ece5f3fd6239202fd1797f0478b2dca504f84a60238139eab
openai-whisper-small-cuda-quantized-int4-weight-only Expired
271 MB
sha256:98ab3df037a193f7adf361f29f9fa09c67191c77d04fc65c62e99c85dcc71e9f