[ET-VK][qconv] Read weight buffer as int in pack_q8_conv2d_weights shader #9875

Triggered via pull request February 25, 2026 19:54
Status Success
Total duration 1h 15m 31s
Artifacts 14

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job
25m 3s
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
3s

Annotations

2 warnings
test-models-cuda (sdpa) / linux-job
Back off 24.352 seconds before retry.
test-models-cuda (sdpa) / linux-job
Failed to download action 'https://api.github.com/repos/pytorch/pytorch/tarball/4d9088724ec3aff53bd70a19afca8dfd4b1a5657'. Error: Response status code does not indicate success: 502 (Bad Gateway). E7D0:351561:12E6918:50E5388:699F54D5

Artifacts

Produced during runtime
Name | Status | Size | Digest
google-gemma-3-4b-it-cuda-non-quantized | Expired | 7.22 GB | sha256:4a0bb07f7610a2fb150e890ec80dbc936f27002035b3a1a619543ed1925a5f48
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | Expired | 3.36 GB | sha256:fe22a53f0a10ec4a30e5c0f3a96ba37f44cd25530a0aa0d4e93fd46b5a35016c
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | Expired | 6.82 GB | sha256:47abc63b6e2b0b877799f8839a30ae36dfff9e76ce79f50ac350d0dc12426fb0
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | Expired | 2.8 GB | sha256:1aedde99be8e260d8549fb08eb7d92d5b356e8e7aa885a59af221bd691f57cec
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | Expired | 6.14 GB | sha256:2f47dd1cf9b883673fa8640cf22295f56ce1e3fe04bc365df402a46048bd0e11
nvidia-parakeet-tdt-cuda-non-quantized | Expired | 952 MB | sha256:20e08fbd275acd4949dc82e5289c615d0b71c82da73a875f48564145d6668536
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | Expired | 443 MB | sha256:8173d8c2715b7ce7572aca3ad10ea6056ef502964fb62dfd2b68daa6e0c0f143
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | Expired | 430 MB | sha256:5dc94d1661edf29ff804582a7f016e28a8c19dfb097436fb2616e106657bf56c
openai-whisper-large-v3-turbo-cuda-non-quantized | Expired | 1.18 GB | sha256:b085ca0c6fb6bb917c6ba05b5013ce01687dfea4eb6f2171bdcb32d4c73b6e54
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | Expired | 491 MB | sha256:4a9fc947afb0acc2551dd6e20cb102f5c286129f128a95aa3fb3690cd8e1f550
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | Expired | 485 MB | sha256:ad317d9a7aba2639aeda96e5667b1a8459070be7d5db141b28d8b85e29cc0e5a
openai-whisper-small-cuda-non-quantized | Expired | 361 MB | sha256:7eb4cd2e48a9f55d48550f702893c4cc2b53e368d849561a45c63d9d9dcdaae3
openai-whisper-small-cuda-quantized-int4-tile-packed | Expired | 172 MB | sha256:cf2c7678cd15e4eeb23b3ad215128adc08c83b5944cde530ae689d84f8a9db10
openai-whisper-small-cuda-quantized-int4-weight-only | Expired | 271 MB | sha256:28e1f978ba61b62a1899ac134b1b6616a605c5814f051dba458a47c0788fb155