[ET-VK][qconv] Pad weight_sums buffer to multiple-of-4 alignment #2458
cuda-windows.yml
on: pull_request
Matrix: export-model-cuda-windows-artifact
Artifacts
Produced during runtime
| Name | Status | Size | Digest |
|---|---|---|---|
| mistralai-Voxtral-Mini-3B-2507-cuda-windows-non-quantized | Expired | 6.82 GB | sha256:29fac422db37cb6908b2008536d94b3bdedaeb60a9a9bfa0e7737a123c95bb5c |
| mistralai-Voxtral-Mini-3B-2507-cuda-windows-quantized-int4-weight-only | Expired | 6.15 GB | sha256:556deefb4c57445271b11d09e7bf30a019dcdbb97117cb13691066abea8f3ae8 |
| nvidia-parakeet-tdt-cuda-windows-non-quantized | Expired | 954 MB | sha256:cc41016643f1858932d81f0a029a8a3b006b75031a5c8dc01781c1438da1b775 |
| nvidia-parakeet-tdt-cuda-windows-quantized-int4-weight-only | Expired | 432 MB | sha256:98e372ce7902a3ba1a5f9df64861569330ee8b3da030b41bccc2e215014d8be4 |