[ET-VK][qdq] Support high-dimensional tensors in quantize/dequantize per tensor #4066
cuda-windows.yml
on: pull_request
Matrix: export-model-cuda-windows-artifact
Matrix: test-model-cuda-windows-e2e
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
mistralai-Voxtral-Mini-3B-2507-cuda-windows-non-quantized
Expired
|
6.82 GB |
sha256:cbe2fa3061e272e618f22aa1f37e37d6505762fc42fd5bd03f402b3ebf420f13
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-windows-quantized-int4-weight-only
Expired
|
6.15 GB |
sha256:c0d8173b6159e0a7051d17c83ab8cca088ddcbcfd59104d34a0abbe8c7afd171
|
|
|
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-windows-quantized-int4-tile-packed
Expired
|
15.5 GB |
sha256:839fbab29e49b07d2a5d3968d47979faa25a1bdb65101c159a12260a2ee7b447
|
|
|
nvidia-diar_streaming_sortformer_4spk-v2-cuda-windows-non-quantized
Expired
|
437 MB |
sha256:519a8d58c1a3fd2ed168bf3044e1a65930406f564ce2e3407466f80b35146347
|
|
|
nvidia-parakeet-tdt-cuda-windows-non-quantized
Expired
|
954 MB |
sha256:55ce8baf55836d9bbd2bc91afd3fb477a41e357420595dd4ccec762212402aa4
|
|
|
nvidia-parakeet-tdt-cuda-windows-quantized-int4-weight-only
Expired
|
432 MB |
sha256:132f04e102f0c53347f4ce74be13905218c2cb440f5b7f7498c23ab1a366b827
|
|