ONNX Runtime QNN EP Quantization Support #422

brady-cherish · 2026-05-20T21:51:16Z

brady-cherish
May 20, 2026

I wanted to clarify the purpose of this repository after encountering it in the ONNX Runtime documentation for Quantization for the QNN EP here: https://onnxruntime.ai/docs/execution-providers/QNN-ExecutionProvider.html#generating-a-quantized-model-x64-only

It mentions the following:
"""
Install the ONNX Runtime x64 python package. (please note, you must use x64 package for quantizing the model. use the arm64 package for inferencing and utilizing the HTP/NPU)
python -m pip install onnxruntime-qnn
"""

Given the most recent onnxruntime-qnn releases do not include builds to support x86_64 ABIs, what versions should be used for quantization support for the QNN Execution Provider?
The unavailability of x86_64 packages seems to contradict the ONNX Runtime documentation for quantization:
"""
The quantization utilities are currently only supported on x86_64 due to issues installing the onnx package on ARM64
"""

Would it be possible to make the documentation for both ONNX QNN EP Quantization and this repository more clear about OS and version support for quantization?

Thank you!

yath1 · 2026-05-22T17:45:30Z

yath1
May 22, 2026
Collaborator

Hello @brady-cherish, thank you for starting the thread. Apologies, the documentation is probably not very clear about this. We will update it. EP can ingest a valid ONNX file with QDQs inserted in it. Do you already have a quantized model which you would like to try it through QNN EP or are you looking to quantize a model through our tool chain?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ONNX Runtime QNN EP Quantization Support #422

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

ONNX Runtime QNN EP Quantization Support #422

Uh oh!

Uh oh!

brady-cherish May 20, 2026

Replies: 1 comment

Uh oh!

yath1 May 22, 2026 Collaborator

brady-cherish
May 20, 2026

yath1
May 22, 2026
Collaborator