[ET-VK][Ops] affine quantization operators registration by pytorchbot · Pull Request #12440 · pytorch/executorch

pytorchbot · 2025-07-14T14:59:27Z

This PR was created by the merge bot to help merge the original PR into the main branch.
ghstack PR number: #12369 by @ahmtox
^ Please use this as the source of truth for the PR details, comments, and reviews
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/ahmtox/39/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/ahmtox/39/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/ahmtox/38/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/ahmtox/39/orig
@diff-train-skip-merge

pytorch-bot · 2025-07-14T14:59:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12440

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Pull Request resolved: #12369 # Context In order to enable dynamic quantization, especially for the source transform method using `Int8DynActInt4WeightQuantizer` we need to have vulkan versions for `quantize_affine`, `dequantize_affine`, and `choose_qparams_affine`. Currently we do not have a shader that performs block-based quantization as expected from these shaders, so we delegate to the per_tensor variant just to get unblocked. At a later stage, this will likely be developed more on in order to ensure we don't get too much accuracy loss. A full implementation for the affine operators will be done at a later time, since they are required for usage. However, if you wan't to just use the default as per_tensor then you must remove the checks made in `op_registry` and in the vulkan implementation so that the per_tensor version can be used. Without it they will not be registered. # Changes This creates a schema reference in the TorchAO library for out variants of these respective operators. Then there is a VK_REGISTER_OP done on them to ensure that we can properly register them when lowering the ET model with vulkan. We also changed `Linear.cpp`, particularly to allow a passthrough for weight_data since during dynamic quantization it's possible that it'll be a tensor_data than tensor_ref. ghstack-source-id: 295972790 @exported-using-ghexport Differential Revision: [D78035354](https://our.internmc.facebook.com/intern/diff/D78035354/)

github-actions · 2025-07-14T17:10:59Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

pytorchbot requested a review from SS-JIA as a code owner July 14, 2025 14:59

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 14, 2025

SS-JIA changed the base branch from gh/ahmtox/38/orig to main July 14, 2025 16:04

SS-JIA requested review from JacobSzwejbka, manuelcandales and swolchok as code owners July 14, 2025 16:04

SS-JIA force-pushed the gh/ahmtox/39/orig branch from decbbad to 642bed8 Compare July 14, 2025 17:10

ahmtox merged commit 0dd42c4 into main Jul 14, 2025
93 of 95 checks passed

ahmtox deleted the gh/ahmtox/39/orig branch July 14, 2025 17:11

SS-JIA approved these changes Jul 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ET-VK][Ops] affine quantization operators registration#12440

[ET-VK][Ops] affine quantization operators registration#12440
ahmtox merged 1 commit into
mainfrom
gh/ahmtox/39/orig

pytorchbot commented Jul 14, 2025

Uh oh!

pytorch-bot Bot commented Jul 14, 2025 •

edited

Loading

Uh oh!

github-actions Bot commented Jul 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

pytorchbot commented Jul 14, 2025

Uh oh!

pytorch-bot Bot commented Jul 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12440

Uh oh!

github-actions Bot commented Jul 14, 2025

This PR needs a release notes: label

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pytorch-bot Bot commented Jul 14, 2025 •

edited

Loading

This PR needs a `release notes:` label