This repository was archived by the owner on Aug 15, 2025. It is now read-only.

aarch64: cherrypick onednn pr#1768 to improve torch.compile performance #1728

Open
snadampal wants to merge 1 commit into pytorch:release/2.3 from snadampal:onednn_cherrypick_torch2.3


@snadampal
Contributor

This improves BERT base torch.compile performance by 5.8x on an AWS c7g instance.

This is the same change as the one merged to the main branch: #1716

