[Executorch][llm] Make mask tensor float only for sdpa#12142
Conversation
…ssful At the moment we continue execution and the stack fails later on as I found when running with quantize kv cache + ring attention Differential Revision: [D77516822](https://our.internmc.facebook.com/intern/diff/D77516822/) ghstack-source-id: 293635304 Pull Request resolved: #12129
Now that we support quantized sdpa query tensor can be quantized and attention mask can be float (the only type allowed). So this check doesnt make sense anymore. Differential Revision: [D77516821](https://our.internmc.facebook.com/intern/diff/D77516821/) ghstack-source-id: 293661338 Pull Request resolved: #12131
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12142
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 Cancelled Job, 1 Unrelated FailureAs of commit ad08b1c with merge base 3d90515 ( CANCELLED JOB - The following job was cancelled. Please retry:
BROKEN TRUNK - The following job failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This PR needs a
|
This PR was created by the merge bot to help merge the original PR into the main branch. ghstack PR number: pytorch#12131 by @kimishpatel ^ Please use this as the source of truth for the PR details, comments, and reviews ghstack PR base: https://github.com/pytorch/executorch/tree/gh/kimishpatel/195/base ghstack PR head: https://github.com/pytorch/executorch/tree/gh/kimishpatel/195/head Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/kimishpatel/194/orig Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/kimishpatel/195/orig @diff-train-skip-merge --------- Co-authored-by: Kimish Patel <kimishpatel@fb.com>
This PR was created by the merge bot to help merge the original PR into the main branch.
ghstack PR number: #12131 by @kimishpatel
^ Please use this as the source of truth for the PR details, comments, and reviews
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/kimishpatel/195/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/kimishpatel/195/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/kimishpatel/194/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/kimishpatel/195/orig
@diff-train-skip-merge