[Executorch] Add quantized kv cache to oss ci#6997
Conversation
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6997
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit 42ad61a with merge base cd306d3 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) ghstack-source-id: 254679000 Pull Request resolved: #6997
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 254687228 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 254774011 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 254871264 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 254902439 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 254987269 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 255034795 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256616977 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256619056 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256626712 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256628952 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256637054 @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256710069 @exported-using-ghexport //unit-test-arm broken in trunk plus complains about internal failure @bypass-github-export-checks @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256711932 @exported-using-ghexport //unit-test-arm broken in trunk plus complains about internal failure @bypass-github-export-checks @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
Fixes to make sure quantized kv cache works in oss Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) [ghstack-poisoned]
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256718639 @exported-using-ghexport //unit-test-arm broken in trunk plus complains about internal failure @bypass-github-export-checks @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
|
This pull request was exported from Phabricator. Differential Revision: D66269487 |
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256718639 @exported-using-ghexport //unit-test-arm broken in trunk plus complains about internal failure @bypass-github-export-checks @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/)
Pull Request resolved: #6997 Fixes to make sure quantized kv cache works in oss ghstack-source-id: 256718639 @exported-using-ghexport //unit-test-arm broken in trunk plus complains about internal failure @bypass-github-export-checks @exported-using-ghexport Differential Revision: [D66269487](https://our.internmc.facebook.com/intern/diff/D66269487/) Co-authored-by: Kimish Patel <kimishpatel@fb.com>
Stack from ghstack (oldest at bottom):
Fixes to make sure quantized kv cache works in oss
Differential Revision: D66269487