Welcome to oMLX Discussions! #104
Replies: 4 comments, 2 replies
-
|
I saw that you already created a branch with speculative decoding. I hope it gets merged some day so we can see how it uses 8x batching with a draft model, and whether it can somehow match MoE models in PP and T/s. Fingers crossed 🤞 |
-
|
I've noticed that the |
-
|
Saving a spot in this thread to follow the expert's work. |
-
|
Low priority UX suggestions:
|
-
👋 Welcome!
We’re using Discussions as a place to connect with other members of our community. We hope that you:
build together 💪.
To get started, comment below with an introduction of yourself and tell us about what you do with this community.