Pull requests: hw-native-sys/pypto-serving
Pull requests list
Qwen3-14B serving: device-side embedding and sampling
#47
opened Jun 29, 2026 by
sunkaixuan2018
Qwen3-14B serving: dynamic KV sizing, vLLM-style admission, chunked p…
#45
opened Jun 26, 2026 by
sunghajung6688
feat: add v1 parallel serving strategy support
#37
opened Jun 17, 2026 by
ndleslx
Collaborator
feat: add platform skeleton — Engine, Module base, and channel primitives
#34
opened Jun 12, 2026 by
lterrac
Collaborator
Add TurboQuant KV cache compression to serving pipeline
#33
opened Jun 11, 2026 by
sunghajung6688
Rewrite non-L3 Qwen3 kernels through L3 worker
#22
opened Jun 3, 2026 by
ndleslx
Collaborator