Skip to content

Pull requests: hw-native-sys/pypto-serving

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add native Qwen3 14B A8W8 serving path
#48 opened Jun 29, 2026 by vegetabledoww Loading…
feat: Add DeepSeek V4 cache support
#42 opened Jun 25, 2026 by superxf Contributor Draft
Add DeepSeek V4 serving integration
#40 opened Jun 23, 2026 by ndleslx Collaborator Draft
Add radix prefix cache backend
#39 opened Jun 18, 2026 by zmnobug Loading…
feat: add v1 parallel serving strategy support
#37 opened Jun 17, 2026 by ndleslx Collaborator Loading…
Add KV cache CPU offload support
#35 opened Jun 15, 2026 by superxf Contributor Loading…
Add KV cache CPU offload support
#31 opened Jun 9, 2026 by superxf Contributor Loading…
Rewrite non-L3 Qwen3 kernels through L3 worker
#22 opened Jun 3, 2026 by ndleslx Collaborator Loading…
ProTip! Follow long discussions with comments:>50.