Pull requests: lightseekorg/tokenspeed
Pull requests list
perf(deepseek-v4): vectorize read_deepseek_v4_indexer_fp8_cache
#238
opened May 24, 2026 by
yuanqingz
feat(runtime): support multimodal VLM
high priority
#236
opened May 24, 2026 by
chenht2022
Contributor
1 task done
Use operand format signatures for kernel selection
#230
opened May 23, 2026 by
antiagainst
Member
[WIP] perf(eagle3): skip dead-position compute in draft catch-up step
#217
opened May 22, 2026 by
rjzhb
1 task done
feat(trtllm-MHA): support mixed prefill/decode batches
#176
opened May 18, 2026 by
rjzhb
4 tasks done
feat: support post-norm EAGLE + add speculative decoding docs
high priority
#174
opened May 17, 2026 by
Dogacel
perf(moe): triton biased grouped topk for deepseek-v3 routing
#171
opened May 17, 2026 by
roycho96
Contributor
perf: chunked-prefill prefix cache update for non-hybrid models
#22
opened May 7, 2026 by
LorrinWWW
Contributor
fix: wait per-layer on drafter KV pool during cpu cache loadback
#6
opened May 6, 2026 by
LorrinWWW
Contributor