Skip to content

Store MoE routing indices per-block in KVBlockAllocator for prefix caching sharing#2

Open
lmcafee-nvidia wants to merge 6 commits into
sidsingh-nvidia:siddharth/support-nemo-rl-router-replayfrom
lmcafee-nvidia:prefix-caching-router-record
Open

Store MoE routing indices per-block in KVBlockAllocator for prefix caching sharing#2
lmcafee-nvidia wants to merge 6 commits into
sidsingh-nvidia:siddharth/support-nemo-rl-router-replayfrom
lmcafee-nvidia:prefix-caching-router-record

Commits

Commits on Mar 19, 2026

Commits on Apr 6, 2026

Commits on Apr 7, 2026