Skip to content

Implement context-length dependent KV-cache and Compute Buffer aware …#335

Merged
Nexesenex merged 1 commit into
Nexesenex:lcpp_pr_kv_aware_layer_distribfrom
borebot:kv-compute-buffer-cache-aware-allocation
Jul 4, 2025
Merged

Implement context-length dependent KV-cache and Compute Buffer aware …#335
Nexesenex merged 1 commit into
Nexesenex:lcpp_pr_kv_aware_layer_distribfrom
borebot:kv-compute-buffer-cache-aware-allocation