Skip to content

server: add KV cache quantization flags (--kv-bits, --kv-group-size, --quantized-kv-start)#1353

Open
soobrosa wants to merge 4 commits into
ml-explore:mainfrom
soobrosa:feat/server-kv-cache-quant
Open

server: add KV cache quantization flags (--kv-bits, --kv-group-size, --quantized-kv-start)#1353
soobrosa wants to merge 4 commits into
ml-explore:mainfrom
soobrosa:feat/server-kv-cache-quant

Merge remote-tracking branch 'upstream/main' into feat/server-kv-cach…

e5585c4
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs