Skip to content

[MLX] Reduce physical footprint memory in RingBufferKVCache for chunked prefill #1660

[MLX] Reduce physical footprint memory in RingBufferKVCache for chunked prefill

[MLX] Reduce physical footprint memory in RingBufferKVCache for chunked prefill #1660

Job Run time
31s
38m 24s
58m 55s
38m 16s
14m 17s
10m 1s
38m 13s
10m 49s
12m 54s
9m 41s
12m 37s
11m 47s
11m 57s
10m 37s
19m 1s
11m 45s
10m 54s
10m 57s
10m 28s
11m 1s
16m 13s
11m 56s
11m 17s
10m 45s
6h 43m 16s