[MLX] Reduce physical footprint memory in RingBufferKVCache for chunked prefill #1673

Job Run time
32s
11m 19s
44m 19s
13m 11s
37m 22s
10m 6s
49m 40s
36m 45s
11m 48s
11m 6s
13m 7s
11m 52s
12m 11s
12m 38s
17m 58s
13m 21s
12m 26s
17m 54s
11m 6s
10m 48s
11m 37s
11m 52s
12m 4s
11m 36s
Total run time: 6h 46m 38s