Guard KV cache against page-cache pressure by Ghatage · Pull Request #134 · antirez/ds4

Ghatage · 2026-05-14T04:13:17Z

The disk KV cache uses plain read/write to avoid mapping more VM into a process that already maps a large GGUF, but the bytes still land in the Linux page cache where they compete with the mmapped weights for resident memory. A 30k-token cold save leaves hundreds of MiB sitting in Cached: after the save returns, exactly the RAM pressure the no-mmap decision was meant to avoid.

Hint the kernel to invalidate those pages with
posix_fadvise(POSIX_FADV_DONTNEED) right after each full payload write and read. Header-only scans are deliberately untouched, since repeated small header reads still benefit from page-cache reuse. Expose DS4_KV_KEEP_PAGES=1 as an escape hatch for diagnostic comparisons, mirroring the existing cuda_model_drop_file_pages and its DS4_CUDA_KEEP_MODEL_PAGES toggle.

Correctness: make test; ./ds4_test --server.
Two new mincore-based tests assert that resident pages of a 4 MiB temp file drop from ~all to under 25% after the hint, and that DS4_KV_KEEP_PAGES=1 keeps them in place.

The disk KV cache uses plain read/write to avoid mapping more VM into a process that already maps a large GGUF, but the bytes still land in the Linux page cache where they compete with the mmapped weights for resident memory. A 30k-token cold save leaves hundreds of MiB sitting in Cached: after the save returns, exactly the RAM pressure the no-mmap decision was meant to avoid. Hint the kernel to invalidate those pages with posix_fadvise(POSIX_FADV_DONTNEED) right after each full payload write and read. Header-only scans are deliberately untouched, since repeated small header reads still benefit from page-cache reuse. Expose DS4_KV_KEEP_PAGES=1 as an escape hatch for diagnostic comparisons, mirroring the existing cuda_model_drop_file_pages and its DS4_CUDA_KEEP_MODEL_PAGES toggle. Correctness: make test; ./ds4_test --server. Two new mincore-based tests assert that resident pages of a 4 MiB temp file drop from ~all to under 25% after the hint, and that DS4_KV_KEEP_PAGES=1 keeps them in place.

antirez · 2026-05-14T10:05:29Z

Like this. Will take care.

antirez added kv-cache no-brainer 💃 labels May 14, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Guard KV cache against page-cache pressure#134

Guard KV cache against page-cache pressure#134
Ghatage wants to merge 1 commit into
antirez:mainfrom
Ghatage:guardPageCache

Ghatage commented May 14, 2026

Uh oh!

antirez commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Ghatage commented May 14, 2026

Uh oh!

antirez commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants