perf(provider): optimize anthropic prompt cache hit rate #18

Merged
bitifirefly merged 1 commit into develop from backport/upstream-20260322-batch4
Mar 22, 2026
Conversation

@bitifirefly

Summary

  • Backport the upstream LiteLLM prompt-cache optimization for Anthropic-compatible routes.
  • Improve _apply_cache_control to place two cache breakpoints:
    • on the system message
    • on the second-to-last message, so the conversation-history prefix is cached
  • Keep the existing tool cache-marker behavior (marker on the last tool definition).
  • Update tests to verify both breakpoints and that input messages are not mutated.

Validation

  • ruff check opencane/providers/litellm_provider.py tests/test_litellm_prompt_caching.py
  • pytest -q tests/test_litellm_prompt_caching.py tests/test_litellm_message_sanitize.py
  • pytest -q (full): 396 passed

@bitifirefly bitifirefly merged commit 388ad10 into develop Mar 22, 2026
2 checks passed