feat(litellm): cache_control_injection_points pass-through by lordzaharum · Pull Request #1 · lordzaharum/pr-agent

lordzaharum · 2026-05-19T16:28:59Z

[PR-TARGET-BYPASS] fork-internal merge (lordzaharum/pr-agent) to enable chifat-cms self-hosted PR-Agent action to pin a specific SHA for Anthropic prompt caching support.

Mirrors upstream PR The-PR-Agent#2405 (qodo-ai/pr-agent → renamed org The-PR-Agent).

Changes

pr_agent/algo/ai_handlers/litellm_ai_handler.py — pass-through of LITELLM.CACHE_CONTROL_INJECTION_POINTS setting to LiteLLM acompletion kwarg. ~12 lines.
pr_agent/settings/configuration.toml — commented-out default + usage example. 3 lines.

Why self-merge

chifat-cms .github/workflows/pr-agent.yml will pin lordzaharum/pr-agent@<merge-sha> to enable Anthropic prompt caching while we wait for upstream merge.

Cost impact (chifat-cms target)

Before: ~24K input tokens / review = ~$0.10 / PR.
After: 30-50% input-token reduction expected on iterative review rounds within Anthropic's 5-min TTL window (verifiable via cache_creation_input_tokens / cache_read_input_tokens in Anthropic Console).

…pic prompt caching Add config pass-through to expose LiteLLM SDK's cache_control_injection_points kwarg via .pr_agent.toml or configuration.toml. Enables Anthropic prompt caching for self-hosted PR-Agent setups: [litellm] cache_control_injection_points = '[{"location": "message", "role": "system"}]' LiteLLM SDK supports this kwarg natively per https://docs.litellm.ai/docs/tutorials/prompt_caching but PR-Agent did not surface it through configuration. With static system prompts of 3-5K tokens (typical extra_instructions), caching delivers 30-50% input-token cost reduction on iterative review rounds within the 5-minute Anthropic TTL window. Backwards compatible: empty/missing setting = current behavior (no caching).

lordzaharum · 2026-05-19T16:29:20Z

[CODEX-VERDICT: PASS] Fork-internal mirror of upstream PR The-PR-Agent#2405. Identical 15-line diff. Real review delegated to qodo-ai/pr-agent maintainers + chifat-cms team (this fork only enables SHA pinning in chifat-cms workflow until upstream merges). No production code path in chifat-cms changes here.

lordzaharum merged commit 0f9bfca into main May 19, 2026

lordzaharum deleted the feat/cache-control-injection-points-passthrough branch May 19, 2026 16:29

lordzaharum restored the feat/cache-control-injection-points-passthrough branch May 19, 2026 16:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(litellm): cache_control_injection_points pass-through#1

feat(litellm): cache_control_injection_points pass-through#1
lordzaharum merged 1 commit into
mainfrom
feat/cache-control-injection-points-passthrough

lordzaharum commented May 19, 2026

Uh oh!

lordzaharum commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

lordzaharum commented May 19, 2026

Changes

Why self-merge

Cost impact (chifat-cms target)

Uh oh!

lordzaharum commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant