Skip to content

fix(agent): retry transient provider throttles#319

Merged
JaimeCernuda merged 1 commit into
developfrom
fix/transient-provider-backoff-20260524
May 24, 2026
Merged

fix(agent): retry transient provider throttles#319
JaimeCernuda merged 1 commit into
developfrom
fix/transient-provider-backoff-20260524

Conversation

@JaimeCernuda
Copy link
Copy Markdown
Collaborator

Summary

  • add bounded transient-provider backoff around planner and answer-synthesis calls
  • retry ALCF/LiteLLM rate-limit style failures without bypassing DSPy/LiteLLM
  • preserve existing structured routing/provider errors when retries are exhausted

Evidence

  • targeted live ALCF/Metis gpt-oss-120b memory follow-up returned error_info=null
  • follow-up used six Parquet tool calls and six analysis direct-tool handoff events
  • documents the broader hierarchical benchmark target in TASK.md for post-compaction continuity

Verification

  • uv run pytest tests/test_core/test_agent_planner.py -q -> 138 passed
  • uv run ruff check src/ tests/ scripts/run_demo_benchmark.py scripts/create_benchmark_data.py scripts/create_demo_data.py
  • uv run pytest tests/ -> 1158 passed, 37 skipped

@JaimeCernuda JaimeCernuda merged commit 3da27a6 into develop May 24, 2026
1 check failed
@JaimeCernuda JaimeCernuda deleted the fix/transient-provider-backoff-20260524 branch May 24, 2026 05:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant