Add agent scaling, routing, and deployment guidance #585

Draft
pstreef wants to merge 4 commits into main from pstreef/agent-config-docs-gaps

pstreef commented Mar 31, 2026

Problem

Several customers have repeatedly asked how many agents to deploy, how traffic is distributed across them, whether agents can be split by responsibility, and whether to run them on VMs or Kubernetes. The existing docs cover the basic mechanics but lack operational guidance.

Solution

agent-config.md — expand the scaling section with:

  • Sizing rule of thumb (1 agent per 20K repos) with per-agent resource recommendations
  • Traffic routing summary with link to the routing reference
  • Deployment environment guidance (VMs recommended, K8s caveats)
  • Proxy cross-reference in troubleshooting
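
To make the sizing rule of thumb concrete, here is a minimal sketch of the arithmetic. The function name and the floor of one agent are assumptions for illustration; only the 20K-repos-per-agent ratio comes from this PR.

```python
import math

# Rule of thumb from this PR: 1 agent per 20K repos.
REPOS_PER_AGENT = 20_000


def agents_needed(repo_count: int, repos_per_agent: int = REPOS_PER_AGENT) -> int:
    """Estimate the agent count for a given number of repositories.

    Hypothetical helper: rounds up, and assumes at least one agent
    is always deployed (that floor is not stated in the PR).
    """
    if repo_count < 0:
        raise ValueError("repo_count must be non-negative")
    return max(1, math.ceil(repo_count / repos_per_agent))


if __name__ == "__main__":
    for repos in (5_000, 20_000, 20_001, 50_000):
        print(repos, "repos ->", agents_needed(repos), "agent(s)")
```

This is a starting estimate only; the per-agent resource recommendations in agent-config.md still apply to each agent it counts.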

routing-requests-to-agents.md — add three new sections:

  • How routing works (filter → shuffle → try, 10s cache refresh)
  • Operation routing table (what goes through agents vs SaaS-side)
  • Capability splitting guide with common pitfall
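
The "filter → shuffle → try" flow with a 10s cache refresh can be sketched roughly as follows. This is an illustrative model, not the product's implementation: `Agent`, `AgentCache`, `route`, and the `send` callback are hypothetical names, and a later commit in this PR describes the same behavior to customers as "round-robin".

```python
import random
import time
from dataclasses import dataclass
from typing import Callable, Optional

CACHE_TTL_SECONDS = 10.0  # "10s cache refresh" from the PR description


@dataclass(frozen=True)
class Agent:
    """Hypothetical agent record: a name plus the capabilities it advertises."""
    name: str
    capabilities: frozenset[str]


class AgentCache:
    """Caches the known-agent list and refetches it once the TTL has elapsed."""

    def __init__(self, fetch: Callable[[], list[Agent]],
                 clock: Callable[[], float] = time.monotonic):
        self._fetch = fetch
        self._clock = clock
        self._agents: list[Agent] = []
        self._fetched_at: Optional[float] = None

    def agents(self) -> list[Agent]:
        now = self._clock()
        if self._fetched_at is None or now - self._fetched_at >= CACHE_TTL_SECONDS:
            self._agents = self._fetch()
            self._fetched_at = now
        return self._agents


def route(capability: str, cache: AgentCache,
          send: Callable[[Agent], bool], rng=random) -> str:
    """Filter agents by capability, shuffle them, then try each until one succeeds."""
    # Filter: keep only agents that advertise the requested capability.
    candidates = [a for a in cache.agents() if capability in a.capabilities]
    # Shuffle: spread load across the matching agents.
    rng.shuffle(candidates)
    # Try: fall through to the next candidate when a send fails.
    for agent in candidates:
        if send(agent):
            return agent.name
    raise LookupError(f"no agent could handle {capability!r}")
```

With capability splitting, the filter step is what keeps a request off agents that do not advertise the capability; in this sketch a request with no matching agent raises `LookupError`.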

Expand agent configuration docs with sizing rules of thumb, per-agent
resource recommendations, deployment environment guidance (VMs vs K8s),
and proxy cross-references. Expand the routing reference with detailed
routing mechanics, operation routing table, capability splitting guide,
and common pitfalls.
- Reword misleading "recipe runs queue" bullet to clarify agents don't
  run recipes
- Remove unsupported "4-6 VMs" claim that conflicts with sizing guidance
- Add downtime tradeoff note to Recreate strategy recommendation
pstreef marked this pull request as draft on March 31, 2026, 09:11
pstreef added 2 commits on April 2, 2026, 09:55
When multiple agents configure the same tool (e.g. same GitHub URL),
their credentials must be identical because requests shuffle across
matching agents. A multi-step OAuth flow can span agents and will fail
if credentials differ.
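
The identical-credentials requirement above lends itself to a pre-deployment check. The sketch below assumes a hypothetical in-memory shape (agent name mapped to tool URL mapped to a credential identifier); it does not reflect the real agent configuration format.

```python
from collections import defaultdict


def find_credential_mismatches(
    agent_tools: dict[str, dict[str, str]],
) -> dict[str, list[str]]:
    """Return tool URLs whose credentials differ across agents.

    agent_tools is a hypothetical mapping:
        agent name -> {tool URL -> credential identifier}
    The result maps each conflicting tool URL to the sorted agent names
    that configure it.
    """
    by_tool: dict[str, dict[str, str]] = defaultdict(dict)
    for agent, tools in agent_tools.items():
        for url, credential in tools.items():
            by_tool[url][agent] = credential
    return {
        url: sorted(creds)
        for url, creds in by_tool.items()
        if len(set(creds.values())) > 1  # more than one distinct credential
    }


if __name__ == "__main__":
    config = {
        "agent-1": {"https://github.com": "oauth-client-A"},
        "agent-2": {"https://github.com": "oauth-client-B"},
    }
    print(find_credential_mismatches(config))
```

Any URL this reports is one where a multi-step OAuth flow could start on one agent and fail on another.
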
- Replace "shuffle and try" with "round-robin" in routing docs
  (simpler mental model for customers)
- Consistent trailing periods on K8s bullet points