Feature Description
Run a suite of test queries against an agent with pass/fail reporting.
Problem/Motivation
Testing agent changes manually is tedious. Need automated regression testing to ensure agents work correctly.
Proposed Solution
chat_loop myagent --test-suite tests/agent_tests.yaml
# tests/agent_tests.yaml:
# tests:
# - query: "What is 2+2?"
# expected_contains: "4"
# - query: "List Python frameworks"
# expected_contains: ["Django", "Flask"]
#
# Output:
# ✓ Test 1/10 passed
# ✗ Test 2/10 failed: Expected 'Django' in response
Benefits
- CI/CD integration
- Regression testing
- Quality assurance
- Documents expected behavior
Priority
Feature Description
Run a suite of test queries against an agent with pass/fail reporting.
Problem/Motivation
Testing agent changes manually is tedious. Need automated regression testing to ensure agents work correctly.
Proposed Solution
Benefits
Priority