Skip to content

[FEATURE] Add --test-suite flag for automated testing #14

@jwesleye

Description

@jwesleye

Feature Description

Run a suite of test queries against an agent with pass/fail reporting.

Problem/Motivation

Testing agent changes manually is tedious. Need automated regression testing to ensure agents work correctly.

Proposed Solution

chat_loop myagent --test-suite tests/agent_tests.yaml

# tests/agent_tests.yaml:
# tests:
#   - query: "What is 2+2?"
#     expected_contains: "4"
#   - query: "List Python frameworks"
#     expected_contains: ["Django", "Flask"]
#
# Output:
# ✓ Test 1/10 passed
# ✗ Test 2/10 failed: Expected 'Django' in response

Benefits

  • CI/CD integration
  • Regression testing
  • Quality assurance
  • Documents expected behavior

Priority

  • Critical
  • High
  • Medium
  • Low

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requeststatus: staleNo activity for extended period

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions