Skip to content

[043][Phase 3][US1e] Registry wiring + US1 end-to-end integration #752

@jwesleye

Description

@jwesleye

Scope

US1e — registry wiring + end-to-end integration. Final wiring for the MVP's US1 layer: unique-name validation, crate-root re-exports, and the end-to-end integration test that lifts the US1 acceptance gate.

Priority: P1 (MVP)

Tasks

  • T087 [US1] Extend EvaluatorRegistry::add to reject duplicate evaluator names with EvalError::DuplicateEvaluator (spec edge case)
  • T088 [US1] Update eval/src/lib.rs public re-exports covering every US1 surface under correct feature gates
  • T089 [US1] Integration test eval/tests/us1_end_to_end_test.rs: registry with one evaluator per family, wiremock judge with canned verdicts; assert per-evaluator score + non-empty reason + prompt_version recorded; non-applicable evaluators produce no entry

Acceptance

  • US1 acceptance scenario from spec.md passes end-to-end with MockJudge / wiremock.
  • cargo test -p swink-agent-eval --features all-evaluators is green.
  • Duplicate evaluator registration returns a structured error, not a panic (spec edge case).

References

  • Spec US1 acceptance scenarios 1–6
  • SC-001 (10-line happy path)

Depends on

#748, #749, #750, #751.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or requestevalin-progressAutomated agent is working on thisspecSpec-driven implementation task

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions