Skip to content

test: add end-to-end integration test for classify_extract.py #53

@SeanClay10

Description

@SeanClay10

Problem

The pipeline has unit tests for individual modules but no integration test that exercises the full classify_extract.py entry point against a known input. There is no automated check that the pipeline produces valid output end-to-end, or that failure modes (e.g. a not-useful PDF) are handled correctly.

Tasks

  • Add at least one end-to-end integration test using a fixture PDF or synthetic text file with known correct outputs
  • Assert that the output JSON passes schema validation (PredatorDietMetrics)
  • Cover at least one failure-mode path — e.g. a not-useful PDF is correctly rejected before extraction reaches the LLM
  • Ensure the test can run in CI without a live Ollama server (mock or stub the LLM call as needed)

Context

Commitment carried over from the Sprint 10 retrospective: every pipeline PR should include an integration test covering a failure mode.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions