Problem
The pipeline has unit tests for individual modules but no integration test that exercises the full classify_extract.py entry point against a known input. There is no automated check that the pipeline produces valid output end-to-end, or that failure modes (e.g. a not-useful PDF) are handled correctly.
Tasks
Context
Commitment carried over from the Sprint 10 retrospective: every pipeline PR should include an integration test covering a failure mode.
Problem
The pipeline has unit tests for individual modules but no integration test that exercises the full
classify_extract.pyentry point against a known input. There is no automated check that the pipeline produces valid output end-to-end, or that failure modes (e.g. a not-useful PDF) are handled correctly.Tasks
PredatorDietMetrics)Context
Commitment carried over from the Sprint 10 retrospective: every pipeline PR should include an integration test covering a failure mode.