Add end-to-end pipeline validation command #48

@natashaannn

Objective

Introduce a pipeline validation command that runs every stage end to end and verifies that the output is deterministic.


Why this matters

This is the first step toward production-grade reproducibility guarantees.


Parallelization metadata

Track

pipeline

Depends on

Safe to run in parallel with

  • remotion cleanup

Merge risk

  • medium

Files in scope

Primary:

  • scripts/pipeline/validate.ts

May touch:

  • runner.ts

Required implementation

1. Add validation runner

Execute full pipeline:

  • sync
  • transcribe
  • diarize
  • align
  • assign
  • merge
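
A minimal sketch of how the runner might sequence these stages. The stage names come from the list above; the `Stage` type, `StageResult` type, and `runPipeline` function are hypothetical names for illustration, not existing code in `runner.ts`. It also assumes each stage depends on the previous one, so the run stops at the first failure.

```typescript
// Stage names taken from this issue, in execution order.
const STAGE_NAMES = ["sync", "transcribe", "diarize", "align", "assign", "merge"] as const;

// Hypothetical shape: a named stage whose work fails by throwing.
type Stage = { name: string; run: () => Promise<void> };

type StageResult = { name: string; ok: boolean; error?: string };

// Runs the stages one after another and records the outcome of each.
// Assumption: later stages need the output of earlier ones, so we stop
// at the first failing stage instead of continuing.
async function runPipeline(stages: Stage[]): Promise<StageResult[]> {
  const results: StageResult[] = [];
  for (const stage of stages) {
    try {
      await stage.run();
      results.push({ name: stage.name, ok: true });
    } catch (err) {
      const message = err instanceof Error ? err.message : String(err);
      results.push({ name: stage.name, ok: false, error: message });
      break;
    }
  }
  return results;
}
```

The real stage implementations would be wired in where the `run` callbacks are supplied; the sketch only covers ordering and failure reporting.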

2. Determinism check

Hash each stage's artifacts and compare the hashes across repeated runs on the same input.
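
One possible way to compute the hashes, sketched with Node's built-in `crypto` module. SHA-256 is an assumption (the issue does not name a hash algorithm), and `hashContent`, `hashFile`, and `hashArtifacts` are hypothetical helper names. Only reads are performed, in line with the read-only constraint.

```typescript
import { createHash } from "node:crypto";
import { readFileSync } from "node:fs";

// Hex-encoded SHA-256 digest of some artifact content.
function hashContent(content: string | Uint8Array): string {
  return createHash("sha256").update(content).digest("hex");
}

// Hashes a file on disk without modifying it.
function hashFile(path: string): string {
  return hashContent(readFileSync(path));
}

// Maps each artifact name to the hash of its content.
// Keys are sorted so the resulting object is built in a stable order.
function hashArtifacts(artifacts: Record<string, string | Uint8Array>): Record<string, string> {
  const hashes: Record<string, string> = {};
  for (const name of Object.keys(artifacts).sort()) {
    hashes[name] = hashContent(artifacts[name]);
  }
  return hashes;
}
```

Two runs count as deterministic under this sketch when every artifact name maps to the same hash in both runs.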

3. Report diffs

Report which stages produced differing artifacts between runs.
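
A sketch of stage-level diff reporting, assuming each run has already been reduced to a map from stage name to artifact hash. The `StageHashes` and `StageDiff` types, the `diffRuns` and `formatReport` functions, and the report wording are all hypothetical.

```typescript
// Assumed input shape: stage name -> hash of that stage's artifact.
type StageHashes = Record<string, string>;

interface StageDiff {
  stage: string;
  first: string | undefined;  // hash from the first run, if the stage produced one
  second: string | undefined; // hash from the second run, if the stage produced one
}

// Lists every stage whose hash differs between two runs, including
// stages that appear in only one of them.
function diffRuns(first: StageHashes, second: StageHashes): StageDiff[] {
  const stages = Object.keys(first).concat(
    Object.keys(second).filter((stage) => !(stage in first)),
  );
  const diffs: StageDiff[] = [];
  for (const stage of stages) {
    if (first[stage] !== second[stage]) {
      diffs.push({ stage, first: first[stage], second: second[stage] });
    }
  }
  return diffs;
}

// Renders the diffs as plain text, one line per differing stage.
function formatReport(diffs: StageDiff[]): string {
  if (diffs.length === 0) return "deterministic: all stage hashes match";
  return diffs
    .map((d) => `${d.stage}: ${d.first ?? "(missing)"} != ${d.second ?? "(missing)"}`)
    .join("\n");
}
```

Reporting per stage rather than per run makes it easier to see where nondeterminism first appears, even though fixing it is out of scope for this issue.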


Constraints

  • read-only validation
  • no pipeline mutation

Acceptance criteria

Functional

  • full pipeline executes end-to-end

Determinism

  • repeated runs produce identical hashes

Verification commands

Run the validation command twice on the same input.

Expected result:

  • identical artifact hashes

Explicitly out of scope

  • fixing nondeterminism issues
  • caching improvements

Suggested branch

refactor/s7-pipeline-validation

Suggested commit slug

phase-2-step-8-validation
