Skip to content

Implement robust dataset validation and config file for Super Resolution#217

Open
abhiram123467 wants to merge 4 commits intoML4SCI:mainfrom
abhiram123467:feature/dataset-validation
Open

Implement robust dataset validation and config file for Super Resolution#217
abhiram123467 wants to merge 4 commits intoML4SCI:mainfrom
abhiram123467:feature/dataset-validation

Conversation

@abhiram123467
Copy link
Copy Markdown

Description:
Resolves the feature request outlined in #196 (Follow-up to #135).

This PR introduces a lightweight validation pipeline for the Super-Resolution datasets to catch silent alignment and dimensionality errors before the training loop begins.

Changes included:

  • config.yaml: Externalizes dataset parameters (HR/LR paths, scale factor, patch size) to ensure reproducibility without requiring users to modify core training logic.
  • validate_dataset.py: A pre-training utility that strictly asserts:
    • HR and LR image counts match perfectly.
    • Filenames are identical across both directories to prevent misalignment.
    • The actual NumPy matrix dimensions of the images align with the configured scale_factor.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant