Remove confirmed-dead scripts, defaults, and config keys#1208
Open
trvrb wants to merge 2 commits into
Open
Conversation
First pass of a pandemic-era cruft cleanup. Removes code and data that is referenced by nothing in the workflow, the active profiles, the docs, or the tests (verified by grep plus a dry-run of the CI profile showing an unchanged 37-job DAG). No behavior change to any build. Removed: - Orphan scripts: scripts/add_labels.py, scripts/generate-scientific-credits.py, and the explicitly-deprecated scripts/deprecated/ (calculate_delta_frequency.py, parse_mutational_fitness_tsv_into_distance_map.py). - Unused defaults: defaults/distance_maps/VoC.json (only S1.json is used), defaults/clade_hierarchy.tsv, defaults/clades_who.tsv. - Dead config key files.outgroup (defaults/parameters.yaml) — it pointed at a file that does not exist in the repo and was read nowhere; its config-reference entry (already documented "No longer used") is removed too. - Deprecated my_profiles/ directory (only a deprecation README) and its now-orphan .gitignore exception. - Unused committed example data: data/example_*_worldwide.*, data/example_*_aus.*, data/example_multiple_inputs.tar.xz (the CI-used example_metadata.tsv / example_sequences.fasta.gz are kept). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This was referenced Jun 27, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
First in a series of small, independently-reviewable PRs cleaning up pandemic-era cruft (see the cleanup plan; scope: dead code + proximity removal + docs/schemes).
Motivation
The repo accumulated scripts, default data files, and config keys that nothing references anymore. This PR removes the verified-dead ones. The three live contracts — weekly OPEN build, occasional GISAID builds, and external users running their own builds — are unaffected.
What's removed
workflow/ Snakefile nextstrain_profiles/ docs/ tests/):scripts/add_labels.py,scripts/generate-scientific-credits.py, and the explicitly-deprecatedscripts/deprecated/(calculate_delta_frequency.py,parse_mutational_fitness_tsv_into_distance_map.py).defaults/distance_maps/VoC.json(onlyS1.jsonis used by thedistancesrule),defaults/clade_hierarchy.tsv,defaults/clades_who.tsv.files.outgroupindefaults/parameters.yaml— pointed at a file that doesn't exist in the repo and was read nowhere; its config-reference entry (already documented "No longer used") is removed too.my_profiles/directory (only a deprecation README) and its now-orphan.gitignoreexception.data/example_*_worldwide.*,data/example_*_aus.*,data/example_multiple_inputs.tar.xz. The CI-usedexample_metadata.tsv/example_sequences.fasta.gzare kept.Verification
workflow/,Snakefile,nextstrain_profiles/,docs/, andtests/(accounting for config-key indirection, which hides references behindconfig["files"][...]).snakemake --profile nextstrain_profiles/nextstrain-ci -nbuilds an unchanged 37-job DAG with no errors.Test plan
🤖 Generated with Claude Code