fix: strip lyric pages whose credit uses a colon or band-as-author line #7
Merged
The safe-publish pass gated on the "words/music by ..." authorship line, so
five real licensed-lyric pages slipped through with their lyrics intact:
- eter, libe, onlytime, way2 — credit uses a colon ("Words: Hunter;
music: Garcia") rather than "by"
- ydha — credit is "By the Grateful Dead" (band-as-author, no
words/music token at all)
All five still carry the publisher's licensed-lyric signature, "Copyright
Ice Nine Publishing; used by permission", which sits just above the lyric
block exactly where the authorship line normally does. Add it as a fallback
credit anchor in CREDIT_RE. It is the definitive marker of reproduced GD
lyrics: essays that merely quote permission say "Used with permission"
without naming Ice Nine, and pages whose blockquote is an annotation
(operator's OED entry, slip's reader email) have no permission line, so
they stay untouched.
On the 118 "words by" pages the authorship line still matches first, leaving
their stripped output byte-for-byte unchanged; exactly the five intended
pages flip from skip to strip. make dist/safe + make audit stay green.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
ds17f added a commit that referenced this pull request on Jun 5, 2026
…ne (#7)
Co-authored-by: Damian Silbergleith <14797221+ds17f@users.noreply.github.com>
Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>
Problem
The safe-publish pass (`scripts/safe_build.py`) gated on the `words/music by …` authorship line to decide whether a page reproduces licensed lyrics. Five real song pages used a different credit format and slipped through with their lyrics intact:
- eter, libe, onlytime, way2: `Words: Hunter; music: Garcia`
- ydha: `By the Grateful Dead` (no words/music token)
Fix
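All five still carry the publisher's licensed-lyric signature, `Copyright Ice Nine Publishing; used by permission`, sitting just above the lyric block exactly where the authorship line normally does. This adds it as a fallback credit anchor in `CREDIT_RE`.
It is the definitive marker of reproduced GD lyrics, so it cleanly excludes the lookalikes:
- essays that merely quote permission say "Used with permission" without naming Ice Nine
- pages whose blockquote is an annotation (operator's OED entry, slip's reader email) have no permission line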
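To illustrate the fallback anchor, here is a minimal sketch of how such a pattern could be structured. The actual `CREDIT_RE` in `scripts/safe_build.py` is not shown in this PR, so the pattern below, its group names, and the `credit_anchor` helper are assumptions made for illustration only.

```python
import re

# Hypothetical reconstruction; the real CREDIT_RE may differ in every detail.
CREDIT_RE = re.compile(
    r"""
    (?P<authorship>                 # primary anchor: "Words by ...", "Music by ..."
        ^[ \t]*(?:words|music|lyrics)
        (?:[ \t]+and[ \t]+music)?
        [ \t]+by\b
    )
    |
    (?P<permission>                 # fallback anchor: the publisher's signature
        copyright\s+ice\s+nine\s+publishing
    )
    """,
    re.IGNORECASE | re.MULTILINE | re.VERBOSE,
)


def credit_anchor(page_text: str):
    """Return which anchor matched first ('authorship' or 'permission'), or None.

    re.search returns the leftmost match, so on a page that has both lines the
    authorship line wins because it sits above the permission line.
    """
    match = CREDIT_RE.search(page_text)
    return match.lastgroup if match else None
```

Under these assumptions, a page with a `Words by …` credit still anchors on the authorship line, the colon-style and band-as-author pages anchor on the Ice Nine line, and an essay that only says "Used with permission" matches nothing and is skipped.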
Verification
- Pages with a `words by` credit: stripped output unchanged (the `words by` line still matches first).
- The five newly matched pages: `dead.net/songs` notice inserted, title/credit and annotation sections preserved.
- `make dist && make audit` → green; `make safe && make audit` → green (now 119 pages / 121 lyric blocks stripped).
🤖 Generated with Claude Code
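One way to check the "unchanged output" claim is to build the safe output before and after the change and compare the two trees byte for byte. The sketch below is not part of this PR; the `changed_pages` helper and the directory names in the usage note are hypothetical.

```python
from pathlib import Path


def changed_pages(before: Path, after: Path) -> list[str]:
    """Return relative paths whose bytes differ between two build output trees.

    A file that exists in `before` but is missing from `after` also counts
    as changed.
    """
    changed = []
    for old in sorted(p for p in before.rglob("*") if p.is_file()):
        rel = old.relative_to(before)
        new = after / rel
        if not new.is_file() or new.read_bytes() != old.read_bytes():
            changed.append(rel.as_posix())
    return changed
```

For example, `changed_pages(Path("safe-before"), Path("safe-after"))` would be expected to list only the five pages that flip from skip to strip, assuming both directories hold complete builds of the same source.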