fix(tw): spec-support trailing adverb/verb over-capture (2026-06-05 batch)#199
Merged
Conversation
…atch) Add report-grounded tokens to the TW spec-support extractor so verbal / adverbial tails stop bleeding past the head noun: Trailing tokens: - 可通訊地 / 共同地 — manner adverbs (X + 地 particle); never noun termini. `檢體採集支援系統可通訊地` → `檢體採集支援系統` (#187), which then matches the spec via substring even with an embedded reference numeral. `夾持部共同地` → `夾持部` (#193). - 緊靠 — abutment verb, same family as the existing 抵靠. `夾持部緊靠` → `夾持部` (#193). Fires only when the term ENDS in 緊靠; 緊靠部/緊靠面 (ending 部/面) stay inventoried. - 不平行 — negation + adjective predicate. `第一外側壁不平行` → `第一外側壁` (#194). Leading reject: - 中兩 — stranded `其中兩個` fragment with 其 dropped (#194). Mirror of the existing 中一. FN-guarded: 土地/基地/場地/緊靠部/緊靠面/平行板/對比/占比 all untouched. Drift gate (_spec_support_harness): 20 fixtures, added=0, removed=0 — zero corpus regression (tokens target user-draft patterns absent from the corpus). CN (Simplified) mirror deferred per DR-1 — no CN report. 6 regression tests. Closes #187, #193, #194.
|
The latest updates on your projects. Learn more about Vercel for GitHub.
|
This was referenced Jun 5, 2026
Closed
Closed
Closed
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Triage findings (issues #187 / #193 / #194)
TW spec-support (§112(a) / 專利法 §26 第3項) over-captured a verbal or adverbial tail past the head noun, so the clean element name never matched the specification:
夾持部共同地(adverb 共同地),夾持部緊靠(verb 緊靠)第一/第二外側壁不平行(negation+adjective 不平行),中兩個(stranded其中兩個fragment)檢體採集支援系統可通訊地(adverb 可通訊地); the reporter noted the spec does support it (與檢體採集支援系統11可通訊地連接) but the over-capture blocked the matchFix (report-grounded trailing tokens + one leading reject)
可通訊地,共同地(manner adverbs),緊靠(abutment verb, same family as the existing抵靠),不平行(negation predicate).中兩(mirror of the existing中一).For #187, once the adverb is stripped the head noun
檢體採集支援系統matches the spec via Tier-1 substring even with the embedded reference numeral11— the bug was purely the over-capture, not the refnum.Verification gates
夾持部,第一外側壁,檢體採集支援系統);中兩個is leading-rejected.土地/基地/場地(vs 共同地/可通訊地),緊靠部/緊靠面(vs 緊靠),平行板(vs 不平行),對比/占比(these are why single-char比was not added). The verb tokens fire only when the term ENDS in the verb._spec_support_harness.py): 20 fixtures, added=0, removed=0 — zero corpus regression. Baseline unchanged (77→77); the tokens target user-draft patterns absent from the corpus.Scope / deferrals (DR-1)
Deliberately not included (need their own grounding, queued):
比([report] specSupport #186) — 對比/占比 are legitimate nouns; a trailing single-char strip would cause FNs.自+極端且電性耦接([report] specSupport #196),分別抵靠於多個+幾何形狀([report] specSupport #195) — ambiguous / messy fragments.6 regression tests. Closes #187, #193, #194.