Skip to content

fix(tw): spec-support trailing adverb/verb over-capture (2026-06-05 batch)#199

Merged
kwisschen merged 1 commit into
mainfrom
walker/tw-spec-support-2026-06-05
Jun 5, 2026
Merged

fix(tw): spec-support trailing adverb/verb over-capture (2026-06-05 batch)#199
kwisschen merged 1 commit into
mainfrom
walker/tw-spec-support-2026-06-05

Conversation

@kwisschen
Copy link
Copy Markdown
Owner

Triage findings (issues #187 / #193 / #194)

TW spec-support (§112(a) / 專利法 §26 第3項) over-captured a verbal or adverbial tail past the head noun, so the clean element name never matched the specification:

  • [report] specSupport #193夾持部共同地 (adverb 共同地), 夾持部緊靠 (verb 緊靠)
  • [report] specSupport #194第一/第二外側壁不平行 (negation+adjective 不平行), 中兩個 (stranded 其中兩個 fragment)
  • [report] specSupport #187檢體採集支援系統可通訊地 (adverb 可通訊地); the reporter noted the spec does support it (與檢體採集支援系統11可通訊地連接) but the over-capture blocked the match

Fix (report-grounded trailing tokens + one leading reject)

  • Trailing tokens: 可通訊地, 共同地 (manner adverbs), 緊靠 (abutment verb, same family as the existing 抵靠), 不平行 (negation predicate).
  • Leading reject: 中兩 (mirror of the existing 中一).

For #187, once the adverb is stripped the head noun 檢體採集支援系統 matches the spec via Tier-1 substring even with the embedded reference numeral 11 — the bug was purely the over-capture, not the refnum.

Verification gates

  • Reproducer: each reported term reduces to its clean head noun (夾持部, 第一外側壁, 檢體採集支援系統); 中兩個 is leading-rejected.
  • Anti-corpus / FN guard: real nouns ending in the same characters are untouched — 土地/基地/場地 (vs 共同地/可通訊地), 緊靠部/緊靠面 (vs 緊靠), 平行板 (vs 不平行), 對比/占比 (these are why single-char was not added). The verb tokens fire only when the term ENDS in the verb.
  • Drift gate (_spec_support_harness.py): 20 fixtures, added=0, removed=0 — zero corpus regression. Baseline unchanged (77→77); the tokens target user-draft patterns absent from the corpus.
  • Statute pin: 專利法 §26 第3項 — spec-support is for noun phrases; verbs/adverbs/predicates are not reference targets.

Scope / deferrals (DR-1)

Deliberately not included (need their own grounding, queued):

  • ([report] specSupport #186) — 對比/占比 are legitimate nouns; a trailing single-char strip would cause FNs.
  • trailing + 極端且電性耦接 ([report] specSupport #196), 分別抵靠於多個 + 幾何形狀 ([report] specSupport #195) — ambiguous / messy fragments.
  • CN (Simplified) mirror — no CN report of these tokens; CN↔TW spec-support mirroring has been report-driven both ways.

6 regression tests. Closes #187, #193, #194.

…atch)

Add report-grounded tokens to the TW spec-support extractor so verbal /
adverbial tails stop bleeding past the head noun:

Trailing tokens:
- 可通訊地 / 共同地 — manner adverbs (X + 地 particle); never noun termini.
  `檢體採集支援系統可通訊地` → `檢體採集支援系統` (#187), which then matches
  the spec via substring even with an embedded reference numeral.
  `夾持部共同地` → `夾持部` (#193).
- 緊靠 — abutment verb, same family as the existing 抵靠. `夾持部緊靠` →
  `夾持部` (#193). Fires only when the term ENDS in 緊靠; 緊靠部/緊靠面
  (ending 部/面) stay inventoried.
- 不平行 — negation + adjective predicate. `第一外側壁不平行` →
  `第一外側壁` (#194).

Leading reject:
- 中兩 — stranded `其中兩個` fragment with 其 dropped (#194). Mirror of
  the existing 中一.

FN-guarded: 土地/基地/場地/緊靠部/緊靠面/平行板/對比/占比 all untouched.
Drift gate (_spec_support_harness): 20 fixtures, added=0, removed=0 —
zero corpus regression (tokens target user-draft patterns absent from
the corpus). CN (Simplified) mirror deferred per DR-1 — no CN report.

6 regression tests. Closes #187, #193, #194.
@vercel
Copy link
Copy Markdown

vercel Bot commented Jun 5, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
patent-lint Ready Ready Preview, Comment Jun 5, 2026 4:48am

@kwisschen kwisschen merged commit a0cbcbb into main Jun 5, 2026
6 checks passed
@kwisschen kwisschen deleted the walker/tw-spec-support-2026-06-05 branch June 5, 2026 04:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[report] specSupport

1 participant