fix(confidence-badge): lower threshold 80→75 for actual visibility #169

Merged
kwisschen merged 1 commit into main from fix/lower-confidence-badge-threshold on Jun 1, 2026
@kwisschen
Owner

Why

PR #168 shipped the badge at score >= 80. That threshold measured 70% precision but only 2.5% coverage (106/4321 findings). In a user test on real drafts the badge was effectively invisible: typically 1 badged finding in a 40-finding draft.

Fix

Lower the threshold to 75, which measured 69% precision and 7.2% coverage (about 3× wider).

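| Threshold | Precision | Coverage | Findings/typical draft |
|-----------|-----------|----------|------------------------|
| 80 (was)  | 70.0%     | 2.5%     | ~0–1                   |
| 75 (now)  | 69.0%     | 7.2%     | ~1–3                   |
| 70        | 67.0%     | 10.6%    | ~2–3                   |
| 65        | 66.4%     | 11.5%    | ~2–3                   |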
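
As a rough illustration of how a table like this could be produced, the sketch below sweeps candidate thresholds over a labeled set of findings. The `Finding` type, its fields, the `sweep` helper, and the sample data are all hypothetical; the PR does not show the actual evaluation script or corpus.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    score: int        # walker confidence score (assumed 0-100 scale)
    is_correct: bool  # hypothetical ground-truth label for the finding


def sweep(findings, thresholds):
    """Return {threshold: (precision, coverage)} for the rule score >= threshold."""
    results = {}
    total = len(findings)
    for t in thresholds:
        badged = [f for f in findings if f.score >= t]
        correct = sum(f.is_correct for f in badged)
        # Precision: share of badged findings that are correct.
        precision = correct / len(badged) if badged else 0.0
        # Coverage: share of all findings that receive the badge.
        coverage = len(badged) / total if total else 0.0
        results[t] = (precision, coverage)
    return results


# Toy data for illustration only, not the real 4321-finding corpus.
sample = [
    Finding(90, True), Finding(82, True), Finding(78, False),
    Finding(76, True), Finding(71, False), Finding(60, True),
    Finding(55, False), Finding(40, False),
]
table = sweep(sample, [80, 75])
```

On this toy data, lowering the threshold from 80 to 75 doubles coverage while precision drops, which is the same shape of trade-off the measured table shows.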

Trade-off: precision is 1pp below the original 70% target. The title attribute (updated across 6 locales) discloses the actual measured precision and the baseline. Precision is still 31pp above the 38% absolute baseline.
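
The figures quoted above can be cross-checked with a few lines of arithmetic. This is only a sanity check on the numbers reported in this PR; the variable names are illustrative, and the 40-finding draft size is taken from the "Why" section.

```python
CORPUS_FINDINGS = 4321     # total findings in the measured corpus
BADGED_AT_80 = 106         # findings with score >= 80
BASELINE_PRECISION = 0.38  # absolute baseline precision
DRAFT_SIZE = 40            # the "40-finding draft" used as a typical draft

coverage_80 = BADGED_AT_80 / CORPUS_FINDINGS  # rounds to the reported 2.5%
coverage_75 = 0.072                           # reported coverage at threshold 75
precision_75 = 0.69                           # reported precision at threshold 75

# Widening factor, using the rounded 2.5% figure from the PR text.
widening = coverage_75 / 0.025

# Lift over the baseline, in percentage points.
lift_pp = round((precision_75 - BASELINE_PRECISION) * 100)

# Expected badged findings in a 40-finding draft at each threshold.
per_draft_80 = coverage_80 * DRAFT_SIZE
per_draft_75 = coverage_75 * DRAFT_SIZE
```

The widening factor comes out just under 3, consistent with the "3× wider" shorthand, and the expected badge count per draft moves from roughly one to roughly three.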

No walker change

Pure threshold + i18n string update. Walker scoring formula unchanged.
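
A minimal sketch of what a threshold-only change means, assuming the badge is gated by a single comparison against a constant. `BADGE_THRESHOLD` and `shows_confidence_badge` are hypothetical names; the real check may live in the frontend and look different.

```python
BADGE_THRESHOLD = 75  # assumed constant; PR #168 used 80


def shows_confidence_badge(score: float, threshold: int = BADGE_THRESHOLD) -> bool:
    """Return True when a finding's score qualifies for the badge.

    The score itself is whatever the walker already produced; only the
    cut-off applied to it changes.
    """
    return score >= threshold


# A finding scored 78 was hidden under the old cut-off and is shown under the new one.
old_visible = shows_confidence_badge(78, threshold=80)
new_visible = shows_confidence_badge(78)
```

Because the scores are untouched, any finding badged at 80 remains badged at 75; the change only adds findings in the 75 to 79 band.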

Tests

pytest -q: 2704 passed, 11 skipped.

PR #168 shipped the "Higher confidence" badge at score>=80 (measured
70% precision, 2.5% coverage). The user-perspective failure: 2.5% of
4321 corpus findings = ~1 finding per 40-finding draft = badge
effectively invisible on real-world drafts.

Lowered threshold to 75 (measured 69% precision, 7.2% coverage):
  - Precision: 1pp below the 70% bar, disclosed honestly in title attr
  - Coverage: 3× wider → ~1-3 findings/typical draft visible
  - vs 38% baseline absolute precision: still a meaningful lift

Updated i18n title attribute strings across all 6 locales to reflect
the actual measured precision (~69%, baseline 38%). "Higher confidence"
badge text unchanged — language is still honest about ranking-vs-
guarantee semantics.

No walker behavior change. Pure UX threshold + i18n string update.

Tests: pytest 2704 passed, 11 skipped. Frontend build clean.
@vercel
vercel Bot commented Jun 1, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project     | Deployment | Actions          | Updated (UTC)       |
|-------------|------------|------------------|---------------------|
| patent-lint | Ready      | Preview, Comment | Jun 1, 2026 7:07am  |

@kwisschen kwisschen merged commit 566a338 into main Jun 1, 2026
6 checks passed
@kwisschen kwisschen deleted the fix/lower-confidence-badge-threshold branch June 1, 2026 07:09