Skip to content

data(verify): verified promotions (2026-06-23)#81

Merged
Seungpyo1007 merged 1 commit into
mainfrom
verify/promote-28002668986
Jun 23, 2026
Merged

data(verify): verified promotions (2026-06-23)#81
Seungpyo1007 merged 1 commit into
mainfrom
verify/promote-28002668986

Conversation

@Seungpyo1007

@Seungpyo1007 Seungpyo1007 commented Jun 23, 2026

Copy link
Copy Markdown
Member

Automated verified promotions from app.verify promote. Each change flips only the verified flag; structural validator passed and diff scope guarded. Review before merge. Closes #1

Auto-promotions from the verification layer (green+live-source or crossref-confirm).
Each flip is verified:false->true only; see data/_verify/ledger.jsonl. Refs #1
@TechEngineBot

Copy link
Copy Markdown
Member

🔎 Data verification — Tier 0 (offline existence/trust)

Scored by app.verify; posted by TechEngineBot. Informational only — the structural gate (app.validate) is separate and authoritative for merge.

Changed records in this PR

23 record(s) scored.

%%{init: {"theme":"base","themeVariables":{"pie1":"#3fb950","pie2":"#d29922","pie3":"#f85149","pieStrokeWidth":"0px","pieOpacity":"1"}}}%%
pie showData
    title Verification bands — all records
    "Green" : 23
    "Yellow" : 0
    "Red" : 0
Loading
Category Distribution Total 🟢 🟡 🔴 🟢 %
tablet 🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩 23 23 0 0 100.0%
All 🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩 23 23 0 0 100.0%

Full-dataset baseline

101954 record(s) scored.

%%{init: {"theme":"base","themeVariables":{"pie1":"#3fb950","pie2":"#d29922","pie3":"#f85149","pieStrokeWidth":"0px","pieOpacity":"1"}}}%%
pie showData
    title Verification bands — all records
    "Green" : 10019
    "Yellow" : 89472
    "Red" : 2463
Loading
Category Distribution Total 🟢 🟡 🔴 🟢 %
brand 🟩🟨🟨🟨🟨🟨🟨🟨🟨🟨🟨🟨 189 10 179 0 5.3%
soc 🟩🟨🟨🟨🟨🟥🟥🟥🟥🟥🟥🟥 2104 123 680 1301 5.8%
smartphone 🟩🟨🟨🟨🟨🟨🟨🟨🟨🟨🟨🟥 90118 8453 80547 1118 9.4%
tablet 🟩🟨🟨🟨🟨🟨🟨🟨🟨🟨🟨🟥 3048 174 2846 28 5.7%
watch 🟩🟨🟨🟨🟨🟨🟨🟨🟨🟨🟨🟥 378 11 357 10 2.9%
pda 🟩🟩🟩🟨🟨🟨🟨🟨🟨🟨🟨🟥 110 27 77 6 24.5%
gpu 🟩🟨🟨🟨🟨🟨🟨🟨🟨🟨🟨🟨 2030 245 1785 0 12.1%
cpu 🟩🟩🟩🟨🟨🟨🟨🟨🟨🟨🟨🟨 3977 976 3001 0 24.5%
All 🟩🟨🟨🟨🟨🟨🟨🟨🟨🟨🟨🟥 101954 10019 89472 2463 9.8%

green = authoritative source + complete + consistent · yellow = plausible, needs confirmation · red = sparse/weak source or a hard contradiction. Promotion to verified runs in the scheduled verify-network workflow.

@Seungpyo1007 Seungpyo1007 merged commit f778127 into main Jun 23, 2026
5 checks passed
@github-project-automation github-project-automation Bot moved this from In Progress to Done in TechAPI-Project Jun 23, 2026
@Seungpyo1007 Seungpyo1007 deleted the verify/promote-28002668986 branch June 23, 2026 04:50
@TechEngineBot

Copy link
Copy Markdown
Member

TechEngine change review: PASS

Check Result
python -m app.validate PASS
python integrity_check.py TechAPI/data --strict PASS

Changed data

Category Added Modified Deleted Added verified Added unverified Added Kaggle-sourced
brand 0 0 0 0 0 0
soc 0 0 0 0 0 0
smartphone 0 0 0 0 0 0
tablet 0 0 0 0 0 0
watch 0 0 0 0 0 0
pda 0 0 0 0 0 0
gpu 0 0 0 0 0 0
cpu 0 0 0 0 0 0

Changed record examples

  • No data file changes detected.

Heuristic review

  • Heuristic warnings: none found.

@TechEngineBot

Copy link
Copy Markdown
Member

TechEngine validation stats: PASS

Data summary

Category Total Verified Unverified Missing verified Tracked Verified % of tracked
brand 189 9 180 0 189 4.8%
soc 2104 120 1984 0 2104 5.7%
smartphone 90118 346 89772 0 90118 0.4%
tablet 3048 24 3024 0 3048 0.8%
watch 378 4 374 0 378 1.1%
pda 110 1 109 0 110 0.9%
gpu 2030 242 1788 0 2030 11.9%
cpu 3977 976 3001 0 3977 24.5%
all 101954 1722 100232 0 101954 1.7%

Warning

Tracked verified coverage is below 50% for smartphone 0.4% (346/90118), tablet 0.8% (24/3048), pda 0.9% (1/110), watch 1.1% (4/378), all 1.7% (1722/101954), brand 4.8% (9/189), soc 5.7% (120/2104), gpu 11.9% (242/2030), and 1 more.
Tracked coverage excludes records missing the verified field; see the Missing verified column for those records.
This does not fail validation. Keep imported records verified: false until manual audit, but treat this as follow-up verification work before relying on the affected categories as curated data.

Validation notes

  • Full advisory outlier listings are suppressed on successful runs because they are dataset-wide and mostly stable between PRs.
  • Failure runs still include a detailed log excerpt for debugging.

Key output:

## app.validate
## integrity_check.py --strict
loaded CPU=3977 GPU=2030
✅ integrity gate: no hard anomalies.
Integrity section Flagged lines
structural 0
CPU name/tier consistency (desktop mainstream only) 0
CPU single>multi (cinebench/geekbench — should be multi>=single) 0
CPU era-vs-score outliers 8
CPU cross-source ratio outliers (possible wrong-variant) 152
GPU cross-source ratio outliers + sanity 18

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

data Dataset changes enhancement New feature or request

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

Massive dataset rebuild: CPU + brand + GPU + smartphone + SoC (1989-2026)

2 participants