test: /verify all-tiers e2e (throwaway) #61

Closed
Seungpyo1007 wants to merge 1 commit into main from test/verify-alltiers-e2e

Conversation

@Seungpyo1007 commented Jun 22, 2026

Member

Tweaks one cpu record so /verify exercises Tier 1/2/3 on a changed record. Will close.

Closes #1

@Seungpyo1007

Member Author

/verify

@Seungpyo1007

Member Author

🔎 Data verification — Tier 0 (offline existence/trust)

Scored by app.verify; posted by TechEngineBot. Informational only — the structural gate (app.validate) is separate and authoritative for merge.

Changed records in this PR

1 record(s) scored.

| Category | Total | 🟢 Green | 🟡 Yellow | 🔴 Red | Green % |
|---|---|---|---|---|---|
| cpu | 1 | 1 | 0 | 0 | 100.0% |
| All | 1 | 1 | 0 | 0 | 100.0% |

Full-dataset baseline

101954 record(s) scored.

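| Category | Total | 🟢 Green | 🟡 Yellow | 🔴 Red | Green % |
|---|---|---|---|---|---|
| brand | 189 | 10 | 179 | 0 | 5.3% |
| soc | 2104 | 123 | 680 | 1301 | 5.8% |
| smartphone | 90118 | 8453 | 80547 | 1118 | 9.4% |
| tablet | 3048 | 174 | 2846 | 28 | 5.7% |
| watch | 378 | 11 | 357 | 10 | 2.9% |
| pda | 110 | 27 | 77 | 6 | 24.5% |
| gpu | 2030 | 245 | 1785 | 0 | 12.1% |
| cpu | 3977 | 976 | 3001 | 0 | 24.5% |
| All | 101954 | 10019 | 89472 | 2463 | 9.8% |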

green = authoritative source + complete + consistent · yellow = plausible, needs confirmation · red = sparse/weak source or a hard contradiction. Promotion to verified runs in the scheduled verify-network workflow.
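The Green % column in the tables above is consistent with green / total, rounded to one decimal place. A minimal sketch that reproduces the reported baseline values from the counts, assuming that formula; the function name `green_pct` is hypothetical and is not taken from app.verify:

```python
def green_pct(green: int, total: int) -> float:
    """Share of green records as a percentage, rounded to one decimal place."""
    if total == 0:
        return 0.0  # assumption: an empty category reports 0.0
    return round(100.0 * green / total, 1)


# (green, total) counts copied from the full-dataset baseline table.
baseline = {
    "brand": (10, 189),
    "soc": (123, 2104),
    "smartphone": (8453, 90118),
    "tablet": (174, 3048),
    "watch": (11, 378),
    "pda": (27, 110),
    "gpu": (245, 2030),
    "cpu": (976, 3977),
}

for category, (green, total) in baseline.items():
    print(f"{category}: {green_pct(green, total)}%")

# The "All" row is the column-wise sum of the category rows.
all_green = sum(g for g, _ in baseline.values())
all_total = sum(t for _, t in baseline.values())
print(f"All: {all_green}/{all_total} = {green_pct(all_green, all_total)}%")
```

Summing the category rows gives 10019 green out of 101954 records, which matches the All row.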

@TechEngineBot

Member

🔎 Data verification — Tiers 0–3 (on demand)

No data records changed in this PR. Showing the full-dataset Tier 0 baseline only; network tiers (1–3) have nothing to check.

Full-dataset Tier 0 baseline

101954 record(s) scored.

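| Category | Total | 🟢 Green | 🟡 Yellow | 🔴 Red | Green % |
|---|---|---|---|---|---|
| brand | 189 | 10 | 179 | 0 | 5.3% |
| soc | 2104 | 123 | 680 | 1301 | 5.8% |
| smartphone | 90118 | 8453 | 80547 | 1118 | 9.4% |
| tablet | 3048 | 174 | 2846 | 28 | 5.7% |
| watch | 378 | 11 | 357 | 10 | 2.9% |
| pda | 110 | 27 | 77 | 6 | 24.5% |
| gpu | 2030 | 245 | 1785 | 0 | 12.1% |
| cpu | 3977 | 976 | 3001 | 0 | 24.5% |
| All | 101954 | 10019 | 89472 | 2463 | 9.8% |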

Requested by @Seungpyo1007 via /verify · scored by app.verify, posted by TechEngineBot. Informational only — the structural gate (app.validate) is separate; Tier 3 here is dry-run.

@TechEngineBot

Member

TechEngine change review: PASS

| Check | Result |
|---|---|
| python -m app.validate | PASS |
| python integrity_check.py TechAPI/data --strict | PASS |

Changed data

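| Category | Added | Modified | Deleted | Added verified | Added unverified | Added Kaggle-sourced |
|---|---|---|---|---|---|---|
| brand | 0 | 0 | 0 | 0 | 0 | 0 |
| soc | 0 | 0 | 0 | 0 | 0 | 0 |
| smartphone | 0 | 0 | 0 | 0 | 0 | 0 |
| tablet | 0 | 0 | 0 | 0 | 0 | 0 |
| watch | 0 | 0 | 0 | 0 | 0 | 0 |
| pda | 0 | 0 | 0 | 0 | 0 | 0 |
| gpu | 0 | 0 | 0 | 0 | 0 | 0 |
| cpu | 0 | 1 | 0 | 0 | 0 | 0 |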

Changed record examples

cpu modified

  • cpu/amd/1996/consumer/k5-pr133.json - AMD K5 PR133
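The changed-record path above looks like it encodes category, vendor, year, segment, and a record slug. A minimal sketch of splitting such a path, assuming that five-part layout holds for every record; the layout, the field names, and `parse_record_path` are guesses made for illustration and are not confirmed by the repository:

```python
from pathlib import PurePosixPath
from typing import NamedTuple


class RecordPath(NamedTuple):
    category: str
    vendor: str
    year: int
    segment: str
    slug: str


def parse_record_path(path: str) -> RecordPath:
    """Split 'category/vendor/year/segment/slug.json' into its parts."""
    parts = PurePosixPath(path).parts
    if len(parts) != 5 or not parts[4].endswith(".json"):
        raise ValueError(f"unexpected record path layout: {path!r}")
    category, vendor, year, segment, filename = parts
    # int() raises ValueError if the third component is not a year.
    return RecordPath(category, vendor, int(year), segment,
                      PurePosixPath(filename).stem)


record = parse_record_path("cpu/amd/1996/consumer/k5-pr133.json")
print(record.category, record.vendor, record.year, record.slug)
```

Grouping changed files by the first component would be one way to produce per-category counts like those in the Changed data table.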

Heuristic review

  • Heuristic warnings: none found.

@TechEngineBot

Member

TechEngine validation stats: PASS

Data summary

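| Category | Total | Verified | Unverified | Missing verified | Tracked | Verified % of tracked |
|---|---|---|---|---|---|---|
| brand | 189 | 0 | 189 | 0 | 189 | 0.0% |
| soc | 2104 | 58 | 2046 | 0 | 2104 | 2.8% |
| smartphone | 90118 | 184 | 89934 | 0 | 90118 | 0.2% |
| tablet | 3048 | 0 | 3048 | 0 | 3048 | 0.0% |
| watch | 378 | 0 | 378 | 0 | 378 | 0.0% |
| pda | 110 | 0 | 110 | 0 | 110 | 0.0% |
| gpu | 2030 | 0 | 2030 | 0 | 2030 | 0.0% |
| cpu | 3977 | 976 | 3001 | 0 | 3977 | 24.5% |
| all | 101954 | 1218 | 100736 | 0 | 101954 | 1.2% |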

Warning

Tracked verified coverage is below 50% for brand 0.0% (0/189), tablet 0.0% (0/3048), watch 0.0% (0/378), pda 0.0% (0/110), gpu 0.0% (0/2030), smartphone 0.2% (184/90118), all 1.2% (1218/101954), soc 2.8% (58/2104), and 1 more.
Tracked coverage excludes records missing the verified field; see the Missing verified column for those records.
This does not fail validation. Keep imported records verified: false until manual audit, but treat this as follow-up verification work before relying on the affected categories as curated data.
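The warning above can be reproduced from the Data summary counts. A minimal sketch, assuming Tracked = Total minus Missing verified and coverage = Verified / Tracked, with the 50% threshold taken from the warning text; `tracked_coverage` is a hypothetical helper, not the validator's actual code:

```python
THRESHOLD_PCT = 50.0  # taken from the warning text


def tracked_coverage(total: int, verified: int, missing_verified: int) -> float:
    """Verified share of tracked records, as a percentage with one decimal."""
    tracked = total - missing_verified  # records that carry a verified field
    if tracked == 0:
        return 0.0  # assumption: nothing tracked reports 0.0
    return round(100.0 * verified / tracked, 1)


# (total, verified, missing_verified) copied from the Data summary table.
summary = {
    "brand": (189, 0, 0),
    "soc": (2104, 58, 0),
    "smartphone": (90118, 184, 0),
    "tablet": (3048, 0, 0),
    "watch": (378, 0, 0),
    "pda": (110, 0, 0),
    "gpu": (2030, 0, 0),
    "cpu": (3977, 976, 0),
    "all": (101954, 1218, 0),
}

coverage = {name: tracked_coverage(*row) for name, row in summary.items()}
below = sorted((pct, name) for name, pct in coverage.items() if pct < THRESHOLD_PCT)
for pct, name in below:
    print(f"{name}: {pct}%")
```

All nine rows fall below the threshold, which agrees with the warning listing eight entries "and 1 more" (the remaining one being cpu at 24.5%).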

Validation notes

  • Full advisory outlier listings are suppressed on successful runs because they are dataset-wide and mostly stable between PRs.
  • Failure runs still include a detailed log excerpt for debugging.

Key output:

## app.validate
## integrity_check.py --strict
loaded CPU=3977 GPU=2030
✅ integrity gate: no hard anomalies.
| Integrity section | Flagged lines |
|---|---|
| structural | 0 |
| CPU name/tier consistency (desktop mainstream only) | 0 |
| CPU single>multi (cinebench/geekbench, should be multi>=single) | 0 |
| CPU era-vs-score outliers | 8 |
| CPU cross-source ratio outliers (possible wrong-variant) | 152 |
| GPU cross-source ratio outliers + sanity | 18 |
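The "CPU single>multi" section flags records whose single-core benchmark score exceeds the multi-core score, since multi >= single is expected. A minimal sketch of that comparison; the record shape, field names, and sample values below are hypothetical and are not taken from integrity_check.py:

```python
from typing import Optional


def single_exceeds_multi(single: Optional[float], multi: Optional[float]) -> bool:
    """Return True when the single-core score is higher than the multi-core score."""
    if single is None or multi is None:
        return False  # assumption: incomplete records are not flagged here
    return single > multi


# Made-up records for illustration only.
records = [
    {"name": "example-a", "single": 1200.0, "multi": 5400.0},
    {"name": "example-b", "single": 900.0, "multi": 850.0},  # suspicious
    {"name": "example-c", "single": 700.0, "multi": None},   # incomplete
]

flagged = [r["name"] for r in records
           if single_exceeds_multi(r["single"], r["multi"])]
print(flagged)
```

Equal scores are not flagged in this sketch, matching the "multi>=single" wording in the section title.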

github-project-automation (bot) moved this from Todo to Done in TechAPI-Project on Jun 22, 2026
@Seungpyo1007 deleted the test/verify-alltiers-e2e branch on June 22, 2026 06:28

Labels

data (Dataset changes), enhancement (New feature or request)

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

Massive dataset rebuild: CPU + brand + GPU + smartphone + SoC (1989-2026)

2 participants