You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Completes the PhoneDB import range with the historic tail — device ids 1–1587 (years 1993–2009), which neither my earlier batches (#27 / #28, top range) nor the concurrent session had reached. +868 variant-level smartphones + 75 SoC seeds (all verified: false, per-record source_urls), plus a refreshed dump.
Distinct per-batch commit messages + Refs #1 in each commit body.
Validation (local)
python -m app.validate → ✅ passed
python TechEngine/integrity_check.py data --strict → ✅ no hard anomalies
Built in an isolated git worktree off origin/main (+ dump from a fresh DB) to avoid a concurrent session's writes to the shared data/. Branch data/phonedb-tail is used instead of the contested data/import-staging.
Commands (local):python -m app.validate ✅ · python TechEngine/integrity_check.py data --strict ✅ (no hard anomalies) Data changes vs main: +868 smartphones, +75 SoC seeds, dump refreshed (→ 25,523 / 1,848). Coverage: PhoneDB ids 1–1587 (years 1993–2009) — the historic tail not covered by #27/#28 or the concurrent session. Example:data/smartphone/palm/2002/..., data/smartphone/nokia/... (early-smartphone / feature-phone era). Heuristic warnings: none new (names normalized at scrape). Build note: generated in an isolated git worktree off origin/main + fresh-DB dump, to avoid a concurrent session's writes to the shared data/.
Tracked verified coverage is below 50% for brand 0.0% (0/60), gpu 0.0% (0/2030), smartphone 0.7% (184/25523), soc 3.1% (58/1848), all 3.6% (1218/33438), cpu 24.5% (976/3977).
Tracked coverage excludes records missing the verified field; see the Missing verified column for those records.
This does not fail validation. Keep imported records verified: false until manual audit, but treat this as follow-up verification work before relying on the affected categories as curated data.
Validation notes
Full advisory outlier listings are suppressed on successful runs because they are dataset-wide and mostly stable between PRs.
Failure runs still include a detailed log excerpt for debugging.
Key output:
## app.validate
## integrity_check.py --strict
loaded CPU=3977 GPU=2030
✅ integrity gate: no hard anomalies.
Integrity section
Flagged lines
structural
0
CPU name/tier consistency (desktop mainstream only)
0
CPU single>multi (cinebench/geekbench — should be multi>=single)
0
CPU era-vs-score outliers
8
CPU cross-source ratio outliers (possible wrong-variant)
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Completes the PhoneDB import range with the historic tail — device ids 1–1587 (years 1993–2009), which neither my earlier batches (#27 / #28, top range) nor the concurrent session had reached. +868 variant-level smartphones + 75 SoC seeds (all
verified: false, per-recordsource_urls), plus a refreshed dump.Changes vs
mainsite/public/v1dump refreshed → 25,523 smartphones / 1,848 SoC.Refs #1in each commit body.Validation (local)
python -m app.validate→ ✅ passedpython TechEngine/integrity_check.py data --strict→ ✅ no hard anomaliesorigin/main(+ dump from a fresh DB) to avoid a concurrent session's writes to the shareddata/. Branchdata/phonedb-tailis used instead of the contesteddata/import-staging.Closes #1