data(mobile): import PhoneDB SoC and smartphone batch #24

Merged
Seungpyo1007 merged 6 commits into main from data/import-staging
Jun 18, 2026
Conversation

@Seungpyo1007 (Member) commented Jun 18, 2026

Summary

  • import 21 PhoneDB-derived mobile processor seed records
  • import 100 PhoneDB-derived smartphone records across Nokia, Motorola, Lenovo, HTC, Honor, Huawei, LG, Sony, ASUS, Infinix, OPPO, POCO, Realme, and Xiaomi
  • keep all newly imported records as verified: false for later TechEngine/manual audit
  • refresh the published site/public/v1 mobile indexes and detail JSON
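
The "keep as verified: false" rule can be spot-checked with a short script. The following is only a sketch, assuming each record is a JSON object with a top-level boolean `verified` field; the helper name `verified_status` and the example file names are hypothetical and not part of the repository.

```python
import json
import tempfile
from pathlib import Path


def verified_status(data_root):
    """Group record files by the state of their top-level `verified` field."""
    status = {"verified": [], "unverified": [], "missing": []}
    for path in sorted(Path(data_root).rglob("*.json")):
        record = json.loads(path.read_text(encoding="utf-8"))
        rel = path.relative_to(data_root).as_posix()
        if not isinstance(record, dict) or "verified" not in record:
            status["missing"].append(rel)      # field absent: not tracked
        elif record["verified"] is True:
            status["verified"].append(rel)
        else:
            status["unverified"].append(rel)   # verified: false
    return status


# Throwaway demo tree with made-up records (assumption: not real data).
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "soc").mkdir()
    (root / "soc" / "example-a.json").write_text(
        json.dumps({"name": "Example A", "verified": False}), encoding="utf-8"
    )
    (root / "soc" / "example-b.json").write_text(
        json.dumps({"name": "Example B"}), encoding="utf-8"
    )
    print(verified_status(root))
```

For a freshly imported batch, every new file would be expected to land in the `unverified` bucket.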

Data source

Import notes

  • skipped noisy carrier/product-code rows unless they resolved into clean public model names
  • skipped the separate smartphone_specs Kaggle candidate set for this PR because it contains rumor/unreleased-looking model names that need stricter filtering first

Verification

  • python -m app.validate PASS
  • python TechEngine\integrity_check.py data --strict PASS
  • cd site && npm.cmd run build PASS
  • git diff --check PASS
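
A checklist like this could be driven from one script that maps exit codes to PASS/FAIL. The sketch below is an assumption about how one might do that, not the project's actual CI; `run_checks` is a hypothetical helper, and the demo uses stand-in commands so it runs anywhere. Locally, the argv lists would be the commands listed above, for example `["python", "-m", "app.validate"]` or `["git", "diff", "--check"]`.

```python
import subprocess
import sys


def run_checks(checks):
    """Run each (label, argv) pair and return {label: "PASS" or "FAIL"}."""
    results = {}
    for label, argv in checks:
        completed = subprocess.run(argv, capture_output=True, text=True)
        # A zero exit code is treated as success, anything else as failure.
        results[label] = "PASS" if completed.returncode == 0 else "FAIL"
    return results


# Stand-in commands (assumption): replace with the real checks locally.
demo = [
    ("always passes", [sys.executable, "-c", "raise SystemExit(0)"]),
    ("always fails", [sys.executable, "-c", "raise SystemExit(1)"]),
]
for label, outcome in run_checks(demo).items():
    print(f"{label} {outcome}")
```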

Closes #1

@github-actions bot added the data (Dataset changes) and enhancement (New feature or request) labels Jun 18, 2026
@Seungpyo1007 moved this from Todo to In Progress in TechAPI-Project Jun 18, 2026
@TechEngineBot (Member) commented Jun 18, 2026

TechEngine change review: PASS

| Check | Result |
| --- | --- |
| python -m app.validate | PASS |
| python integrity_check.py TechAPI/data --strict | PASS |

Changed data

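| Category | Added | Modified | Deleted | Added verified | Added unverified | Added Kaggle-sourced |
| --- | --- | --- | --- | --- | --- | --- |
| brand | 0 | 0 | 0 | 0 | 0 | 0 |
| soc | 21 | 0 | 0 | 0 | 21 | 21 |
| smartphone | 100 | 0 | 0 | 0 | 100 | 100 |
| gpu | 0 | 0 | 0 | 0 | 0 | 0 |
| cpu | 0 | 0 | 0 | 0 | 0 | 0 |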

Changed record examples

soc added

  • soc/mediatek/2018/helio-a22.json - Helio A22
  • soc/mediatek/2018/helio-p22.json - Helio P22
  • soc/mediatek/2021/helio-g37.json - Helio G37
  • soc/mediatek/2022/dimensity-8000.json - Dimensity 8000
  • soc/mediatek/2022/dimensity-9000-plus.json - Dimensity 9000+
  • soc/mediatek/2022/dimensity-930.json - Dimensity 930
  • soc/qualcomm/2013/snapdragon-200.json - Snapdragon 200
  • soc/qualcomm/2013/snapdragon-400.json - Snapdragon 400
  • soc/qualcomm/2014/snapdragon-210.json - Snapdragon 210
  • soc/qualcomm/2014/snapdragon-410.json - Snapdragon 410
  • soc/qualcomm/2015/snapdragon-212.json - Snapdragon 212
  • soc/qualcomm/2016/snapdragon-425.json - Snapdragon 425
  • soc/qualcomm/2016/snapdragon-430.json - Snapdragon 430
  • soc/qualcomm/2016/snapdragon-435.json - Snapdragon 435
  • soc/qualcomm/2016/snapdragon-625.json - Snapdragon 625
  • ... 6 more

smartphone added

  • smartphone/asus/2022/rog-phone-6d-ultimate.json - ROG Phone 6D Ultimate
  • smartphone/asus/2022/rog-phone-6d.json - ROG Phone 6D
  • smartphone/honor/2019/honor-8s-2019-lte.json - Honor 8S 2019 LTE
  • smartphone/honor/2019/honor-8s-2019.json - Honor 8S 2019
  • smartphone/honor/2019/honor-play-3e.json - Honor Play 3e
  • smartphone/honor/2019/honor-play-8.json - Honor Play 8
  • smartphone/honor/2020/honor-8s-2020.json - Honor 8S 2020
  • smartphone/honor/2020/honor-9a.json - Honor 9A
  • smartphone/honor/2020/honor-9s-2020.json - Honor 9S 2020
  • smartphone/htc/2019/desire-19s.json - Desire 19s
  • smartphone/htc/2019/wildfire-x.json - Wildfire X
  • smartphone/htc/2020/wildfire-e2.json - Wildfire E2
  • smartphone/huawei/2019/y5-2019.json - Y5 2019
  • smartphone/huawei/2019/y6-2019-lte.json - Y6 2019 LTE
  • smartphone/huawei/2019/y6-2019.json - Y6 2019
  • ... 85 more

Heuristic review

  • Added records by manufacturer/brand: nokia: 19, lg: 15, motorola: 15, qualcomm: 13, realme: 9, honor: 7, huawei: 7, lenovo: 7
  • Added records by source class: kaggle: 121
  • Heuristic warnings: none found.
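
The per-brand tally can be derived from the record paths alone, since the listed paths follow a `category/brand/year/slug.json` layout. The sketch below assumes that layout holds for every record; `tally_by_brand` is a hypothetical helper, and the ordering (count descending, then brand name) is inferred from the list above rather than confirmed by the bot's source.

```python
from collections import Counter
from pathlib import PurePosixPath


def tally_by_brand(paths):
    """Count records per brand from category/brand/year/slug.json paths."""
    counts = Counter()
    for raw in paths:
        # Unpacking raises ValueError if a path does not have four parts.
        _category, brand, _year, _filename = PurePosixPath(raw).parts
        counts[brand] += 1
    # Highest count first; ties broken alphabetically by brand.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# A few paths taken from the changed record examples above.
sample = [
    "soc/qualcomm/2013/snapdragon-200.json",
    "soc/qualcomm/2013/snapdragon-400.json",
    "soc/mediatek/2018/helio-a22.json",
    "smartphone/honor/2020/honor-9a.json",
]
print(", ".join(f"{brand}: {count}" for brand, count in tally_by_brand(sample)))
```

Because SoC and smartphone paths share the same layout, one pass covers both categories, which is consistent with qualcomm appearing alongside phone brands in the tally.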

@TechEngineBot (Member) commented Jun 18, 2026

TechEngine validation stats: PASS

Data summary

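| Category | Total | Verified | Unverified | Missing verified | Tracked | Verified % of tracked |
| --- | --- | --- | --- | --- | --- | --- |
| brand | 129 | 0 | 0 | 129 | 0 | n/a |
| soc | 216 | 58 | 158 | 0 | 216 | 26.9% |
| smartphone | 1499 | 184 | 1315 | 0 | 1499 | 12.3% |
| gpu | 2030 | 0 | 2030 | 0 | 2030 | 0.0% |
| cpu | 3977 | 976 | 3001 | 0 | 3977 | 24.5% |
| all | 7851 | 1218 | 6504 | 129 | 7722 | 15.8% |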

Warning

Tracked verified coverage is below 50% for gpu 0.0% (0/2030), smartphone 12.3% (184/1499), all 15.8% (1218/7722), cpu 24.5% (976/3977), soc 26.9% (58/216).
Tracked coverage excludes records missing the verified field; see the Missing verified column for those records.
This does not fail validation. Keep imported records verified: false until manual audit, but treat this as follow-up verification work before relying on the affected categories as curated data.
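
The coverage figures in the warning follow from the Data summary numbers: tracked is total minus missing-verified, and coverage is verified divided by tracked. A minimal sketch of that arithmetic, assuming one-decimal rounding and the 50% threshold named in the warning; the function names are hypothetical.

```python
# (total, verified, missing_verified) per category, from the Data summary.
ROWS = {
    "brand": (129, 0, 129),
    "soc": (216, 58, 0),
    "smartphone": (1499, 184, 0),
    "gpu": (2030, 0, 0),
    "cpu": (3977, 976, 0),
    "all": (7851, 1218, 129),
}


def tracked_coverage(total, verified, missing_verified):
    """Return (tracked, percent); percent is None when nothing is tracked."""
    tracked = total - missing_verified
    if tracked == 0:
        return tracked, None  # reported as n/a
    return tracked, round(100 * verified / tracked, 1)


def below_threshold(rows, threshold=50.0):
    """List categories under the threshold, lowest coverage first."""
    flagged = []
    for name, (total, verified, missing) in rows.items():
        tracked, pct = tracked_coverage(total, verified, missing)
        if pct is not None and pct < threshold:
            flagged.append((name, pct, verified, tracked))
    return sorted(flagged, key=lambda item: item[1])


for name, pct, verified, tracked in below_threshold(ROWS):
    print(f"{name} {pct}% ({verified}/{tracked})")
```

The brand category drops out because all 129 of its records lack the verified field, which leaves zero tracked records and no percentage to report.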

Validation notes

  • Full advisory outlier listings are suppressed on successful runs because they are dataset-wide and mostly stable between PRs.
  • Failure runs still include a detailed log excerpt for debugging.

Key output:

## app.validate
## integrity_check.py --strict
loaded CPU=3977 GPU=2030
✅ integrity gate: no hard anomalies.
| Integrity section | Flagged lines |
| --- | --- |
| structural | 0 |
| CPU name/tier consistency (desktop mainstream only) | 0 |
| CPU single>multi (cinebench/geekbench; should be multi>=single) | 0 |
| CPU era-vs-score outliers | 8 |
| CPU cross-source ratio outliers (possible wrong-variant) | 152 |
| GPU cross-source ratio outliers + sanity | 18 |
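
The "single>multi" section encodes a simple invariant: a multi-core score should be at least the single-core score. The sketch below illustrates that rule only; the field names `single` and `multi`, the helper name, and the sample records are all hypothetical and do not reflect the real schema or `integrity_check.py`.

```python
def single_exceeds_multi(records):
    """Return names of records whose single-core score beats the multi-core score."""
    flagged = []
    for record in records:
        single = record.get("single")
        multi = record.get("multi")
        if single is None or multi is None:
            continue  # nothing to compare for this record
        if single > multi:
            flagged.append(record["name"])
    return flagged


# Made-up records for illustration (assumption: not taken from the dataset).
sample = [
    {"name": "example-cpu-a", "single": 1200, "multi": 5400},
    {"name": "example-cpu-b", "single": 1500, "multi": 900},  # likely swapped
    {"name": "example-cpu-c", "single": 1100, "multi": None},
]
print(single_exceeds_multi(sample))
```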

@github-actions bot added the ci (CI and workflow changes) and documentation (Improvements or additions to documentation) labels Jun 18, 2026
@Seungpyo1007 merged commit 9e5d073 into main Jun 18, 2026
4 checks passed
@github-project-automation bot moved this from In Progress to Done in TechAPI-Project Jun 18, 2026
Development

Successfully merging this pull request may close these issues.

Massive dataset rebuild: CPU + brand + GPU + smartphone + SoC (1989-2026)

2 participants