Skip to content

data(mobile): import PhoneDB variant batch#33

Merged
Seungpyo1007 merged 6 commits into
mainfrom
data/import-staging
Jun 20, 2026
Merged

data(mobile): import PhoneDB variant batch#33
Seungpyo1007 merged 6 commits into
mainfrom
data/import-staging

Conversation

@Seungpyo1007

Copy link
Copy Markdown
Member

Summary

  • import PhoneDB/Kaggle mobile variants into the variant folder layout
  • add 2,459 smartphone variant records, 5 tablet records, and 8 mobile SoC stubs
  • refresh the published v1 public dump indexes/details for the imported records

Data changes

  • Source: https://www.kaggle.com/datasets/sady36/mobile-phones-specs
  • All imported records use verified: false for later TechEngine/manual audit.
  • Variant records use base_model_slug plus variant fields for region, memory, network, SIM, and model-number grouping.
  • Top imported brands include Samsung, Xiaomi, Oppo, Vivo, Huawei, Motorola, ZTE, Apple, Nokia, and OnePlus.

Commit layout

  • data(soc): add PhoneDB mobile SoC stubs
  • data(mobile): import Samsung Apple and Google variants
  • data(smartphone): import BBK group variants
  • data(smartphone): import Xiaomi and Huawei variants
  • data(smartphone): import remaining PhoneDB variant batch
  • chore(site): refresh public dump for PhoneDB variants

Verification

  • python -m app.validate PASS
  • python TechEngine\integrity_check.py data --strict PASS
  • cd site && npm.cmd run build PASS
  • git diff --check origin/main...HEAD PASS

Closes #1

@TechEngineBot

Copy link
Copy Markdown
Member

TechEngine change review: PASS

Check Result
python -m app.validate PASS
python integrity_check.py TechAPI/data --strict PASS

Changed data

Category Added Modified Deleted Added verified Added unverified Added Kaggle-sourced
brand 0 0 0 0 0 0
soc 8 0 0 0 8 8
smartphone 2459 0 0 0 2459 2459
tablet 5 0 0 0 5 5
watch 0 0 0 0 0 0
pda 0 0 0 0 0 0
gpu 0 0 0 0 0 0
cpu 0 0 0 0 0 0

Changed record examples

soc added

  • soc/apple/2016/apple-a10-fusion-apl1024-apl1w24.json - Apple A10 Fusion APL1024 / APL1W24
  • soc/apple/2022/apple-a15-bionic-apl1007-apl1w07.json - Apple A15 Bionic APL1007 / APL1W07
  • soc/apple/2022/apple-a16-bionic-apl1010-apl1w10.json - Apple A16 Bionic APL1010 / APL1W10
  • soc/mediatek/2022/mediatek-dimensity-930-mt6855.json - MediaTek Dimensity 930 MT6855
  • soc/unisoc/2021/unisoc-tiger-t618.json - UNISOC Tiger T618
  • soc/unisoc/2022/unisoc-tiger-t606.json - UNISOC Tiger T606
  • soc/unisoc/2022/unisoc-tiger-t612.json - UNISOC Tiger T612
  • soc/unisoc/2022/unisoc-tiger-t616.json - UNISOC Tiger T616

smartphone added

  • smartphone/acer/2022/sospiro/sospiro-a60-latam.json - Sospiro A60 LATAM
  • smartphone/alcatel/2017/u5-4047d/u5-3g-dual-sim-emea-8gb-4047d.json - U5 3G Dual SIM EMEA 8GB 4047D
  • smartphone/alcatel/2017/u5-4047x/u5-3g-emea-8gb-4047x.json - U5 3G EMEA 8GB 4047X
  • smartphone/alcatel/2017/u5-5044i/u5-dual-sim-td-lte-apac-8gb-5044i.json - U5 Dual SIM TD-LTE APAC 8GB 5044I
  • smartphone/alcatel/2017/u5-5044t-optus-x-spirit/u5-td-lte-apac-8gb-5044t-optus-x-spirit.json - U5 TD-LTE APAC 8GB 5044T / Optus X Spirit
  • smartphone/alcatel/2017/u5-hd-5047y/u5-hd-lte-emea-8gb-5047y.json - U5 HD LTE EMEA 8GB 5047Y
  • smartphone/alcatel/2018/1-5033f/1-dual-sim-lte-emea-16gb-5033f.json - 1 Dual SIM LTE EMEA 16GB 5033F
  • smartphone/alcatel/2018/u5-hd-5047i/u5-hd-dual-sim-td-lte-apac-16gb-5047i.json - U5 HD Dual SIM TD-LTE APAC 16GB 5047I
  • smartphone/alcatel/2018/u5-hd-5047u/u5-hd-premium-dual-sim-lte-emea-16gb-5047u.json - U5 HD Premium Dual SIM LTE EMEA 16GB 5047U
  • smartphone/alcatel/2019/1s-2019-5024a/1s-2019-lte-latam-5024a.json - 1S 2019 LTE LATAM 5024A
  • smartphone/alcatel/2019/1s-2019-5024j/1s-2019-dual-sim-lte-latam-5024j.json - 1S 2019 Dual SIM LTE LATAM 5024J
  • smartphone/alcatel/2020/1se-2020-5030d/1se-2020-global-dual-sim-lte-32gb-5030d.json - 1SE 2020 Global Dual SIM LTE 32GB 5030D
  • smartphone/alcatel/2020/1se-2020-5030f/1se-2020-global-dual-sim-lte-64gb-5030f.json - 1SE 2020 Global Dual SIM LTE 64GB 5030F
  • smartphone/alcatel/2020/1se-2020-5030u/1se-2020-global-dual-sim-lte-64gb-5030u.json - 1SE 2020 Global Dual SIM LTE 64GB 5030U
  • smartphone/alcatel/2021/1-2021/1-2021-dual-sim-lte-emea-8gb.json - 1 2021 Dual SIM LTE EMEA 8GB
  • ... 2444 more

tablet added

  • tablet/samsung/2015/galaxy-tab-3-v/sm-t116-galaxy-tab-3-v-3g-galaxy-tab-3-lite-ve-7-0.json - SM-T116 Galaxy Tab 3 V 3G / Galaxy Tab 3 Lite VE 7.0
  • tablet/samsung/2015/galaxy-tab-3-v/sm-t116nq-galaxy-tab-3-v-3g-galaxy-tab-3-lite-ve-7-0.json - SM-T116NQ Galaxy Tab 3 V 3G / Galaxy Tab 3 Lite VE 7.0
  • tablet/samsung/2015/galaxy-tab-3-v/sm-t116nu-galaxy-tab-3-v-3g-galaxy-tab-3-lite-ve-7-0.json - SM-T116NU Galaxy Tab 3 V 3G / Galaxy Tab 3 Lite VE 7.0
  • tablet/samsung/2015/galaxy-tab-3-v/sm-t116ny-galaxy-tab-3-v-3g.json - SM-T116NY Galaxy Tab 3 V 3G
  • tablet/samsung/2015/galaxy-tab-e/sm-t116bu-galaxy-tab-e-3g-galaxy-tab-3-lite-7-0.json - SM-T116BU Galaxy Tab E 3G / Galaxy Tab 3 Lite 7.0

Heuristic review

  • Added records by manufacturer/brand: samsung: 481, xiaomi: 356, oppo: 317, vivo: 288, huawei: 222, motorola: 157, zte: 123, apple: 87
  • Added records by source class: kaggle: 2472
  • Heuristic warnings: none found.

@TechEngineBot

Copy link
Copy Markdown
Member

TechEngine validation stats: PASS

Data summary

Category Total Verified Unverified Missing verified Tracked Verified % of tracked
brand 189 0 60 129 60 0.0%
soc 2075 58 2017 0 2075 2.8%
smartphone 34130 184 33946 0 34130 0.5%
tablet 452 0 452 0 452 0.0%
watch 78 0 78 0 78 0.0%
pda 110 0 110 0 110 0.0%
gpu 2030 0 2030 0 2030 0.0%
cpu 3977 976 3001 0 3977 24.5%
all 43041 1218 41694 129 42912 2.8%

Warning

Tracked verified coverage is below 50% for brand 0.0% (0/60), tablet 0.0% (0/452), watch 0.0% (0/78), pda 0.0% (0/110), gpu 0.0% (0/2030), smartphone 0.5% (184/34130), soc 2.8% (58/2075), all 2.8% (1218/42912), and 1 more.
Tracked coverage excludes records missing the verified field; see the Missing verified column for those records.
This does not fail validation. Keep imported records verified: false until manual audit, but treat this as follow-up verification work before relying on the affected categories as curated data.

Validation notes

  • Full advisory outlier listings are suppressed on successful runs because they are dataset-wide and mostly stable between PRs.
  • Failure runs still include a detailed log excerpt for debugging.

Key output:

## app.validate
## integrity_check.py --strict
loaded CPU=3977 GPU=2030
✅ integrity gate: no hard anomalies.
Integrity section Flagged lines
structural 0
CPU name/tier consistency (desktop mainstream only) 0
CPU single>multi (cinebench/geekbench — should be multi>=single) 0
CPU era-vs-score outliers 8
CPU cross-source ratio outliers (possible wrong-variant) 152
GPU cross-source ratio outliers + sanity 18

@Seungpyo1007 Seungpyo1007 merged commit 7149945 into main Jun 20, 2026
4 checks passed
@github-project-automation github-project-automation Bot moved this from In Progress to Done in TechAPI-Project Jun 20, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

data Dataset changes enhancement New feature or request site Homepage and public site changes

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

Massive dataset rebuild: CPU + brand + GPU + smartphone + SoC (1989-2026)

2 participants