Skip to content

data(mobile): import Smartprix device records#37

Merged
Seungpyo1007 merged 5 commits into
mainfrom
data/import-staging
Jun 20, 2026
Merged

data(mobile): import Smartprix device records#37
Seungpyo1007 merged 5 commits into
mainfrom
data/import-staging

Conversation

@Seungpyo1007

Copy link
Copy Markdown
Member

Summary

  • import Smartprix mobile records from Kaggle (mayankkumarpoddar/smartprix-mobiles-dataset)
  • add 2,333 smartphones, 1 tablet, and 25 referenced SoC stubs with verified: false
  • refresh the published v1 dump for the imported Smartprix records
  • correct one source typo from Samaung Galaxy F70e to Samsung Galaxy F70e

Import policy

  • Source: https://www.kaggle.com/datasets/mayankkumarpoddar/smartprix-mobiles-dataset
  • Imported records remain verified: false until manual/TechEngine verification.
  • Feature phones and unknown device types were skipped.
  • Future records beyond the current date window and invalid numeric ranges were excluded.
  • Unmapped chipset rows were imported without forcing synthetic SoC links.

Verification

  • python -m app.validate PASS
  • python TechEngine\integrity_check.py data --strict PASS
  • git diff --check origin/main...HEAD PASS
  • cd site && npm.cmd run build PASS

Closes #1

@TechEngineBot

Copy link
Copy Markdown
Member

TechEngine change review: PASS

Check Result
python -m app.validate PASS
python integrity_check.py TechAPI/data --strict PASS

Changed data

Category Added Modified Deleted Added verified Added unverified Added Kaggle-sourced
brand 0 0 0 0 0 0
soc 25 0 0 0 25 25
smartphone 2333 0 0 0 2333 2333
tablet 1 0 0 0 1 1
watch 0 0 0 0 0 0
pda 0 0 0 0 0 0
gpu 0 0 0 0 0 0
cpu 0 0 0 0 0 0

Changed record examples

soc added

  • soc/apple/2024/apple-bionic-a18-pro.json - Apple Bionic A18 Pro
  • soc/mediatek/2025/mediatek-dimensity-7400.json - Mediatek Dimensity 7400
  • soc/mediatek/2025/mediatek-dimensity-8350-apex.json - Mediatek Dimensity 8350 Apex
  • soc/mediatek/2025/mediatek-dimensity-8350-ultimate.json - Mediatek Dimensity 8350 Ultimate
  • soc/mediatek/2025/mediatek-dimensity-8450.json - Mediatek Dimensity 8450
  • soc/mediatek/2025/mediatek-helio-g200.json - Mediatek Helio G200
  • soc/mediatek/2026/mediatek-dimensity-7100.json - Mediatek Dimensity 7100
  • soc/mediatek/2026/mediatek-dimensity-7300e.json - Mediatek Dimensity 7300e
  • soc/mediatek/2026/mediatek-dimensity-7400-apex.json - Mediatek Dimensity 7400 Apex
  • soc/mediatek/2026/mediatek-dimensity-7400-turbo.json - Mediatek Dimensity 7400 Turbo
  • soc/mediatek/2026/mediatek-dimensity-7400-ultimate.json - Mediatek Dimensity 7400 Ultimate
  • soc/mediatek/2026/mediatek-dimensity-7400-ultra.json - Mediatek Dimensity 7400 Ultra
  • soc/mediatek/2026/mediatek-dimensity-8500-ultra.json - Mediatek Dimensity 8500 Ultra
  • soc/qualcomm/2025/qualcomm-snapdragon-6-gen1.json - Qualcomm Snapdragon 6 Gen1
  • soc/qualcomm/2025/qualcomm-snapdragon-6-gen4.json - Qualcomm Snapdragon 6 Gen4
  • ... 10 more

smartphone added

  • smartphone/acer/2015/liquid-jade-primo/acer-liquid-jade-primo-3gb-32gb-4g-lte.json - Acer Liquid Jade Primo
  • smartphone/alcatel/2020/1b-2020/alcatel-1b-2020-2gb-16gb-4g-lte.json - Alcatel 1B (2020)
  • smartphone/alcatel/2020/1se-2020/alcatel-1se-2020-3gb-32gb-4g-lte.json - Alcatel 1SE (2020)
  • smartphone/alcatel/2021/5x/alcatel-5x-4gb-128gb-4g-lte.json - Alcatel 5X
  • smartphone/alcatel/2025/v3-classic-6gb-ram-128gb/alcatel-v3-classic-6gb-ram-128gb-6gb-128gb-5g.json - Alcatel V3 Classic (6GB RAM + 128GB)
  • smartphone/alcatel/2025/v3-classic/alcatel-v3-classic-4gb-128gb-5g.json - Alcatel V3 Classic
  • smartphone/alcatel/2025/v3-pro/alcatel-v3-pro-8gb-256gb-5g.json - Alcatel V3 Pro
  • smartphone/alcatel/2025/v3-ultra-8gb-ram-128gb/alcatel-v3-ultra-8gb-ram-128gb-8gb-128gb-5g.json - Alcatel V3 Ultra (8GB RAM + 128GB)
  • smartphone/alcatel/2025/v3-ultra/alcatel-v3-ultra-6gb-128gb-5g.json - Alcatel V3 Ultra
  • smartphone/amazon/2014/fire-phone-at-t/amazon-fire-phone-at-t-2gb-32gb-4g-lte.json - Amazon Fire Phone (AT&T)
  • smartphone/apple/2019/iphone-11-128gb/apple-iphone-11-128gb-4gb-128gb-4g-lte.json - Apple iPhone 11 (128GB)
  • smartphone/apple/2019/iphone-11-256gb/apple-iphone-11-256gb-4gb-256gb-4g-lte.json - Apple iPhone 11 (256GB)
  • smartphone/apple/2019/iphone-11-pro-256gb/apple-iphone-11-pro-256gb-4gb-256gb-4g-lte.json - Apple iPhone 11 Pro (256GB)
  • smartphone/apple/2019/iphone-11-pro-512gb/apple-iphone-11-pro-512gb-4gb-512gb-4g-lte.json - Apple iPhone 11 Pro (512GB)
  • smartphone/apple/2019/iphone-11-pro-max-256gb/apple-iphone-11-pro-max-256gb-4gb-256gb-4g-lte.json - Apple iPhone 11 Pro Max (256GB)
  • ... 2318 more

tablet added

  • tablet/xiaomi/2025/redmi-k-pad/xiaomi-redmi-k-pad-8gb-256gb.json - Xiaomi Redmi K Pad

Heuristic review

  • Added records by manufacturer/brand: xiaomi: 336, samsung: 314, vivo: 256, realme: 251, oppo: 232, motorola: 127, iqoo: 86, honor: 82

  • Added records by source class: kaggle: 2359

  • Heuristic warnings: 1 total; showing first 1.

    • smartphone: smartphone/oppo/2026/k14x-5g-6gb-ram-128gb/oppo-k14x-5g-6gb-ram-128gb-6gb-128gb-5g.json: double spaces in name

@TechEngineBot

Copy link
Copy Markdown
Member

TechEngine validation stats: PASS

Data summary

Category Total Verified Unverified Missing verified Tracked Verified % of tracked
brand 189 0 60 129 60 0.0%
soc 2104 58 2046 0 2104 2.8%
smartphone 44384 184 44200 0 44384 0.4%
tablet 1172 0 1172 0 1172 0.0%
watch 176 0 176 0 176 0.0%
pda 110 0 110 0 110 0.0%
gpu 2030 0 2030 0 2030 0.0%
cpu 3977 976 3001 0 3977 24.5%
all 54142 1218 52795 129 54013 2.3%

Warning

Tracked verified coverage is below 50% for brand 0.0% (0/60), tablet 0.0% (0/1172), watch 0.0% (0/176), pda 0.0% (0/110), gpu 0.0% (0/2030), smartphone 0.4% (184/44384), all 2.3% (1218/54013), soc 2.8% (58/2104), and 1 more.
Tracked coverage excludes records missing the verified field; see the Missing verified column for those records.
This does not fail validation. Keep imported records verified: false until manual audit, but treat this as follow-up verification work before relying on the affected categories as curated data.

Validation notes

  • Full advisory outlier listings are suppressed on successful runs because they are dataset-wide and mostly stable between PRs.
  • Failure runs still include a detailed log excerpt for debugging.

Key output:

## app.validate
## integrity_check.py --strict
loaded CPU=3977 GPU=2030
✅ integrity gate: no hard anomalies.
Integrity section Flagged lines
structural 0
CPU name/tier consistency (desktop mainstream only) 0
CPU single>multi (cinebench/geekbench — should be multi>=single) 0
CPU era-vs-score outliers 8
CPU cross-source ratio outliers (possible wrong-variant) 152
GPU cross-source ratio outliers + sanity 18

@Seungpyo1007 Seungpyo1007 merged commit 456a69c into main Jun 20, 2026
4 of 5 checks passed
@github-project-automation github-project-automation Bot moved this from In Progress to Done in TechAPI-Project Jun 20, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

data Dataset changes enhancement New feature or request site Homepage and public site changes

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

Massive dataset rebuild: CPU + brand + GPU + smartphone + SoC (1989-2026)

2 participants