data(gpu): add remaining Kaggle GPU specs #17

Merged
Seungpyo1007 merged 4 commits into main from data/import-staging on Jun 17, 2026

@Seungpyo1007

Summary

  • add 58 additional GPU records from the Kaggle TechPowerUp export, all with verified: false
  • split the import into legacy, ATI/AMD, and modern GPU commits for reviewable history
  • regenerate the static public dump after the data changes

Import notes

  • source: https://www.kaggle.com/datasets/ellimaaac/gpus-specs-from-1986-to-2026
  • skipped existing slug/name matches to avoid reintroducing duplicate records
  • skipped rows with missing required schema fields, duplicate candidate slugs, or future release dates, as well as obviously invalid entries such as "Fake Card"
  • all imported records remain unverified until manually reviewed against primary/vendor sources
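
The skip rules above could be sketched roughly as follows. This is a minimal illustration, not the import script used in this PR: the field names in `REQUIRED_FIELDS`, the `slugify` helper, the ISO date format, and the choice to keep the first row when candidate slugs collide are all assumptions.

```python
import re
from datetime import date

# Hypothetical required fields; the real schema is not shown in this PR.
REQUIRED_FIELDS = ("name", "manufacturer", "release_date")
# Names treated as obviously invalid (the PR mentions "Fake Card").
INVALID_NAMES = {"fake card"}


def slugify(name):
    """Lowercase the name and join alphanumeric runs with hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")


def select_importable(rows, existing_slugs, existing_names, today):
    """Return the rows that survive the skip rules, marked unverified."""
    accepted = []
    seen_slugs = set()
    for row in rows:
        # Skip rows with missing required schema fields.
        if any(not row.get(field) for field in REQUIRED_FIELDS):
            continue
        name = row["name"].strip()
        slug = slugify(name)
        # Skip obviously invalid entries.
        if name.lower() in INVALID_NAMES:
            continue
        # Skip existing slug/name matches to avoid duplicate records.
        if slug in existing_slugs or name.lower() in existing_names:
            continue
        # Skip duplicate candidate slugs (assumption: the first row wins).
        if slug in seen_slugs:
            continue
        # Skip future-dated rows (assumption: ISO "YYYY-MM-DD" dates).
        if date.fromisoformat(row["release_date"]) > today:
            continue
        seen_slugs.add(slug)
        accepted.append({**row, "slug": slug, "verified": False})
    return accepted
```

Every accepted row is written with `verified: False`, matching the note that imported records stay unverified until reviewed manually.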

Validation

  • python -m app.validate
  • python TechEngine\integrity_check.py data --strict

Refs #1

@TechEngineBot

TechEngine change review: PASS

| Check | Result |
| --- | --- |
| python -m app.validate | PASS |
| python integrity_check.py TechAPI/data --strict | PASS |

Changed data

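| Category | Added | Modified | Deleted | Added verified | Added unverified | Added Kaggle-sourced |
| --- | --- | --- | --- | --- | --- | --- |
| brand | 0 | 0 | 0 | 0 | 0 | 0 |
| soc | 0 | 0 | 0 | 0 | 0 | 0 |
| smartphone | 0 | 0 | 0 | 0 | 0 | 0 |
| gpu | 58 | 0 | 0 | 0 | 58 | 58 |
| cpu | 0 | 0 | 0 | 0 | 0 | 0 |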

Changed record examples

gpu added

  • gpu/3dfx/1996/consumer/voodoo-graphics-4-mb.json - Voodoo Graphics 4 MB
  • gpu/3dfx/1998/consumer/voodoo-banshee-agp-16-mb.json - Voodoo Banshee AGP 16 MB
  • gpu/3dfx/1998/consumer/voodoo-banshee-pci-16-mb.json - Voodoo Banshee PCI 16 MB
  • gpu/3dfx/1998/consumer/voodoo2-12-mb.json - Voodoo2 12 MB
  • gpu/3dfx/1998/consumer/voodoo2-8-mb.json - Voodoo2 8 MB
  • gpu/3dfx/1999/consumer/velocity-100.json - Velocity 100
  • gpu/3dfx/1999/consumer/voodoo3-1000-agp.json - Voodoo3 1000 AGP
  • gpu/3dfx/1999/consumer/voodoo3-2000-agp.json - Voodoo3 2000 AGP
  • gpu/3dfx/1999/consumer/voodoo3-2000-pci.json - Voodoo3 2000 PCI
  • gpu/3dfx/1999/consumer/voodoo3-3000-agp.json - Voodoo3 3000 AGP
  • gpu/3dfx/1999/consumer/voodoo3-3000-pci.json - Voodoo3 3000 PCI
  • gpu/3dfx/1999/consumer/voodoo3-3500-tv-agp.json - Voodoo3 3500 TV AGP
  • gpu/amd/2006/consumer/stream-processor.json - Stream Processor
  • gpu/amd/2014/consumer/radeon-r5-a220.json - Radeon R5 A220
  • gpu/amd/2014/consumer/radeon-r5-a240.json - Radeon R5 A240
  • ... 43 more

Heuristic review

  • Added records by manufacturer/brand: ati: 18, nvidia: 14, 3dfx: 12, intel: 8, amd: 6
  • Added records by source class: kaggle: 58
  • Heuristic warnings: none found.
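
A per-brand tally like the one above can be derived from the record paths alone. The sketch below assumes the `category/brand/year/segment/slug.json` layout seen in the changed record examples; `count_by_brand` is a hypothetical helper, not the bot's actual code.

```python
from collections import Counter


def count_by_brand(paths, category="gpu"):
    """Count records per brand, reading the brand from the second path segment."""
    counts = Counter()
    for path in paths:
        parts = path.split("/")
        # Assumed layout: category/brand/year/segment/slug.json
        if len(parts) == 5 and parts[0] == category:
            counts[parts[1]] += 1
    return counts
```

Paths that do not match the assumed five-segment layout are ignored rather than guessed at.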

@TechEngineBot

TechEngine validation stats: PASS

Data summary

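| Category | Total | Verified | Unverified | Missing verified | Verified % |
| --- | --- | --- | --- | --- | --- |
| brand | 129 | 0 | 0 | 129 | n/a |
| soc | 123 | 58 | 65 | 0 | 47.2% |
| smartphone | 367 | 184 | 183 | 0 | 50.1% |
| gpu | 2030 | 0 | 2030 | 0 | 0.0% |
| cpu | 3977 | 976 | 3001 | 0 | 24.5% |
| all | 6626 | 1218 | 5279 | 129 | 18.7% |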
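
The Verified % column appears to be computed over records that carry a `verified` flag, that is, verified / (verified + unverified), with records missing the flag left out of the denominator. This is inferred from the numbers in the table (for example, the `all` row gives 1218 / 6497, not 1218 / 6626), not confirmed by the bot's source. A minimal sketch under that assumption, with `verified_pct` as a hypothetical helper name:

```python
def verified_pct(verified, unverified):
    """Share of flagged records that are verified, or "n/a" if none are flagged."""
    flagged = verified + unverified
    if flagged == 0:
        return "n/a"
    return f"{100 * verified / flagged:.1f}%"
```

For the `soc` row this gives `verified_pct(58, 65)`, which evaluates to `"47.2%"`, and for the `brand` row, where no record has the flag, it returns `"n/a"`.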

Validation notes

  • Full advisory outlier listings are suppressed on successful runs because they are dataset-wide and mostly stable between PRs.
  • Failure runs still include a detailed log excerpt for debugging.

Key output:

## app.validate
## integrity_check.py --strict
loaded CPU=3977 GPU=2030
✅ integrity gate: no hard anomalies.
| Integrity section | Flagged lines |
| --- | --- |
| structural | 0 |
| CPU name/tier consistency (desktop mainstream only) | 0 |
| CPU single>multi (cinebench/geekbench — should be multi>=single) | 0 |
| CPU era-vs-score outliers | 8 |
| CPU cross-source ratio outliers (possible wrong-variant) | 152 |
| GPU cross-source ratio outliers + sanity | 18 |

Seungpyo1007 merged commit 6b172ce into main on Jun 17, 2026
3 checks passed