This repository contains an open-source dataset of bicycle brands, models, and sizes, extracted from a collection of 25,000+ real-world bicycle listings. The data is organized by brand: each brand has its own JSON file listing its unique models and their size details.
/bicycle-brands-models
│── /brands # Contains JSON files for each brand
│── README.md # This documentation
- /brands: Contains individual JSON files for each brand (e.g., santa-cruz.json, giant.json).
- README.md: Documentation about the dataset.
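Given this layout, all brand files can be read in one pass. The following is a minimal sketch rather than part of the repository: the helper name `load_brands` is hypothetical, and it assumes every `.json` file directly under the brands directory is a brand file.

```python
import json
from pathlib import Path


def load_brands(brands_dir):
    """Return a dict mapping each file stem (e.g. 'santa-cruz') to its parsed JSON.

    Hypothetical helper; assumes every *.json file in brands_dir is a brand file.
    """
    brands = {}
    for path in sorted(Path(brands_dir).glob("*.json")):
        with path.open(encoding="utf-8") as fh:
            brands[path.stem] = json.load(fh)
    return brands
```

Run from the repository root, `load_brands("brands")` would return one entry per brand file, keyed by file name without the extension.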
Each brand JSON file contains structured data about bicycle models, including:
- Brand Name (brand)
- List of Models, each with:
  - model: The model name
  - type: The bicycle category (e.g., Gravel, Mountain, Road)
  - ebike: Boolean indicating if it is an eBike
  - suspension: Suspension type, if applicable
  - sizes: Available sizes, each with:
    - size: The size label (e.g., M, L, XL)
    - frame_size: The numerical frame size (if available)
    - rider_height_min: Suggested minimum rider height
    - rider_height_max: Suggested maximum rider height
    - measurement_unit: Unit of measurement (usually cm)
{
  "brand": "Santa Cruz",
  "models": [
    {
      "model": "Stigmata Force AXS / Carbon CC / 700c",
      "type": "Gravel",
      "ebike": false,
      "suspension": null,
      "sizes": [
        {
          "size": "M",
          "frame_size": 56,
          "rider_height_min": 174,
          "rider_height_max": 184,
          "measurement_unit": "cm"
        }
      ]
    }
  ]
}

You can use this dataset for:
- Bicycle eCommerce websites: Organize and filter models and sizes.
- Bike classification & categorization: Standardize bike models.
- Data visualization & analytics: Compare brands, frame sizes, and trends.
- Machine Learning / AI applications: Build recommendation models for bikes.
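As an illustration of the filtering use case, the sketch below finds the sizes whose suggested rider-height range covers a given height. The function name `sizes_for_height` is hypothetical, and the code assumes heights are in centimetres; entries with missing bounds or another unit are skipped rather than converted.

```python
def sizes_for_height(brand_data, height_cm):
    """Return (model, size) pairs whose suggested rider range includes height_cm."""
    matches = []
    for model in brand_data.get("models", []):
        for size in model.get("sizes", []):
            low = size.get("rider_height_min")
            high = size.get("rider_height_max")
            # Skip sizes without a usable height range.
            if low is None or high is None:
                continue
            # Assumption: only centimetre ranges are compared; other units are skipped.
            if size.get("measurement_unit") != "cm":
                continue
            if low <= height_cm <= high:
                matches.append((model["model"], size["size"]))
    return matches


# Sample data copied from the example brand file shown in this README.
example = {
    "brand": "Santa Cruz",
    "models": [
        {
            "model": "Stigmata Force AXS / Carbon CC / 700c",
            "type": "Gravel",
            "ebike": False,
            "suspension": None,
            "sizes": [
                {
                    "size": "M",
                    "frame_size": 56,
                    "rider_height_min": 174,
                    "rider_height_max": 184,
                    "measurement_unit": "cm",
                }
            ],
        }
    ],
}

print(sizes_for_height(example, 180))
# [('Stigmata Force AXS / Carbon CC / 700c', 'M')]
```

A rider of 160 cm falls outside the 174 to 184 cm range in the sample, so the same call with `160` returns an empty list.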
Clone the repository:
git clone https://github.com/reaatech/bicycle-brands-models.git
cd bicycle-brands-models

- If you find errors or missing data, feel free to submit an issue.
- If you want to improve the dataset, fork the repo and submit a pull request.
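Before submitting a pull request, a quick structural check on an edited brand file can catch obvious mistakes. This is only a sketch and not an official validator for this repository: `check_brand` is a hypothetical helper, and it assumes every key described above is always present (with null where a value is not applicable, as in the example file). Relax the key sets if the actual files omit optional keys.

```python
# Assumption: these keys are always present, using null for "not applicable".
REQUIRED_MODEL_KEYS = {"model", "type", "ebike", "suspension", "sizes"}
REQUIRED_SIZE_KEYS = {
    "size",
    "frame_size",
    "rider_height_min",
    "rider_height_max",
    "measurement_unit",
}


def check_brand(data):
    """Return a list of human-readable problems found in one parsed brand file."""
    problems = []
    if not isinstance(data.get("brand"), str):
        problems.append("missing or non-string 'brand'")
    for i, model in enumerate(data.get("models", [])):
        missing = REQUIRED_MODEL_KEYS - model.keys()
        if missing:
            problems.append(f"models[{i}] missing keys: {sorted(missing)}")
        for j, size in enumerate(model.get("sizes", [])):
            missing = REQUIRED_SIZE_KEYS - size.keys()
            if missing:
                problems.append(
                    f"models[{i}].sizes[{j}] missing keys: {sorted(missing)}"
                )
            low = size.get("rider_height_min")
            high = size.get("rider_height_max")
            # A minimum above the maximum is almost certainly a data-entry error.
            if low is not None and high is not None and low > high:
                problems.append(
                    f"models[{i}].sizes[{j}] has rider_height_min > rider_height_max"
                )
    return problems
```

An empty list means no problems were detected by these particular checks; it does not guarantee the data itself is accurate.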
This dataset is open-source and available under the MIT License. Feel free to use, modify, and share!
For questions or collaborations, reach out via GitHub Issues.