Web scraping → structured data — case study

Problem. A website has data you need — products, listings, prices, contacts — but offers no export or API.

Solution. A scraper that crawls the pages, extracts typed, deduped records, and writes clean JSON/CSV ready to load into your database or spreadsheet. Static HTML uses requests + BeautifulSoup; JS-rendered sites swap in Playwright with the same output schema.

Verified run (live crawl):

scraped 60 records from 3 pages
price £12.84–£57.31  ·  avg £35.00  ·  ratings 1–5: {15, 8, 13, 10, 14}
sample: {"title":"A Light in the Attic","price_gbp":51.77,"rating":3,"in_stock":true}

Why it sells: clean, typed, deduped output you can use immediately — with polite crawling and resilience to layout changes. (Demo target: books.toscrape.com, a public scraping sandbox.) Delivered in under a day.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
README.md		README.md
books.json		books.json
scraper.py		scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web scraping → structured data — case study

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Web scraping → structured data — case study

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages