shigeki7777/web-scraper-python

Web scraping → structured data — case study

Problem. A website has data you need — products, listings, prices, contacts — but offers no export or API.

Solution. A scraper that crawls the pages, extracts typed, deduplicated records, and writes clean JSON/CSV ready to load into your database or spreadsheet. Static HTML sites are handled with requests + BeautifulSoup; JS-rendered sites use Playwright instead, with the same output schema.
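A minimal sketch of the extract, type, and deduplicate steps, not the repository's actual code. The field names follow the sample record in the verified run; the CSS selectors are assumptions based on the books.toscrape.com markup, and `Book`, `to_book`, `parse_page`, `dedupe`, and `to_json` are hypothetical names.

```python
import json
import re
from dataclasses import asdict, dataclass

# The demo site encodes the star rating as a CSS class word (assumed markup).
RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}


@dataclass(frozen=True)  # frozen makes records hashable, which dedupe() relies on
class Book:
    title: str
    price_gbp: float
    rating: int
    in_stock: bool


def to_book(title, price_text, rating_word, availability_text):
    """Convert raw scraped strings into one typed record."""
    match = re.search(r"\d+(?:\.\d+)?", price_text)
    if match is None:
        raise ValueError(f"no price found in {price_text!r}")
    return Book(
        title=title.strip(),
        price_gbp=float(match.group()),
        rating=RATING_WORDS[rating_word],
        in_stock="in stock" in availability_text.lower(),
    )


def parse_page(html):
    """Extract every product card on one listing page."""
    # Imported here so the typing helpers above work without bs4 installed.
    from bs4 import BeautifulSoup

    soup = BeautifulSoup(html, "html.parser")
    books = []
    for card in soup.select("article.product_pod"):
        classes = card.select_one("p.star-rating")["class"]
        rating_word = next(c for c in classes if c in RATING_WORDS)
        books.append(
            to_book(
                card.select_one("h3 a")["title"],
                card.select_one("p.price_color").get_text(),
                rating_word,
                card.select_one("p.availability").get_text(),
            )
        )
    return books


def dedupe(books):
    """Drop exact duplicate records while keeping first-seen order."""
    return list(dict.fromkeys(books))


def to_json(books):
    """Serialize records using the same keys as the sample output."""
    return json.dumps([asdict(b) for b in books], ensure_ascii=False)
```

For a JS-rendered site, only the step that produces `html` would change under this design; `parse_page` and everything after it would stay the same, which is one way to keep the output schema identical.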

Verified run (live crawl):

scraped 60 records from 3 pages
price £12.84–£57.31  ·  avg £35.00  ·  ratings 1–5: {15, 8, 13, 10, 14}
sample: {"title":"A Light in the Attic","price_gbp":51.77,"rating":3,"in_stock":true}
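The summary lines above can be derived from the written records. The following is a sketch of one way to compute them, assuming records shaped like the sample; `summarize` is a hypothetical helper, and the three records in the usage example are made-up illustration data, not output from the run.

```python
from collections import Counter
from statistics import mean


def summarize(records):
    """Compute count, price range, average price, and a rating histogram."""
    prices = [r["price_gbp"] for r in records]
    rating_counts = Counter(r["rating"] for r in records)
    return {
        "count": len(records),
        "price_min": min(prices),
        "price_max": max(prices),
        "price_avg": round(mean(prices), 2),
        # Always report all five rating buckets, including empty ones.
        "ratings": {stars: rating_counts.get(stars, 0) for stars in range(1, 6)},
    }


# Made-up records for illustration only.
demo = [
    {"title": "A", "price_gbp": 10.0, "rating": 3, "in_stock": True},
    {"title": "B", "price_gbp": 20.0, "rating": 5, "in_stock": True},
    {"title": "C", "price_gbp": 30.0, "rating": 3, "in_stock": False},
]
summary = summarize(demo)
```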

Why it sells: clean, typed, deduplicated output you can use immediately, with polite crawling and resilience to layout changes. Delivered in under a day. (Demo target: books.toscrape.com, a public scraping sandbox.)
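"Polite crawling" is not spelled out in this case study. A common interpretation, sketched here as an assumption, is to honor robots.txt and pause between requests. The `crawl` function, the `demo-bot` user agent, and the one-second default delay are all hypothetical; `fetch` is any callable that returns a page body for a URL.

```python
import time
from urllib.robotparser import RobotFileParser


def crawl(urls, robots, fetch, user_agent="demo-bot", default_delay=1.0,
          sleep=time.sleep):
    """Fetch allowed URLs one at a time, pausing between requests."""
    # Prefer the site's Crawl-delay directive when robots.txt declares one.
    delay = robots.crawl_delay(user_agent) or default_delay
    pages = {}
    for url in urls:
        if not robots.can_fetch(user_agent, url):
            continue  # skip anything robots.txt disallows
        pages[url] = fetch(url)
        sleep(delay)
    return pages


# Offline usage example: parse robots.txt rules from memory instead of the network.
robots = RobotFileParser()
robots.parse(["User-agent: *", "Disallow: /private/", "Crawl-delay: 2"])

pauses = []
pages = crawl(
    ["https://example.com/catalogue/", "https://example.com/private/x"],
    robots,
    fetch=lambda url: f"<html>{url}</html>",  # stand-in for a real HTTP GET
    sleep=pauses.append,  # record pauses instead of actually waiting
)
```

In a real run, `robots.set_url(...)` followed by `robots.read()` would load the live robots.txt, and `fetch` could wrap `requests.get`.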

About

Python web scraper → clean structured JSON (requests + BeautifulSoup). Portfolio case study.