Selenium Python Scraper

This project is a web scraper built using Python and Selenium. The scraper extracts book information from Audible's search results, including the book title, author, length, and release date. The extracted data is saved into a CSV file for further analysis.

Prerequisites

Python 3.8+
Google Chrome browser installed

Setup

Clone the repository: git clone https://github.com/razvanalexuc/Selenium_Python_Scraper.git cd Selenium_Python_Scraper
Create and activate a virtual environment (optional but recommended): python -m venv .venv source .venv/bin/activate # On Windows: .venv\Scripts\activate
Install the required Python packages: pip install -r requirements.txt
Run the scraper: python src/main.py

How It Works

Selenium WebDriver: Used to automate the browser interactions.
Pagination Handling: The scraper navigates through all pages of the search results to ensure complete data extraction.
Error Handling: The scraper is designed to handle exceptions gracefully, ensuring that partial data is not lost.

Troubleshooting

NoSuchWindowException: This error may occur if the browser window closes unexpectedly. Make sure the browser window remains open throughout the scraping process.
Data Not Saving: Ensure that the data/ directory exists and that you have the necessary permissions to write to this directory.

Contributing

Contributions are welcome! Please fork the repository and submit a pull request for any enhancements or bug fixes.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Selenium Python Scraper

Prerequisites

Setup

How It Works

Troubleshooting

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.venv		.venv
data		data
src		src
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Selenium Python Scraper

Prerequisites

Setup

How It Works

Troubleshooting

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages