Skip to content

20127304-AQ/Applied_DataScience_Phone_Analysis

Repository files navigation

Applied_DataScience_Phone_Analysis

CRAWL DATA INSTRUCTION

Set up

Before running the code, here are the pre-requisites:

  • Download the Chromedriver from here. Download the version that matches your current Chrome version. (In my case, I downloaded version 122.0.6261.94)
  • Install the required packages using the following command:
pip install -r requirements.txt
  • Make sure to have a stable internet connection.
  • Store the Chromedriver in the same directory as the source code. We will change the structure of the repository after the code is stable.

How to run the code:

  • Run the following command:
python main.py

It will create a data folder containing the file with links to each product.

To perform crawling, inside the notebook, change the name of the data/some-name.csv and replace the some-name with the current file name.

The execution time depends on the amount of data.

Note: The crawling code can be executed until April 27, 2024. Any necessary adjustments to the code may include handling pop-ups or navigating through next-page buttons. Additionally, ensure compatibility with Chrome version 122.0 or refer to the documentation to determine the suitable Chrome version for the Chrome driver.

Demo video

My Demo

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors