Skip to content

Latest commit

 

History

History
10 lines (6 loc) · 472 Bytes

File metadata and controls

10 lines (6 loc) · 472 Bytes

Python-Web-Scraper-Coursework

Scrapes the source code of Web page

Getwebinfo is main script. It uses the three other scripts (download.py, forensic.py, scan.py).

Download.py downloads files (e.g. images) from webpage.

Forensic.py attempty MD5 Dictionary hash crack on found hashes, as well as checking downloaded files against a dictionary of bad files.

Scan.py extracts contents from website. Looks for emails, phone numbers, md5 hashes, files and hyperlinks.