Skip to content

ch3ngo/linkedin-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

 _     _       _            _ _         ____
| |   (_)_ __ | | _____  __| (_)_ __   / ___|  ___ _ __ __ _ _ __   ___ _ __
| |   | | '_ \| |/ / _ \/ _` | | '_ \  \___ \ / __| '__/ _` | '_ \ / _ \ '__|
| |___| | | | |   <  __/ (_| | | | | |  ___) | (__| | | (_| | |_) |  __/ |
|_____|_|_| |_|_|\_\___|\__,_|_|_| |_| |____/ \___|_|  \__,_| .__/ \___|_|
                                                              |_|

linkedin-scraper

Unauthenticated LinkedIn company OSINT tool. Given only a company profile URL, collects company info, employee names and positions, optionally generates email lists, and integrates DNSDumpster for domain recon.

Built for authorized security engagements and red team initial reconnaissance phases.


Features

  • Company profile scraping (name, description, industry, size, headquarters, website)
  • Employee list extraction via headless browser (handles dynamic JS-rendered content)
  • Email list generation from a provided format pattern (e.g. {first_initial}{lastname}@domain.com)
  • Email verification via SMTP probing and common verification APIs
  • DNSDumpster integration for domain recon from the company domain
  • Multi-format report export: JSON, CSV, HTML
  • Pure CLI interface, no credentials required

Installation

Requirements: Python 3.10+, Google Chrome or Chromium

git clone https://github.com/ch3ngo/linkedin-scraper.git
cd linkedin-scraper
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
playwright install chromium

Usage

Scrape company info and employees

python3 -m linkedin_scraper --url "https://www.linkedin.com/company/target-company/"

Generate email list from employee data

python3 -m linkedin_scraper \
  --url "https://www.linkedin.com/company/target-company/" \
  --email-format "{first}.{last}@target.com" \
  --verify-emails

Run DNSDumpster recon on discovered domain

python3 -m linkedin_scraper \
  --url "https://www.linkedin.com/company/target-company/" \
  --dnsdumpster \
  --domain target.com

Full recon with all outputs

python3 -m linkedin_scraper \
  --url "https://www.linkedin.com/company/target-company/" \
  --email-format "{first_initial}{last}@target.com" \
  --verify-emails \
  --dnsdumpster \
  --domain target.com \
  --output ./output/ \
  --format json csv html

Email format patterns

Pattern Example result
{first}.{last}@domain.com john.doe@domain.com
{first_initial}{last}@domain.com jdoe@domain.com
{first}_{last}@domain.com john_doe@domain.com
{first}@domain.com john@domain.com
{last}@domain.com doe@domain.com

Output formats

Format File
JSON output/report_<timestamp>.json
CSV output/employees_<timestamp>.csv
HTML output/report_<timestamp>.html

Project structure

linkedin-scraper/
├── linkedin_scraper/
│   ├── __main__.py          # CLI entry point
│   ├── modules/
│   │   ├── company.py       # Company info scraper
│   │   ├── employees.py     # Employee list scraper (headless browser)
│   │   ├── email_gen.py     # Email list generator
│   │   ├── email_verify.py  # Email verification module
│   │   └── dnsdumpster.py   # DNSDumpster integration
│   └── utils/
│       ├── browser.py       # Headless browser helpers (Playwright)
│       ├── exporter.py      # Multi-format report exporter
│       └── logger.py        # Logging configuration
├── tests/
├── docs/
├── output/                  # Generated reports (gitignored)
├── requirements.txt
└── README.md

Disclaimer

This tool is intended for use in authorized security engagements only. Scraping LinkedIn without authorization may violate LinkedIn's Terms of Service and applicable laws. The author assumes no responsibility for misuse. Always obtain written authorization before running OSINT tools against any target.


License

MIT

About

Unauthenticated LinkedIn company OSINT tool — collects company info, employees, generates email lists, and integrates DNSDumpster for domain recon.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages