Telegraphcrawler

About

Telegraphcrawler is a tool to index telegra.ph pages based on the provided wordslist. Inspired by nudecrawler, this project uses multithreading alongside with multiprocessing, but doesn't feature any tools to filter discovered content for now. Html pages, links to images and videos are saved into postgreSQL database for future analyzing.

Usage

Create folder to work in:

mkdir telegraphcrawler
cd telegraphcrawler

Download docker-compose and .env_example files:

curl -LO https://raw.githubusercontent.com/Inejka/telegraphcrawler/master/docker-compose.yml
curl -LO https://raw.githubusercontent.com/Inejka/telegraphcrawler/master/.env_example

Rename .env_example to .env and change default values if needed (at least default password):

mv ./.env_example ./.env

Create work folder and dict folder, fill your first wordlist:

mkdir work
mkdir work/dict
echo -e "cat\nkitty\ndog\nduck\nrabbit\nchicken\nguinea-pig\ndonkey\npigeon\ngoose\nllama\nalpaca\ngoldfish\nparrot" > work/dict/first.txt

Run with docker-compose and enter TUI:

docker-compose run --rm crawler

Stop db after work:

docker-compose stop

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
app		app
.env_example		.env_example
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build_and_up.bat		build_and_up.bat
build_and_up.sh		build_and_up.sh
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Telegraphcrawler

About

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Telegraphcrawler

About

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages