Skip to content

iggisv9t/tgcrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

What is this

This is simple telegram crawler and parser based on bellingcat's scraper
It parses posts from channels, looks for t.me links and tries to parse them.
Try to look here (may require some RAM) screenshot

(how to publish large graphs online)

Setup

  • You should have basic python knowledge to use this.
  • First create virtualenv
  • Then install dependencies: pip install -r requirements.txt
  • Check source code. It expects creds.txt for sqlalchemy connection string for your DB in ./ and channels.csv as seed list to start scraping. channels.csv should be created before start. Expected columns: chname, degree, where chname is channel id, degree -- is the number of connected channels.

Usage

python scraper.py

Similar projects

https://antcating.github.io/telegram_connections_frontend/

Contributing

  1. Please open issue before any pull request.
  2. All code contributions require forks
  3. Suggest features and ideas in discussions
  4. Feel free to develop it in your own way in your fork

Gratitude

Big thanks to @antcating for inspiration
And Bellingcat for snscrape

About

Simple crawler for telegram channels

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages