░█████╗░░░░░░░░█████╗░░██████╗██╗███╗░░██╗████████╗ ██╔══██╗░░░░░░██╔══██╗██╔════╝██║████╗░██║╚══██╔══╝ ██║░░╚═╝█████╗██║░░██║╚█████╗░██║██╔██╗██║░░░██║░░░ ██║░░██╗╚════╝██║░░██║░╚═══██╗██║██║╚████║░░░██║░░░ ╚█████╔╝░░░░░░╚█████╔╝██████╔╝██║██║░╚███║░░░██║░░░ ░╚════╝░░░░░░░░╚════╝░╚═════╝░╚═╝╚═╝░░╚══╝░░░╚═╝░░░ COVID Open Source Intelligence Tool for the Dark Web
C-OSINT is a python framework for extracting web pages (regular or onion) over the TOR network and the surface web.
-
Please note: Crawling through the TOR network takes time. You can find more information here.
-
Warning: Crawling is not illegal, but copyright infringement is. It's always best to double check a website's T&Cs before crawling it. Some websites set what is called robots.txt to tell the crawlers not to visit those pages. We always recommend complying with robots.txt.
The framework is divided into three parts: