This project is a crawler for Wikipedia pages.
Install the required modules found in requirements.pip
file.
You can run the script using the follow command:
python wikicrawler.py --start-url=<wiki_url> --total-files=<num>
Where wiki_url
is wikipedia page URL to starting crawler and num
is the number of files to be saved(every files correspond to one wikipedia crawled page).