Skip to content

zehpatricio/WikiCrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

WikiCrawler

This project is a crawler for Wikipedia pages.

Setting up enviroment

Install the required modules found in requirements.pip file.

How to use

You can run the script using the follow command:

python wikicrawler.py --start-url=<wiki_url> --total-files=<num>

Where wiki_url is wikipedia page URL to starting crawler and num is the number of files to be saved(every files correspond to one wikipedia crawled page).

About

Webcrawler for Wikipedia pages

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages