This repository contains a Wikipedia-scraper that will scrape the entire wikipedia page and tell us the top 10 Most Frequently used words. It is made using Python as a Backend and HTML CSS in the frontend. I have also deployed it on heroku : https://wikipedia-scraper.herokuapp.com/
Enter a valid wikipedia URL and click on submit.
It will show the results i.e. top 10 frequently used words on that page.
Project is created with : HTML, CSS, Flask, Python.
- Clone the repo
$ git clone https://github.com/TanmayThaker/Wikipedia-Scraper.git
$ cd Wikipedia-Scraper
- Initialize and activate a virtualenv:
$ virtualenv --no-site-packages env
$ source env/bin/activate
- Install the dependencies:
$ pip install -r requirements.txt
- Run the development server:
$ python app.py
- Navigate to http://localhost:8000