Wikipedia-Scraper

About the project

This repository contains a Wikipedia scraper that scrapes an entire Wikipedia page and reports its 10 most frequently used words. The backend is written in Python, and the frontend uses HTML and CSS. The app is also deployed on Heroku: https://wikipedia-scraper.herokuapp.com/

Enter a valid Wikipedia URL and click Submit.

The app then shows the results, i.e. the 10 most frequently used words on that page.
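
The counting step can be sketched as follows. This is a minimal illustration using only the Python standard library, not the code in app.py: the names `top_words` and `_TextExtractor` are hypothetical, and the choices to lowercase the text, keep only runs of ASCII letters, and skip script and style contents are assumptions made for the example.

```python
import re
from collections import Counter
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside <script> or <style>
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def top_words(html, n=10):
    """Return the n most frequent words in the HTML as (word, count) pairs."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    # Assumption: a "word" is a run of ASCII letters; digits and punctuation are dropped.
    words = re.findall(r"[a-z]+", text)
    return Counter(words).most_common(n)
```

In a real run the HTML would first be downloaded from the submitted Wikipedia URL, and the returned pairs would be passed to a template for display. A practical version would probably also filter out common stop words such as "the" and "of", which otherwise dominate the counts.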

Tech Stack used

The project is built with HTML, CSS, Flask, and Python.

Install and Run

Clone the repo:

$ git clone https://github.com/TanmayThaker/Wikipedia-Scraper.git
$ cd Wikipedia-Scraper

Initialize and activate a virtualenv:

$ virtualenv env
$ source env/bin/activate

Install the dependencies:

$ pip install -r requirements.txt

Run the development server:

$ python app.py

Navigate to http://localhost:8000

