Scrape URLs from multiple websites

This repository helps you download all the press releases from multiple websites. It is a simple way to build a custom database of press releases without manual effort.

Instructions

  • Install the dependencies: pip install -r requirements
  • Run the scraper: python url_extract.py

Reference

The solution is based on the documentation of the following libraries:

  • [beautifulsoup4] to scrape information from web pages
  • [pandas] for data structuring
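
A minimal sketch of how these two libraries might be combined is shown below. It is not the code in url_extract.py: the function name extract_links, the column names, and the sample HTML are all hypothetical, chosen only to illustrate the idea of parsing links with beautifulsoup4 and structuring them with pandas.

```python
from urllib.parse import urljoin

import pandas as pd
from bs4 import BeautifulSoup


def extract_links(html: str, base_url: str) -> pd.DataFrame:
    """Collect every hyperlink in `html` as one row of a DataFrame.

    Hypothetical helper for illustration; not taken from url_extract.py.
    """
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    # href=True skips <a> tags that carry no link target.
    for anchor in soup.find_all("a", href=True):
        rows.append(
            {
                "source": base_url,
                "title": anchor.get_text(strip=True),
                # Resolve relative links against the page they came from.
                "url": urljoin(base_url, anchor["href"]),
            }
        )
    return pd.DataFrame(rows, columns=["source", "title", "url"])


# Made-up sample page standing in for a downloaded press-release listing.
SAMPLE_HTML = """
<ul>
  <li><a href="/press/2021-01-launch">Product launch</a></li>
  <li><a href="https://example.com/press/2021-02-results">Quarterly results</a></li>
  <li><a>No link here</a></li>
</ul>
"""

links = extract_links(SAMPLE_HTML, "https://example.com/news")
print(links)
```

To cover several websites, one could call such a function once per downloaded page and combine the results with pd.concat. Fetching the pages themselves (for example with the requests library) is left out here so the sketch runs without network access.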