A simple Python script that scrapes web pages for PDF files and downloads them to a local directory.
- Clone this repository.
- Install Python.
- Install Pip.
- Install
pip installl beautifulsoup4
andpip install urllib3
in your terminal. - Place the web page URL and output file location in the
main.py
file here:
# Define your URL
url = "https://yourWebsiteURL"
#If there is no such folder, the script will create one automatically
folder_location = r'/YOUR/OUTPUT/FILE/PATH'
- Run the script:
python main.py
- PDF files will be downloaded to your local directory.
This project is released under the terms of The Unlicense, which allows you to use, modify, and distribute the code as you see fit.
- The Unlicense removes traditional copyright restrictions, giving you the freedom to use the code in any way you choose.
- For more details, see the LICENSE file in this repository.
Author: Scott Grivner
Email: scott.grivner@gmail.com
Website: scottgrivner.dev
Reference: Main Branch