Patent parser to download and parse all patent files between 2010 - 2020 from U.S. Govt

eliserust/Patent_Project

Patent Project

Elise Rust & Zachary Poley

CLEAR Ventures, November 2021

A patent parser that downloads and parses all patent files available at https://bulkdata.uspto.gov/ for the years 2010-2020.

Directory:

  • Patent_Link_Scraper.py
  • Patent_Parser.py
    • Unzips the downloaded zip files and generates a dictionary for each patent
  • Zip_Download.py
    • Sub-script of Patent_Parser.py containing only the zip download/unzip process
  • Zip_to_Dict.py
    • Sub-script of Patent_Parser.py containing only the parsing step, which generates a dictionary from each patent .html file
  • ZipLinks.csv
    • Table of zip file links containing patents
  • Patents.json
    • Sample dictionaries for the first five patents in the list
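
The flow described in the directory above (read the zip links, download and unzip each archive, build one dictionary per patent .html file) can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the code in the repository's scripts. The helper names (`read_zip_links`, `download_zip`, `extract_html_files`, `patent_to_dict`), the assumed layout of ZipLinks.csv (any cell ending in `.zip` is treated as a link), and the dictionary keys (`file`, `title`, `text`) are all assumptions made for the example.

```python
import csv
import io
import json
import zipfile
from html.parser import HTMLParser
from urllib.request import urlopen


def read_zip_links(csv_text):
    """Return every CSV cell that looks like a .zip link (column layout is assumed)."""
    rows = csv.reader(io.StringIO(csv_text))
    return [cell.strip() for row in rows for cell in row
            if cell.strip().lower().endswith(".zip")]


def download_zip(url):
    """Fetch one archive into memory; very large archives may need streaming to disk."""
    with urlopen(url) as response:
        return response.read()


def extract_html_files(zip_bytes):
    """Map each .html/.htm member of the archive to its decoded text."""
    pages = {}
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for name in archive.namelist():
            if name.lower().endswith((".html", ".htm")):
                pages[name] = archive.read(name).decode("utf-8", errors="replace")
    return pages


class _TextCollector(HTMLParser):
    """Collects the <title> text and all other visible text from one page."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.title_parts = []
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.title_parts.append(data)
        elif data.strip():
            self.text_parts.append(data.strip())


def patent_to_dict(name, html):
    """Build one dictionary per patent file; the keys here are hypothetical."""
    collector = _TextCollector()
    collector.feed(html)
    collector.close()
    return {
        "file": name,
        "title": "".join(collector.title_parts).strip(),
        "text": " ".join(collector.text_parts),
    }


def parse_archive(zip_bytes):
    """Unzip one archive and return a list of patent dictionaries."""
    return [patent_to_dict(name, html)
            for name, html in extract_html_files(zip_bytes).items()]
```

With these helpers, something like `json.dumps(parse_archive(download_zip(link))[:5], indent=2)` for a link returned by `read_zip_links` would produce a small sample in the spirit of Patents.json. The real field names and file layout depend on the repository's scripts and the USPTO archives.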
