Skip to content

Latest commit

 

History

History
34 lines (23 loc) · 1.2 KB

README.rst

File metadata and controls

34 lines (23 loc) · 1.2 KB

HEPcrawl

HEPcrawl is a harvesting library based on Scrapy (http://scrapy.org) for INSPIRE-HEP (http://inspirehep.net) that focuses on automatic and semi-automatic retrieval of new content from all the sources the site aggregates. In particular content from major and minor publishers in the field of High-Energy Physics.

The project is currently in early stage of development.

See full documentation at http://pythonhosted.org/hepcrawl