Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRefine and Solr. Part of the FID Romanistik software stack
-
Updated
Nov 28, 2018 - Shell
Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRefine and Solr. Part of the FID Romanistik software stack
A wrapper class on top of oai-pmh module from @andrenarchy
Powerful automation CLI for daily operations for VMware / Pure Storage
Repository containing scripts leveraging the power of python for data mining for small-scale to large scale flexible harvesting. Also implementing NoSQL DB to enhance logic flexibility
Information Gathering / Bing - Certificate - CertSpotter - Google - LinkedIn - OTX- PortScanner - ThreatCrowd - VirusTotal - Yahoo
Harvest all Products on one Page and save it in a database
The Open Access Harvester starts with an xml list of numbered, bibliographic citations (say, from a CV) and tries to find and fetch all open access documents.
Harvest texts from vk.com through API.
Universal node.js scraper, is a simple tool to crawl web pages and extract content that can then be stored in csv files (sheets) or directly into a database
An efficient and flexible web scraper.
The miner for ChiaPP (chia pool protocol).
Add a description, image, and links to the harvester topic page so that developers can more easily learn about it.
To associate your repository with the harvester topic, visit your repo's landing page and select "manage topics."