ARCHIVED--Docker app to crawl URLs and generate WARCs
Updated Apr 11, 2017 - Python
Parser for WARC (aka WebArchive) files
From WARC records to MongoDB documents