Initial Release: ArXiv Parsing+Reparsing

Pre-release
@tjacovich tjacovich released this 13 Jul 15:56
· 3 commits to main since this release
94c1a60

Initial parser with functionality to:

  • parse output from the harvester pipeline
  • reparse harvested records by pulling them from S3
    • supports reparsing individual records or records in bulk
  • resend previously parsed data to Kafka
  • view the current parsed data for a given record
  • monitor the current parsing status of a given record
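
The reparse and resend features listed above can be pictured as a small loop: fetch the stored raw record, parse it, publish the result, and record a status per record. The sketch below is illustrative only and is not the pipeline's actual code. The names `reparse_records`, `ReparseResult`, `fetch_raw`, `parse`, and `publish` are hypothetical, and the S3 and Kafka calls are replaced by in-memory stand-ins so the example runs on its own.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class ReparseResult:
    """Hypothetical per-record outcome, used here to mimic status monitoring."""
    record_id: str
    status: str  # "parsed" or "failed"
    parsed: dict = field(default_factory=dict)


def reparse_records(
    record_ids: Iterable[str],
    fetch_raw: Callable[[str], str],        # assumed stand-in for an S3 read
    parse: Callable[[str], dict],           # assumed stand-in for the ArXiv parser
    publish: Callable[[str, dict], None],   # assumed stand-in for a Kafka send
) -> List[ReparseResult]:
    """Reparse one or many records, continuing past individual failures."""
    results = []
    for record_id in record_ids:
        try:
            raw = fetch_raw(record_id)
            parsed = parse(raw)
            publish(record_id, parsed)
            results.append(ReparseResult(record_id, "parsed", parsed))
        except Exception:
            # In a bulk run, one bad record should not stop the rest.
            results.append(ReparseResult(record_id, "failed"))
    return results


# In-memory stand-ins for the raw-record store and the outgoing message queue.
store = {"2107.00001": "<title>Example</title>"}
sent = []

results = reparse_records(
    ["2107.00001", "missing-id"],
    fetch_raw=lambda rid: store[rid],
    parse=lambda raw: {"length": len(raw)},
    publish=lambda rid, doc: sent.append((rid, doc)),
)
statuses = {r.record_id: r.status for r in results}
```

Passing the fetch, parse, and publish steps in as callables keeps the loop independent of any particular S3 or Kafka client, so the same function covers a single record or a bulk list.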