Skip to content

v0.9.0

Latest
Compare
Choose a tag to compare
@lukehsiao lukehsiao released this 23 Jun 01:24
· 1 commit to master since this release
v0.9.0

0.9.0 - 2021-06-22

This is a long-awaited release with some performance improvements and some breaking changes. See the changelog for details.

Added

Changed

  • @HiromuHota: Renamed VisualLinker to PdfVisualParser, which assumes the followings: (#518)

    • pdf_path should be a directory path, where PDF files exist, and cannot be a file path.
    • The PDF file should have the same basename (os.path.basename) as the document. E.g., the PDF file should be either "123.pdf" or "123.PDF" for "123.html".
  • @HiromuHota: Changed Parser's signature as follows: (#518)

    • Renamed vizlink to visual_parser.
    • Removed pdf_path. Now this is required only by PdfVisualParser.
    • Removed visual. Provide visual_parser if visual information is to be parsed.
  • @YasushiMiyata: Changed UDFRunner's and UDF's data commit process as follows: (#545)

    • Removed add process on single-thread in _apply in UDFRunner.
    • Added UDFRunner._add of y on multi-threads to Parser, Labeler and Featurizer.
    • Removed y of document parsed result from out_queue in UDF.

Fixed