Releases: axa-group/Parsr

v0.12

04 May 09:58

jvalls-axa

v0.12

1d7a46a

Changes:

Improved PdfJS to be compatible with the 'Image Detection' module.
Allowed Abbyy as a PDF extractor.
Improved the Python 3 client.
Added Spanish and Portuguese READMEs.
Improved heading detection.
API: postDocument with optional defaultConfig.
Several bug fixes.

0.11.2

26 Mar 12:54

jvalls-axa

v0.11.2

850c58e

Fixes:

Fixed PdfMiner splitting words incorrectly
Fixed #372 & #367

v0.11.1

25 Mar 13:41

jvalls-axa

v0.11.1

9377e4c

Changes:

Fixed dependencies in which security vulnerabilities were detected
Several bug fixes

v0.11

11 Mar 13:58

jvalls-axa

v0.11

6a4534e

Changes

Advanced image detection module that allows scanning images with OCR engines
Improved data extraction and reconstruction when a document has pages with rotated content
Automated the Parsr bare-metal installation process with a single Node.js script
Removed GraphicsMagick & pdf2pic dependencies
Updated documentation
Several bug fixes

0.10.1

25 Feb 09:19

jvalls-axa

0.10.1

bb58291

Security vulnerability fixed

Bump bleach from 3.1.0 to 3.1.1 in /demo/jupyter-notebook

0.10

19 Feb 11:23

jvalls-axa

0.10

92e65b3

Changes

New input file format: *.docx
New 'Table of contents' processing module
UI: added a button to download outputs
Added compatibility with PdfMiner '20200124'
Improved PdfMiner extraction time by using an XML stream reader
Allowed running new OCR engines through the API by extending the configuration file
Several bug fixes

Breaking changes

Deprecated pipeline configuration property 'extractor.img'

0.9: Merge branch 'develop'

24 Jan 13:39

jvalls-axa

0.9

e39901a

Changes

Integrated new OCR engines in the GUI:
- Google Vision
- Amazon Textract
- Microsoft Cognitive Services
- Abbyy
Updated GUI: added the official logo and fixed some cosmetic issues
Several bug fixes
Updated Readme.md

v0.8: Merge pull request #293 from axa-group/feature/Image_Module_Off

13 Jan 15:33

jvalls-axa

v0.8

26a9936

Changes

Simple image detection using PdfMiner.
Allowed *.eml files as input (the message body and attachments are used to extract data).
GUI can display page margins by toggling a switch.
README in French.

v0.7.1: Merge pull request #263 from axa-group/feature/better-error-trace

16 Dec 08:44

jvalls-axa

v0.7.1

c8d4305

Changes

Removed the 'sharp' dependency from the API
Improved error handling
Allowed Tesseract to run on multi-page PDFs
Fixed some JS vulnerabilities
Improved Jupyter Notebook document versioning display

v0.7

09 Dec 14:14

jvalls-axa

v0.7

2d93b85

Changes

Optimised images before the Tesseract scan (rotation detection and shadow removal)
New input module option: Pdf.js (recommended for large PDFs)
Jupyter Notebook: added document versioning and comparison
Fixed a JavaScript vulnerability
Several GUI & Server bug fixes
