Document Parser

This project is a Flask-based web application with a focus on the user interface and optional AI integration. The primary command, make run, starts the web server and gives access to the core functionality. Advanced users can optionally extend the application by training a model or by updating it with the best model version.

Running the Web Application

Prerequisites

Python 3.x
Flask
Other dependencies in requirements.txt
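
Before running make install, it can help to confirm that the interpreter and Flask are visible. The following is a minimal sketch and not part of this repository: the function name missing_prerequisites is hypothetical, and it only checks the Python major version and whether the named modules can be located, not the full contents of requirements.txt.

```python
import importlib.util
import sys


def missing_prerequisites(modules=("flask",), min_python=(3, 0)):
    """Return a list of problems found; an empty list means none were found.

    Hypothetical helper for illustration only. It checks the interpreter
    version and whether each top-level module can be located.
    """
    problems = []
    if sys.version_info < min_python:
        wanted = ".".join(str(part) for part in min_python)
        problems.append(f"Python {wanted} or newer is required")
    for name in modules:
        # find_spec returns None when a top-level module cannot be found.
        if importlib.util.find_spec(name) is None:
            problems.append(f"missing module: {name}")
    return problems
```

Calling missing_prerequisites() with no arguments checks for Flask only; pass other top-level module names to check more of the dependencies.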

Quick Start

To quickly start the web application:

git clone git@github.com:rlnsanz/document_parser.git
cd document_parser
make install
make run

These commands clone the repository, set up the environment (make install), and launch the Flask web server (make run), ready for use.

Storing PDFs for Processing

For privacy and organization, the application processes PDFs and PNGs stored in a dedicated directory, private/. Create this directory at the root of the repository (the same directory that contains the Makefile). It is excluded from version control via .gitignore, so your documents stay out of the repository history.
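
As an illustration of the layout described above, the sketch below creates private/ if it is missing and lists the PDF and PNG files inside it. It is not the application's own loading code: the function name list_private_documents is hypothetical, and it assumes files sit directly in private/ rather than in subfolders.

```python
from pathlib import Path

# File types the README says the application processes.
SUPPORTED_SUFFIXES = {".pdf", ".png"}


def list_private_documents(repo_root="."):
    """Create private/ under repo_root if needed and return its PDFs and PNGs.

    Hypothetical helper for illustration only. Only files directly inside
    private/ are considered; suffix matching is case-insensitive.
    """
    private_dir = Path(repo_root) / "private"
    private_dir.mkdir(exist_ok=True)
    return sorted(
        path
        for path in private_dir.iterdir()
        if path.is_file() and path.suffix.lower() in SUPPORTED_SUFFIXES
    )
```

Run from the repository root, list_private_documents() returns a sorted list of paths, which makes it easy to confirm that your files are in the expected place before starting the server.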

Contributing

Contributions are welcome. Please use the standard fork-and-pull-request workflow for any contributions.

License

This project is licensed under the Apache License, Version 2.0.

