Skip to content

gurusura/pdf-text-data-extractor

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text data extractor: PDF to Text

This PDF text data extraction app takes a PDF document as input and return either a txt file that contains all pages or a compressed folder of txt files representing the document pages.

Open in Streamlit

flowchart LR

A[Input: PDF] --> |text conversion| B(Output: Text)
B --> |Option 1| D[One output .txt file]
B --> |Option 2| E[ZIP folder containing .txt file pages]

Loading

Application Demo

demo2.mp4

About

PDF text data extraction web application with streamlit

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%