PDF to Text with Python Introduction This program will: Split your PDF into pages, Extract the text from each pages, and Save them in .txt file. Required PDFtk (Why using this?) PyPDF2 Run $ python main.py <your-pdf-file> Why Using PDFtk? Because PyPDF2's extract function doesn't works on some files.