Extract text from cropped PDF page #2319
Unanswered
WallysFerreira
asked this question in
Q&A
Replies: 2 comments
-
Visitor function should do most of the job. You may have difficulty to cut lines |
Beta Was this translation helpful? Give feedback.
0 replies
-
If do not care about the side effects like changing the render resolution of embedded images etc., you might consider running Ghostscript on the PDF file after cropping. At least this is what usually worked for me when working with |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I've cropped a PDF page using mediabox.upper_left and mediabox.lower_right, now I want to extract the text of this cropped page but it gives me text of the entire page still. Visitor function doesn't seem to work also
I've tried converting it to an image and extracting the text with OCR but it's really slow compared to extracting the text off the PDF page, so I would really appreciate any help with this.
Beta Was this translation helpful? Give feedback.
All reactions