Merge StructTreeRoot object when merging tagged pdf files #2508
PeterSlezak
started this conversation in
Ideas
Replies: 1 comment
-
We have given your request some consideration. The fact that So I am afraid we have to turn down your request for the time being. To make sure it is not getting out of sight, let me transfer this to "Discussions" under category "Ideas".
-
Is your feature request related to a problem? Please describe.
When merging multiple PDF files, the StructTreeRoot object is not kept if a new PDF file is created and all other PDF files are appended to it using insert_pdf().
Alternatively, if the remaining PDF files are appended to the very first file, only the StructTreeRoot object of that first file is kept.
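To make the two cases above easier to reproduce, here is a minimal sketch. It assumes a recent PyMuPDF release that can be imported as `pymupdf` (older releases use `import fitz`). The helper names `has_struct_tree`, `merge_into_new` and `merge_into_first` are made up for illustration and are not part of the library; the check only looks for a `/StructTreeRoot` key in the PDF catalog and does not validate the structure tree itself.

```python
def has_struct_tree(doc):
    """Return True if the PDF catalog of `doc` has a /StructTreeRoot entry.

    `doc` is expected to be a PyMuPDF Document (or any object offering the
    same two methods).
    """
    catalog_xref = doc.pdf_catalog()  # xref number of the PDF catalog
    key_type, _value = doc.xref_get_key(catalog_xref, "StructTreeRoot")
    return key_type != "null"  # a missing key is reported with type "null"


def merge_into_new(paths):
    """Case 1: append every input file to a freshly created, empty document."""
    import pymupdf  # assumption: recent PyMuPDF; older releases use `import fitz`

    merged = pymupdf.open()  # new, empty PDF
    for path in paths:
        with pymupdf.open(path) as src:
            merged.insert_pdf(src)
    return merged


def merge_into_first(paths):
    """Case 2: open the first input file and append the remaining ones to it."""
    import pymupdf  # same assumption as above

    merged = pymupdf.open(paths[0])
    for path in paths[1:]:
        with pymupdf.open(path) as src:
            merged.insert_pdf(src)
    return merged
```

With two tagged input files (say, the hypothetical `a.pdf` and `b.pdf`), the behavior described above would mean that `has_struct_tree(merge_into_new(["a.pdf", "b.pdf"]))` comes back false, while the result of `merge_into_first` still carries a StructTreeRoot, but one that only describes the pages of `a.pdf`.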
Describe the solution you'd like
If a StructTreeRoot object is present in all (or only some) of the input files, join the StructTreeRoot objects from the individual files so that the structure information is preserved.
Describe alternatives you've considered
I'm not aware of any Python library that offers this functionality (iText seems to offer it, though: https://stackoverflow.com/questions/19839445/merging-tagged-pdf-without-ruining-the-tags). I have perused section 10.6 of the PDF Reference, version 1.7, and it is beyond my capabilities to implement this on my own. I found a discussion on qpdf with some details regarding the PDF specification that might help you assess the feasibility: qpdf/qpdf#490 (comment).