Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cosmos pdf extraction content for left-columns on sidarthe interleaves text of left and right columns #84

Open
orm011 opened this issue Oct 6, 2023 · 2 comments

Comments

@orm011
Copy link

orm011 commented Oct 6, 2023

Example shown, notice the first line of text continues into the corresponding line of the right-hand column, whereas the box only includes the left column. Ive noticed this elsewhere in the extractions for the left column, but the right column extractions did not seem to have this problem.
Screenshot 2023-09-22 at 12 11 29 PM

Screenshot 2023-09-22 at 12 10 18 PM

@brandomr
Copy link
Contributor

brandomr commented Oct 6, 2023

@iross @mwestphall maybe this should be an issue on the Cosmos repo? I think this is just purely a Cosmos quality issue if I'm not mistaken?

@iross
Copy link

iross commented Oct 6, 2023

Created UW-COSMOS/Cosmos#200 to track it in the COSMOS repo. We'll take a look at it next week.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants