Using PyPDF2 to extract vertical text from a scanned PDF

I am trying to extract text from scanned PDFs using PyPDF2. Some of the PDFs contain vertically aligned text, even though the page orientation is portrait. Is there any way to detect that text is vertically aligned, and to read those vertical lines, using pdfminer or PyPDF2?

Hube answered 27/9, 2018 at 5:53

There is no way to do this with PyPDF2 at the moment (I'm the maintainer of PyPDF2).

See also: https://github.com/py-pdf/PyPDF2/issues/1071
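
For the detection half of the question, here is a minimal sketch of the underlying idea, independent of any particular library. It assumes the PDF actually has a text layer (a purely scanned page would need OCR first) and that some tool gives you the text matrix `[a b c d e f]` of each text run; how you obtain that matrix is not shown and is an assumption. In PDF, a rotation by an angle θ is the matrix `[cos θ, sin θ, -sin θ, cos θ, 0, 0]`, so the angle can be recovered from `a` and `b`. The function names and the 5 degree tolerance are hypothetical, chosen only for illustration.

```python
import math


def text_rotation_degrees(a, b):
    """Return the rotation (0 <= angle < 360) implied by a text matrix.

    `a` and `b` are the first two entries of a PDF matrix [a b c d e f].
    For a pure rotation, a = cos(theta) and b = sin(theta).
    """
    return math.degrees(math.atan2(b, a)) % 360


def is_vertical(a, b, tolerance=5.0):
    """True if the text run is rotated by roughly 90 or 270 degrees.

    `tolerance` (in degrees) is an arbitrary illustrative value.
    """
    angle = text_rotation_degrees(a, b)
    return abs(angle - 90) <= tolerance or abs(angle - 270) <= tolerance
```

For example, `is_vertical(0, 1)` is true for text rotated a quarter turn, while `is_vertical(1, 0)` is false for ordinary horizontal text. This only covers rotated text, not vertical writing modes where upright glyphs are stacked top to bottom.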

Deoxyribonuclease answered 10/7, 2022 at 11:24
