document-layout-analysis Questions
1
(See below for update with partially working code.)
I have thousands of images that look like this:
I need to run an OCR algorithm on the "1930 E.D." column. I find that when I crop th...
Irretrievable asked 18/7 at 21:40
2
Extracting table data from digital PDFs have been simple using camelot and tabula. However, the solution doesn't work with scanned images of the document pages specifically when the table doesn't h...
Bently asked 24/12, 2019 at 18:28
2
I'm trying to extract data from pdf/image invoices using computer vision.For that i used ocr based pytesseract.
this is sample invoice
you can find code for same below
import pytesseract
img = I...
Nashom asked 17/4, 2020 at 6:52
5
Solved
I need to analyze the layout structure of different documents type like: pdf, doc, docx, odt etc.
My task is:
Giving a document, group the text in blocks finding the correct boundaries of each.
I d...
Daughter asked 4/3, 2021 at 11:20
1
Solved
I'd to detect and, if necessary, correct the orientation of a scanned document image. I am already able to deskew documents, however it still might occur, that a document is upside down and it need...
Vindicate asked 17/11, 2011 at 20:7
2
Solved
I'm trying to sort through several thousand scanned files and sort them into folders based on type (ie: if one of the files is a scanned copy of formA, then it should go in the formA folder, if it'...
Languorous asked 11/7, 2011 at 20:18
1
© 2022 - 2024 — McMap. All rights reserved.