ocr Questions

5

I am having problems while installing tesseract to develop in C++ on Windows 10. Can anyone provide a guide to get: 1. Leptonica (required by tesseract) lib and includes 2. Tesseract lib and inclu...
Nianiabi asked 2/6, 2018 at 16:30

4

Here's my code: import pytesseract import cv2 from PIL import Image pytesseract.pytesseract.tesseract_cmd = r"C:\Program Files (x86)\Tesseract-OCR\tesseract.exe" def main(): original = cv2.imr...
Sunken asked 4/1, 2019 at 22:38

1

So I'm trying to get the orientation of a table image with Tesseract's image_to_osd(). Full code here: import cv2 from PIL import Image import pytesseract from skimage import io from skimage.transf...
Pattern asked 13/11, 2023 at 21:23

1

(See below for update with partially working code.) I have thousands of images that look like this: I need to run an OCR algorithm on the "1930 E.D." column. I find that when I crop th...
Irretrievable asked 18/7 at 21:40

2

I am training an OCR model for recognizing MRZ from passport. To train my model for more accuracy, I need to train it with maximum pictures possible. I tried to find passport's dataset on KAGGLE bu...
Mainsail asked 3/2, 2020 at 13:11

4

Solved

I have some hundreds of images (scanned documents), most of them are skewed. I wanted to de-skew them using Python. Here is the code I used: import numpy as np import cv2 from skimage.transform ...
Plumy asked 12/4, 2019 at 14:41

4

I'm trying to create a real time OCR in python using mss and pytesseract. So far, I've been able to capture my entire screen which has a steady FPS of 30. If I wanted to capture a smaller area of a...
Babe asked 23/2, 2021 at 14:8

2

Solved

Attached is a picture with curved lines, how can you find the Baseline of the text? The goal is to get lines like I drew by hand in the following picture: I tried the following code, but letters ...
Dasilva asked 4/6 at 10:46

5

Solved

How to represent: Create new image with paint (any size) Add letter A to this image Try to recognize -> tesseract will not find any letters Copy-paste this letter 5-6 times to this image Try to r...
Siphon asked 9/3, 2012 at 9:55

4

Solved

For my current ocr project I tried using tesserect using the the python cover pytesseract for converting images into text files. Up till now I was only passing well straight oriented images into my...
Cumber asked 12/3, 2019 at 10:41

2

Solved

I want to highlight specific words/sentences in a website screenshot. Once the screenshot is taken, I extract the text using pytesseract and cv2. That works well and I can get text and data about...
Bullard asked 9/1, 2019 at 17:51

4

I am installing layout-parser and following this link. Did not face any issues with the following packages.   pip install layoutparser pip install "layoutparser[effdet]" pip install lay...
Asymmetry asked 6/2, 2023 at 6:19

9

Solved

I am using python-tesseract to extract words from an image. This is a python wrapper for tesseract which is an OCR code. I am using the following code for getting the words: import tesseract api...
Iatric asked 30/12, 2013 at 0:15

9

Solved

I tried Google Cloud Vision api (TEXT_DETECTION) on 90 degrees rotated image. It still can return recognized text correctly. (see image below) That means the engine can recognize text even the ima...
Illyria asked 22/12, 2016 at 14:36

8

Solved

I sometimes have to take some printed source code and manually type the source code into a text editor. Obviously typing it up takes a long time and always extra time to debug typing errors (oops m...
ocr
Thanhthank asked 11/12, 2009 at 14:54

2

Solved

I am trying to use Tesseract in R to scrape data from an image, however I get an error. This is the code I am using and the error: library(tesseract) eng <- tesseract("eng") text <...
Oralee asked 1/7, 2022 at 7:0

3

Solved

I'm still new to Tesseract OCR and after using it in my script noticed it had a relatively big error rate for the images I was trying to extract text from. I came across Tesseract training, which s...
Stettin asked 23/12, 2016 at 5:13

2

Solved

I have started to use Pytesser, which works great with both english and chinese, but is there a way to have both languages work at the same time? Would I have to make my own traineddata file? My co...
Enthetic asked 20/4, 2016 at 14:25

5

I recently used tesseract OCR with python and I kept getting an error when I was trying to import image_to_string from tesseract. Code causing the problem: # Perform OCR using tesseract-ocr libra...
Sandarac asked 1/2, 2013 at 5:47

1

I want to do OCR on this image. This is pre-define format. ie first five will characters, then next four will be digits and last will be character. When I execute following command $ tesseract in...
Embroider asked 30/10, 2015 at 5:18

2

I want to read a specific character sequence with Tesseract like this post : Tesseract OCR: is it possible to force a specific pattern? I have tried bazaar matching pattern in Tesseract with the pa...
Paraphernalia asked 7/8, 2015 at 9:33

3

The quick brown fox jumps over the lazy dog In such a case like this, assuming there are different font families too, can cloud VIsion API detect this. Or any other OCR API detect this cleanly. Tes...

5

I am building an OCR project and I am using a .Net wrapper for Tesseract. The samples that the wrapper have don't show how to deal with a PDF as input. Using a PDF as input how do I produce a searc...
Elisabeth asked 15/4, 2015 at 17:48

2

Solved

I have a request from a customer who wishes to provide meals to elderly people in different localities. To do this the people fill out a form for the week and tick boxes depending on their choices ...
ocr
Rankle asked 15/5, 2013 at 8:28

22

I have a problem with running tesseract-ocr engine on linux. I've downloaded RUS language data and put it to tessdata directory (/usr/local/share/tessdata). When I'm trying to run tesseract with co...
Rozalin asked 10/2, 2013 at 17:53

© 2022 - 2024 — McMap. All rights reserved.