Image Preprocessing for OCR - Tessaract

About

Asked 4/8, 2018 at 19:33 Answered 5/8, 2018 at 5:48

Solved python ocr image-recognition image-preprocessing python-tesseract

Obviously this image is pretty tough as it is low clarity and is not a real word. However, with this code, I'm detecting nothing close:

import pytesseract
from PIL import Image, ImageEnhance, ImageFilter
image_name = 'NedNoodleArms.jpg'
im = Image.open(image_name) 
im = im.filter(ImageFilter.MedianFilter())
enhancer = ImageEnhance.Contrast(im)
im = enhancer.enhance(2)
im = im.convert('1')
im.save(image_name)
text = pytesseract.image_to_string(Image.open(image_name))
print(text)

outputs

, Mdﬁaodﬁamms

Any ideas here? The image my contrasting function produces is:

Which looks decent? I don't have a ton of OCR experience. What preprocessing would you recommend here? I've tried resizing the image larger, which helps a little bit but not enough, along with a bunch of different filters from PIL. Nothing getting particularly close though

Lithopone answered 4/8, 2018 at 19:33 Comment(3)

Do not convert to 1 bit B/W, use grayscale ('L" IINM). – Gourd 4/8, 2018 at 22:17

Thanks Paulo! That helped a lot. Its outputting 'NedNnodleArrns', which is super reasonable – Lithopone 5/8, 2018 at 0:6

Glad to help, check my answer. – Gourd 5/8, 2018 at 5:50

You are right, tesseract works better with higher resolutions so sometimes resizing the image helps - but don't convert to 1 bit.

I got good results converting to grayscale, making it 3 times as large and making the letters a bit brighter:

>>> im = Image.open('j78TY.png')\
          .convert('L').resize([3 * _ for _ in im.size], Image.BICUBIC)\
          .point(lambda p: p > 75 and p + 100)
>>> pytesseract.image_to_string(im)
'NedNoodleArms'

Check this jupyter notebook:

Gourd answered 5/8, 2018 at 5:48 Comment(4)

Can you explain what this lambda function is doing? Is it like thresholding? – Swoosh 23/10, 2018 at 23:32

@Swoosh yeah, It is a lame hack for making the letters brighter, if a pixel value is over 75 (of 256) then add 100 to its value. – Gourd 24/10, 2018 at 14:58

You mean (of 255), right? What if pixel values is already over 200? – Swoosh 24/10, 2018 at 16:29

Awesome answer for an awesomely effective simple trick! Thanks! – Bertram 15/7, 2019 at 13:39

Hot tags

Godot Unity Godot Help Programming Godot 4.X GUI GDScript 3D 2D Physics CSharp Godot 3.X VR XR Projects C++

Recommended topics

Hot tags