Junk results when using Tesseract OCR and tess-two
Asked Answered
A

1

0

I have developed OCR Application using Tesseract OCR Library and referred from the following Links.

  1. android-ocr
  2. tesseract

But I am getting junk data as results sometimes. Can anyone help me what to do further to get accurate results.

Alodi answered 31/8, 2016 at 7:43 Comment(1)
You should provide enough information to reproduce your issue. An example image, what is expected, what actually happens. Best regards.Becalmed
H
2

You should provide your test images if you want to get specific help for your case as well as any code you are using but a general rule of thumb for getting accurate results are :

  • Use a high resolution image (if needed) 300 DPI is minimum

  • Make sure there is no shadows or bends in the image

  • If there is any skew, you will need to fix the image in code prior to ocr

  • Use a dictionary to help get good results

  • Adjust the text size (12 pt font is ideal)

  • Binarize the image and use image processing algorithms to remove noise

On top of all this, there are a lot of image processing functions out there that can help increase accuracy depending on your image such as deskew, perspective correction, line removal, border removal, dot removal, despeckle, and many more depending on your image.

Hawkbill answered 3/9, 2016 at 19:22 Comment(4)
Hi @hcham1, Thank you for your valuable information. But could you please also tell me a good tutorial for such kind of image processing?Alodi
I updated my answer with a link to a tutorial on various image processing commands that can help with OCRHawkbill
@Hawkbill You updated your answer with a link to a tutorial. Can you show where is the link please? ThxFibrinolysis
For more information on various image processing functions that can help increase the accuracy of OCR, please check out these links: leadtools.com/help/leadtools/v19/dh/to/… leadtools.com/support/forum/posts/…Hawkbill

© 2022 - 2024 — McMap. All rights reserved.