Most accurate open-source OCR for handwritten numbers? [closed]

1

6

My software needs to read a fixed-length handwritten number.

While I could use a general-purpose library like Tesseract, I am sure there is something smarter. Tesseract will probably misread some 1s or 7s as I or l, whereas software that expects only digits would not.

Knowing that the field contains only digits (written the American-English way), the algorithm could focus on 10 potential matches instead of hundreds of symbols.
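To make the idea concrete, here is a minimal sketch of restricting a recognizer's candidates to the ten digits. It assumes the recognizer exposes a per-symbol score for each character image; the `scores` dictionary and the `best_digit` helper are hypothetical and not part of any particular library.

```python
DIGITS = set("0123456789")


def best_digit(scores):
    """Return the highest-scoring digit, ignoring every non-digit candidate."""
    digit_scores = {ch: s for ch, s in scores.items() if ch in DIGITS}
    if not digit_scores:
        raise ValueError("no digit candidates in scores")
    return max(digit_scores, key=digit_scores.get)


# Hypothetical scores for one handwritten stroke: a general-purpose
# recognizer would pick "l" here, while a digit-only one picks "1".
scores = {"l": 0.41, "I": 0.30, "1": 0.25, "7": 0.04}
print(best_digit(scores))
```

The point is only that the letter candidates never compete with the digits, which is the behavior I am hoping an existing library already provides.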

Any experience OCRing handwritten number-only fields?
What open source library/software did you get the best results with?

Specter answered 1/4, 2010 at 7:5 Comment(0)
6

From the FAQ of Tesseract:

How do I recognize only digits?

In 2.03 and above:

Use

TessBaseAPI::SetVariable("tessedit_char_whitelist", "0123456789");

before calling an Init function or put this in a text file called tessdata/configs/digits:

tessedit_char_whitelist 0123456789

and then your command line becomes:

tesseract image.tif outputbase nobatch digits

Warning: Until the old and new config variables get merged, you must have the nobatch parameter too.
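If you call Tesseract from a script, the command line above can be wrapped as in the following sketch. It assumes the `tesseract` executable is on the PATH, that the `digits` config file described above exists, and that Tesseract writes its result to `outputbase.txt`; the function names are made up for illustration.

```python
import subprocess


def digits_command(image_path, output_base):
    """Build the digits-only command line quoted from the FAQ."""
    return ["tesseract", image_path, output_base, "nobatch", "digits"]


def run_digits_ocr(image_path, output_base):
    """Run Tesseract and return the recognized text (assumes <output_base>.txt)."""
    subprocess.run(digits_command(image_path, output_base), check=True)
    with open(output_base + ".txt", encoding="utf-8") as f:
        return f.read().strip()


print(" ".join(digits_command("image.tif", "outputbase")))
```

Only `digits_command` is exercised here; `run_digits_ocr` needs a working Tesseract installation and an input image.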

However, since Tesseract was designed for printed rather than handwritten text, I think accuracy might suffer even when it is restricted to digits.
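Because the number has a fixed length, a cheap sanity check on the OCR output can catch some of those errors. The sketch below is one possible approach, not something Tesseract provides: it maps look-alike letters back to digits and rejects anything that is not exactly the expected length. The I and l confusions come from the question; the other entries in the table are my own assumptions.

```python
# Look-alike characters mapped back to digits. "I" and "l" are the
# confusions mentioned in the question; the rest are assumed examples.
CONFUSIONS = str.maketrans({"I": "1", "l": "1", "|": "1", "O": "0", "o": "0"})


def normalize_number(raw, expected_length):
    """Return the cleaned digit string, or None if it fails validation."""
    # Drop all whitespace, then replace look-alike letters with digits.
    cleaned = "".join(raw.split()).translate(CONFUSIONS)
    if len(cleaned) == expected_length and cleaned.isascii() and cleaned.isdigit():
        return cleaned
    return None


print(normalize_number("4I l7", 4))  # look-alikes repaired
print(normalize_number("41a7", 4))   # rejected: contains a letter
```

A rejected field can then be sent to manual review instead of being stored with a wrong value.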

Lenna answered 1/4, 2010 at 7:12 Comment(6)
Thanks for this! But indeed, Tesseract doesn't seem to be designed for handwriting, so it would probably be quite mediocre at it. Specter
@nic: Maybe you could re-train it. It seems to be possible. Lenna
Seems possible indeed. But when I propose this solution to the client company, they might look at me funny... A proven solution with a community (even a small one) would probably be more credible. I would be surprised if it does not exist already. Specter
Now that I think about it, handwritten digits should not be really difficult to recognize... they are not joined into fuzzy words like Latin letters. Much easier to recognize than handwritten text. Specter
@Nicolas, did you manage to find a proper solution for your use case (OCR adapted for handwritten numerals)? Stradivari
@MiroslavDzhokanov: Unfortunately not. By the way, this question was off-topic here, so I recreated it at softwarerecs.stackexchange.com/questions/27834/… Specter
