I have been implementing an Android OCR tool using tesseract to ocr digits only. So far, it is giving quite high accuracy with normal digit fonts. However, the accuracy is terrible when it comes to 7 segment digits (those found on LCDs) .
I have tried cropping my image, whitelist with 0 to 9 and also some image processing to no avail. Any ideas out there on how to increase the accuracy ? Or perhaps some tips on training the specific 7 segment digits for tesseract will definitely help me a lot.
Thanks in advance.
You can find traineddata for 7 segments at:
https://github.com/arturaugusto/display_ocr/tree/master/letsgodigital
There is also a sample python code at the same repository.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With