Hello I am trying to ocr on an image.
this is the original image after some pre processing (skipping preprocessing part since its not really related to my question but will share if somebody needs it)
I've got this image
when I try to ocr this image with using tesseract
I'm getting a result as
HN'
2809
however when I manually crop half part of the image on photoshop
I recieve
HN'
Z8
as a result.
I wonder whats difference between those two images because one gives 2 instead of Z but the other one gives the Z.
I know I have to smooth edges for more accurate results but motion blur, gaussian blur nor ordinary blur filter did change the results I'm getting.
Tesseract implements an algorithm that picks number 2 over letter Z based on the amount and type of digits in the neighbourhood:
I had this problem before. :(
By the way, I think you should flip the first part of the image so HN'
becomes .NH
.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With