I am struggling with tesseract ocr on ios. Everything works fine but it is really slow. 2 - 3 seconds recogintion time for a single line of digits.
I am reading from a Video Stream.
I am using tesseract 3.01 with a custom training file for my font.
Here is what I do:
Setting up tesseract only to find numbers (0-9)
GetLines
to find the line I want the text ofsetRectangle
to only recognize the line i wantgetUTF8Text
to get my text <- this alone takes 2-3 seconds
Are there any suggestions to speed up the process?
I switched to the SVN of tesseract 3.02
After having some problems with crosscompiling,
I am down to about 1.2 - 0.8 Seconds for getUTF8Text and 0.3 - 0.2 seconds for image preprocessing.
I did some testing
The main gain in usability was to optimize the code with setRectangle not calling getUTF8Text if I suspect the bounds to be incorrect. And some String postprocessing by checking the received string and apply some algorithm to eliminate the most common misinterpretations from tesseract.
maybe this is useful for someone else.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With