Tesseract: Specifying regions of text

1 Answers

I found the answer, thanks to this thread.

It seems that tesseract suports the uzn format (used in the unvl tests).

From the thread:

Calling tesseract with parameter "-psm 4" and renaming the uzn file with the same name of the image seem works.

Example: If we have C:\input.tif and C:\input.uzn, we do this:

tesseract -psm 4 C:\input.tif C:\output

187

answered Nov 06 '22 04:11

sashoalm

Related questions
                            
                                Python OCR: ignore signatures in documents
                            
                                Using Nearest Neighbour Algorithm for image pattern recognition
                            
                                How do I make an OCR Program? [closed]
                            
                                iOS Tesseract: bad results
                            
                                HOCR to HTML for visualizing
                            
                                Training feedforward neural network for OCR [closed]
                            
                                Is there an OCR open source library or sdk (free) for Android & iOS? [closed]
                            
                                tesseract didn't get the little labels
                            
                                OpenCV Python - Fixing Broken Text
                            
                                Android Tess-Two OCR unmappable character 'ﬁ'
                            
                                Why does pytesseract fail to recognise digits from image with darker background?
                            
                                Optical Character Recognition Android with OpenCV
                            
                                edge detection issue on Text detection in images
                            
                                Prepare complex image for OCR
                            
                                Couldn't load lept from loader findLibrary returned null?
                            
                                Increase Accuracy of text recognition through pytesseract & PIL
                            
                                PHP Repairing Bad Text
                            
                                Extracting fields from forms with varying structures
                            
                                Microsoft Azure Cognitive Services Handwriting Detection Bounding Box Parameters

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Tesseract: Specifying regions of text

Tags:

ocr

tesseract

sashoalm

People also ask

1 Answers

sashoalm

Recent Activity

Donate For Us