OCR: segmentation of small text

The problem

I've been building a (very) simple OCR engine. Since I'm trying to classify very small (pixel-sized) characters, I'm having some difficulties with segmentation. Here's an example, after best-effort image-wide thresholding:

[Image: problematic segmentation of "63"]

What I've tried

Error detection:

Flag segments with a large horizontal size. This mostly works, but gives false positives for a few larger characters.
Classify, and reject on a low score. This seems a bit wasteful.
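A minimal sketch of the width check, assuming the segment widths are already known in pixels. The function name `flag_wide_segments` and the 1.5 ratio are made up for illustration, not taken from my actual code:

```python
from statistics import median

def flag_wide_segments(widths, ratio=1.5):
    """Return indices of segments wider than ratio * median width.

    The ratio is an assumed tuning parameter, not a recommended value.
    """
    typical = median(widths)
    return [i for i, w in enumerate(widths) if w > ratio * typical]

# Hypothetical segment widths in pixels; index 3 is suspiciously wide.
print(flag_wide_segments([5, 6, 5, 11, 5]))  # [3]
```

A genuinely wide character would be flagged by the same rule, which is where the false positives come from.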

Error correction:

Sum the pixels in each column (vertical histogram) and cut at the minimum. This cuts many segments in the wrong place, in many of the samples.
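For reference, the vertical histogram cut looks roughly like this. It is a sketch assuming a binary segment stored as a list of rows with 1 for ink; the names `column_profile` and `min_projection_cut` and the `margin` parameter are illustrative only:

```python
def column_profile(img):
    """Sum the ink pixels in every column of a binary image (list of rows)."""
    return [sum(col) for col in zip(*img)]

def min_projection_cut(img, margin=1):
    """Return the column with the least ink, ignoring `margin` columns per side."""
    profile = column_profile(img)
    candidates = range(margin, len(profile) - margin)
    return min(candidates, key=lambda x: profile[x])

# Two toy glyphs joined by a one-pixel bridge in column 3.
segment = [
    [1, 1, 1, 0, 1, 1, 1],
    [1, 0, 1, 0, 1, 0, 1],
    [1, 1, 1, 1, 1, 1, 1],
    [1, 0, 1, 0, 1, 0, 1],
    [1, 1, 1, 0, 1, 1, 1],
]
print(column_profile(segment))      # [5, 3, 5, 1, 5, 3, 5]
print(min_projection_cut(segment))  # 3
```

The toy case is easy because the bridge is the thinnest column. On real samples the thinnest column is often inside a character, which gives the wrong cuts.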

What I haven't tried yet

Trying to classify at all possible segmentation points (pixels). This would be very wasteful, and would be difficult to extend to a segment of three merged characters.
I've been reading up on morphology approaches to turn the characters into mathematical curves, but I don't really know where to start, or if it's worth the effort.
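The exhaustive idea from the first item could be sketched as below. This assumes a classifier exposed as a `score` callable that returns a higher value for a more plausible character; `best_split`, `toy_score` and `min_width` are hypothetical names, and the toy scorer only stands in for a real classifier:

```python
def best_split(columns, score, min_width=1):
    """Try every cut position and keep the one with the best combined score."""
    best_cut, best_total = None, float("-inf")
    for cut in range(min_width, len(columns) - min_width + 1):
        total = score(columns[:cut]) + score(columns[cut:])
        if total > best_total:
            best_cut, best_total = cut, total
    return best_cut, best_total

def toy_score(segment, expected_width=3):
    """Stand-in for a classifier: prefers segments close to an assumed width."""
    return -abs(len(segment) - expected_width)

# Six columns of a merged segment (pixel contents are irrelevant to the toy scorer).
columns = [[1, 1]] * 6
print(best_split(columns, toy_score))  # (3, 0)
```

Each candidate cut costs two classifier calls. A three-character segment needs two cuts, so the number of candidate pairs grows roughly quadratically with the segment width.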

Where to go from here?

I have no idea. Hence this question :)


asked Dec 22 '12 04:12

loopbackbee

2 Answers

Lean back and half close your eyes.

63 :-)

Now, if only it was so easy for a computer!

It's tantalisingly close to what double-patterning does (or un-does?) in silicon masks.

I would suggest oversampling (doubling or quadrupling the pixel count in each axis), filtering (probably low pass, or possibly bandpass where the passband = spatial frequency of a line), then re-thresholding until the characters separate. This is expensive, so only apply it in problem areas.
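A rough sketch of that pipeline, under several assumptions of mine: nearest-neighbour oversampling, a 3x3 box filter as the low-pass stage, and a fixed list of candidate thresholds. All function names are illustrative, and a proper interpolating resize and a better filter would be the obvious upgrades:

```python
def upsample(img, factor=2):
    """Nearest-neighbour oversampling: repeat every pixel `factor` times per axis."""
    return [[p for p in row for _ in range(factor)]
            for row in img for _ in range(factor)]

def box_blur(img, radius=1):
    """Crude low-pass filter: mean over a square window, clipped at the borders."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            vals = [img[yy][xx]
                    for yy in range(max(0, y - radius), min(h, y + radius + 1))
                    for xx in range(max(0, x - radius), min(w, x + radius + 1))]
            row.append(sum(vals) / len(vals))
        out.append(row)
    return out

def threshold(img, t):
    """Binarise: 1 where the pixel value exceeds t, else 0."""
    return [[1 if p > t else 0 for p in row] for row in img]

def count_segments(binary):
    """Count runs of consecutive columns that contain at least one ink pixel."""
    runs, prev = 0, False
    for filled in (any(col) for col in zip(*binary)):
        if filled and not prev:
            runs += 1
        prev = filled
    return runs

def separate(img, factor=2, thresholds=(0.5, 0.6, 0.7, 0.8, 0.9)):
    """Raise the threshold until the blob splits into at least two segments."""
    blurred = box_blur(upsample(img, factor))
    for t in thresholds:
        binary = threshold(blurred, t)
        if count_segments(binary) >= 2:
            return t, binary
    return None

# Two solid blobs joined by a one-pixel bridge (row 2, column 3).
merged = [
    [1, 1, 1, 0, 1, 1, 1],
    [1, 1, 1, 0, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 1, 1],
    [1, 1, 1, 0, 1, 1, 1],
]
t, binary = separate(merged)
print(t, count_segments(binary))  # 0.8 2
```

After filtering, the thin bridge ends up dimmer than the solid strokes, so a higher threshold removes it first. The threshold list and filter radius would need tuning on real samples.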


answered Oct 19 '22 20:10

user_1818839

Reinvent your problem so you do not need segmentation.

Really, at this scale I think you had better invest in other approaches. For example, if you run OCR on text (do you?) you can use the information about lines (character height). There are not many fonts that can be used for small (yet readable) characters. My approach would be an algorithm that scans each text line in scanlines (from left to right, taking the pixels from top to bottom) and tries to find correlations between trained text and the scanlines (n, n-1... n-x).

And you probably need the information in the grayscale levels as well, so better not to threshold the images.
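One way to read the scanline idea is to slide a trained glyph over the columns of the grayscale line and keep the offset with the highest correlation. The sketch below assumes normalized cross-correlation as the similarity measure, which is my choice here and only one of several options; `ncc`, `match_template` and the toy data are illustrative:

```python
from math import sqrt

def ncc(a, b):
    """Normalized cross-correlation of two equal-length pixel lists (0.0 if flat)."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    da, db = [x - ma for x in a], [y - mb for y in b]
    denom = sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    if denom == 0:
        return 0.0
    return sum(x * y for x, y in zip(da, db)) / denom

def match_template(img, template):
    """Slide the template over the image columns; return (best_x, best_score)."""
    cols, tcols = list(zip(*img)), list(zip(*template))
    width = len(tcols)
    flat_t = [p for col in tcols for p in col]  # top-to-bottom, then left-to-right
    best_x, best_score = None, -2.0
    for x in range(len(cols) - width + 1):
        window = [p for col in cols[x:x + width] for p in col]
        s = ncc(window, flat_t)
        if s > best_score:
            best_x, best_score = x, s
    return best_x, best_score

# Toy grayscale text line (3 rows) and a 2-column "trained" glyph.
line = [
    [10, 10, 200, 50, 10, 10],
    [10, 10, 50, 200, 10, 10],
    [10, 10, 200, 50, 10, 10],
]
glyph = [
    [200, 50],
    [50, 200],
    [200, 50],
]
best_x, best_score = match_template(line, glyph)
print(best_x, round(best_score, 3))  # 2 1.0
```

Because the comparison uses the raw gray levels, no threshold is needed before matching.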

answered Oct 19 '22 20:10

Rob Audenaerde


Tags:

language-agnostic

image-processing

opencv

ocr

image-segmentation