How can I segment if the characters are connected? I just tried using watershed with distance transform (http://opencv-code.com/tutorials/count-and-segment-overlapping-objects-with-watershed-and-distance-transform/) to find the number of components but it seems that it does not perform well. <ol> <li>It requires the object to be separated after a threshold in order to perform well.</li> </ol> Having said so, how can I segment the characters effectively? Need helps/ideas. <img src="https://i.stack.imgur.com/ssAmt.png" alt="slightly connected"> As attached is the example of binary image. <img src="https://i.stack.imgur.com/xNoUh.png" alt="heavily connected"> An example of heavily connected. Ans: @mmgp this is my o/p <img src="https://i.stack.imgur.com/J5sYB.png" alt="BP"><img src="https://i.stack.imgur.com/ocec1.png" alt="o/p">

I believe there are two approaches here: 1) redo the binarization step that led to these images you have right now; 2) consider different possibilities based on image size. Let us focus on the second approach given the question. In your smallest image, only two digits are connected, and that happens only when considering 8-connectivity. If you handle your image as 4-connected, then there is nothing to do because there are no two components connected that should be separated. This is shown below. The right image can be obtained simply by finding the points that are connected to another one only when considering 8-connectivity. In this case, there are only two such points, and by removing them we disconnect the two digits '1'. <img src="https://i.stack.imgur.com/kWbiw.png" alt="enter image description here"> <img src="https://i.stack.imgur.com/jOU6z.png" alt="enter image description here"> In your other image this is no longer the case. And I don't have a simple method to apply on it that can be applied on the smaller image without making it worse. But, actually, we could consider upscaling both images to some common size, using interpolation by nearest neighbor so we don't move from the binary representation. By resizing both of your images so they width equal to 200, and keeping the aspect ratio, we can apply the following morphological method to both of them. First do a thinning: <img src="https://i.stack.imgur.com/bY4gG.png" alt="enter image description here"> Now, as can be seen, the morphological branch points are the ones connecting your digits (there is another one at the left-most digit 'six' too, which will be handled). We can extract these branch points and apply a morphological closing with a vertical line of 2*height+1 (height is from your image), so no matter where the point is, its closing will produce a full vertical line. Since your image is not so small anymore, this line doesn't need to be 1 point-wide, in fact I considered a line that is 6 points-wide. Since some of the branch points are horizontally close, this closing operation will join them in the same vertical line. If a branch point is not close to another, then performing an erosion will remove a vertical line. And, by doing this, we eliminate the branch point related to the digit six at left. After applying these steps, we obtain the following image at left. Subtracting the original image from it, we get the image at right. <img src="https://i.stack.imgur.com/gGWZF.png" alt="enter image description here"> <img src="https://i.stack.imgur.com/ZoOvb.png" alt="enter image description here"> If we apply these same steps to the '8011' image, we end with the exactly same image as we started with. But this is still good, because applying the simple method that remove points that are only connected in 8-connectivity, we obtain the separated components as before.

Segmentation for connected characters

Tags:

opencv

image-segmentation

How can I segment if the characters are connected? I just tried using watershed with distance transform (http://opencv-code.com/tutorials/count-and-segment-overlapping-objects-with-watershed-and-distance-transform/) to find the number of components but it seems that it does not perform well.

It requires the object to be separated after a threshold in order to perform well.

Having said so, how can I segment the characters effectively? Need helps/ideas.

slightly connected As attached is the example of binary image.

An example of heavily connected.

Ans:

@mmgp this is my o/p

o/p

973

asked Jan 08 '13 09:01

Mzk

2 Answers

I believe there are two approaches here: 1) redo the binarization step that led to these images you have right now; 2) consider different possibilities based on image size. Let us focus on the second approach given the question.

In your smallest image, only two digits are connected, and that happens only when considering 8-connectivity. If you handle your image as 4-connected, then there is nothing to do because there are no two components connected that should be separated. This is shown below. The right image can be obtained simply by finding the points that are connected to another one only when considering 8-connectivity. In this case, there are only two such points, and by removing them we disconnect the two digits '1'.

enter image description here

In your other image this is no longer the case. And I don't have a simple method to apply on it that can be applied on the smaller image without making it worse. But, actually, we could consider upscaling both images to some common size, using interpolation by nearest neighbor so we don't move from the binary representation. By resizing both of your images so they width equal to 200, and keeping the aspect ratio, we can apply the following morphological method to both of them. First do a thinning:

enter image description here

Now, as can be seen, the morphological branch points are the ones connecting your digits (there is another one at the left-most digit 'six' too, which will be handled). We can extract these branch points and apply a morphological closing with a vertical line of 2*height+1 (height is from your image), so no matter where the point is, its closing will produce a full vertical line. Since your image is not so small anymore, this line doesn't need to be 1 point-wide, in fact I considered a line that is 6 points-wide. Since some of the branch points are horizontally close, this closing operation will join them in the same vertical line. If a branch point is not close to another, then performing an erosion will remove a vertical line. And, by doing this, we eliminate the branch point related to the digit six at left. After applying these steps, we obtain the following image at left. Subtracting the original image from it, we get the image at right.

enter image description here

If we apply these same steps to the '8011' image, we end with the exactly same image as we started with. But this is still good, because applying the simple method that remove points that are only connected in 8-connectivity, we obtain the separated components as before.

118

answered Sep 19 '22 14:09

mmgp

It is common to use "smearing algorithms" for this. Also known as Run Length Smoothing Algorithm (RLSA). It is a method that segments black and white images into blocks. You can find some information here or look around on the internet to find an implementation of the algorithm.

answered Sep 19 '22 14:09

diip_thomas

Related questions
                            
                                Is there a simple method to highlight the mask?
                            
                                Triangle detection using OpenCV
                            
                                OpenCV: apply Rotation matrix from Rodrigues() to a point
                            
                                Three different types of output when reading an image with three different libraries in Python
                            
                                How to crop the biggest object in image with python opencv?
                            
                                Is it possible to extract text from specific portion of image using pytesseract
                            
                                How to remove extra whitespace from image in opencv? [duplicate]
                            
                                How to get the OpenCV image from Python and use it in C++ in pybind11?
                            
                                SIFT & SURF : Set OPENCV_ENABLE_NONFREE CMake ==> Solution OpenCV 3 & OpenCV 4 [closed]
                            
                                How can I speed up array generations in python?
                            
                                What is the particular implementation of Probabilistic Hough Transform in OpenCV?
                            
                                Need help on CvSVM
                            
                                How to get clear image after low frequency suppression of image?
                            
                                Computer vision to calculate the digit (finger) ratio
                            
                                Parameters of opencv_traincascade
                            
                                Convert cv::Mat to Magick::Image
                            
                                Feature Detection in OpenCV Python Bindings
                            
                                Classifying lines with opencv
                            
                                position recognition of simple fiducial in image
                            
                                Open CV Features Extraction and Image Matching

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Segmentation for connected characters

Tags:

opencv

image-segmentation

Mzk

People also ask

2 Answers

mmgp

diip_thomas

Recent Activity

Donate For Us