I'm trying to crack a CAPTCHA image but I can't find a way to segment the characters.
I have this image:
I applied some filters and thresholding which resulted in this image:
Now I need to segment the image to be used in a classifier such as SVM or ANN.
The problem is that some characters are connected, and I couldn't find a way to separate them.
More image examples:
Do anyone have an approach to segment the image and get the characters?
It seems to me that your characters have a maximum stroke width. Whenever you find a horizontal row of black pixels that is wider than this width, that indicates that two characters are joined here.
So
Your approach seems to be too bottom-up to me.
The number of characters is constant and they seem to be mono-spaced
So just split the image by the width and make sure the features that you retrieve are rotational/scale invariant.
This can then be fed into an ANN. I don't see why you have to segment the characters.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With