Generating good training data for haar cascades

Tags:

haar-classifier

I am trying to build haar cascades for doing OCR of a specific font; one classifier per character.

I can generate tons of training data just by drawing the font onto images. So, the plan is to generate positive training data for each character, and use the examples of other characters as negative training data.

I am wondering how much variation I should put into the training data. Normally I'd just try everything, but I gather these things take days to train (for each character!) so some advice would be good.

So, a few questions:

Does the training algorithm recognise that I don't care about transparent pixels? Or will it perform better if I superimpose the characters over different backgrounds?
Should I include images where each character is shown with different prefixes and suffixes, or should I just treat each character individually?
Should I include images where the character is scaled up and down? I gather the algorithm pretty much ignores size, and scales everything down for efficiency anyway?

Thanks!

632

asked Mar 12 '15 23:03

Dave

1 Answers

Does the training algorithm recognise that I don't care about transparent pixels? Or will it perform better if I superimpose the characters over different backgrounds?

The more "noise" you give your images on the parts of the training data then the more robust it will be, but yes the longer it will take to train. This is however where your negative sampels will come into action. If you have as many negative training samples as possible with as many ranges as possible then you will create more robust detectors. THat being said, if you have a particular use case in mind then I would suggest skewing your training sets slightly to match that, it will be less robust but much better in your application.

Should I include images where each character is shown with different prefixes and suffixes, or should I just treat each character individually?

If you want to detect individual letters, then train individually. If you train it to detect "ABC" and you only want "A" then it is going to start getting mixed messages. Simply train each letter "A", "B" etc and then your detector should be able to pick out each individual letter in larger images.

Should I include images where the character is scaled up and down? I gather the algorithm pretty much ignores size, and scales everything down for efficiency anyway?

I don't believe this is correct. AFAIK the HAAR algorithm cannot scale down a trained image. So if you train all your images on 50x50 letters but the letters in your images are 25x25 then you won't detect them. If you train and detect the other way round however you will get results. Start small, let the algorithm change the size (up) for you.

177

answered Nov 17 '22 23:11

GPPK

Related questions
                            
                                How can I use the gluon-cv model_zoo and output to an OpenCV window with Python?
                            
                                OpenCV VideoWriter Not Writing to Output.avi
                            
                                How to fix "TypeError: Expected Ptr<cv::UMat> for argument '%s'"
                            
                                Tracking of rotating objects using opencv
                            
                                Computing HOG features
                            
                                Determining template type when accessing OpenCV Mat elements
                            
                                tbb.dll not found
                            
                                Convert android preview frame to OpenCV Mat
                            
                                drawing sine wave using opencv
                            
                                Using OpenCV Random Forests, is there any way to obtain the "confidence" level for a classification?
                            
                                OpenCV RotatedRect with specified angle
                            
                                Image Processing on CUDA or OpenCV?
                            
                                What does cvHaarDetectObjects() method do?
                            
                                Balancing contrast and brightness between stitched images
                            
                                Parts Recognition / Classification with OpenCV
                            
                                How to make the sample run Open CV
                            
                                Load .yml file into hashmaps using snakeyaml (import junit library)
                            
                                Train our own classifier
                            
                                How to extract specific area of image
                            
                                Error with homebrew + opencv + libpng

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With