Object detection + segmentation

I'm trying to find an efficient way, of acceptable complexity, to

detect an object in an image so I can isolate it from its surroundings
segment that object into its sub-parts and label them so I can then fetch them at will

It's been 3 weeks since I entered the image processing world, and I've read about so many algorithms (SIFT, snakes, more snakes, Fourier-related, etc.) and heuristics that I don't know where to start or which one is "best" for what I'm trying to achieve. Bearing in mind that the image dataset of interest is a pretty large one, I don't even know whether I should use an algorithm implemented in OpenCV or implement one of my own.

To summarize:

Which methodology should I focus on? Why?
Should I use OpenCV for that kind of stuff or is there some other 'better' alternative?

Thank you in advance.

EDIT -- More info regarding the datasets

Each dataset consists of 80K images of products sharing the same

concept e.g. t-shirts, watches, shoes
size
orientation (90% of them)
background (95% of them)

All pictures in each dataset look almost identical, apart from the product itself. To make things a little clearer, let's consider only the 'watch dataset':

All the pictures in the set look almost exactly like this:

[sample image of a watch from the dataset]

(again, apart from the watch itself). I want to extract the strap and the dial. The thing is that there are lots of different watch styles and therefore shapes. From what I've read so far, I think I need a template algorithm that allows bending and stretching, so that it can match straps and dials of different styles.

Instead of creating three distinct templates (upper part of strap, lower part of strap, dial), it would be reasonable to create only one and segment it into 3 parts. That way, I could be confident that the parts are detected in the intended positions relative to each other, e.g. the dial would not be detected below the lower part of the strap.

Of all the algorithms/methodologies I've encountered, active shape/appearance models seem to be the most promising. Unfortunately, I haven't managed to find a decent implementation, and I'm not confident enough that this is the best approach to go ahead and write one myself.

If anyone could point out what I should really be looking for (algorithm/heuristic/library/etc.), I would be more than grateful. If you think my description is a bit vague, feel free to ask for a more detailed one.

asked Aug 28 '11 13:08

sawidis

1 Answer

From what you've said, here are a few things that pop up at first glance:

The simplest thing to do is binarize the image and run Connected Components using OpenCV or the cvBlob library. For simple images with a non-complex background, this usually yields the objects.
However, looking at your sample image, texture-based segmentation techniques may work better: the watch dial, the straps and the background vary widely in texture/roughness, and this could be an ideal way to separate them.

The roughness of a region can be found easily with the Eigen transform (explained a bit on SO; check the link to the research paper provided there). The Mean Shift filter can then be applied to the output of the Eigen transform, which gives regions clearly separated according to texture. Both pyramidal Mean Shift and finding eigenvalues by SVD are implemented in OpenCV, so unless you can optimize your own code, it's better (and easier) to use the built-in functions where they exist, as far as speed and efficiency are concerned.

answered Sep 28 '22 09:09

AruniRC

Tags: image, image-processing, opencv, object-detection
