I want to identify lego bricks for building a lego sorting machine (I use c++ with opencv). That means I have to distinguish between objects which look very similar. The bricks are coming to my camera individually on a flat conveyer. But they might lay in any possible way: upside down, on the side or "normal". My approach is to teach the sorting machine the bricks by taping them with the camera in lots of different positions and rotations. Features of each and every view are calculated by surf-algorythm. <pre class="prettyprint"><code>void calculateFeatures(const cv::Mat& image, std::vector<cv::KeyPoint>& keypoints, cv::Mat& descriptors) { // detector == cv::SurfFeatureDetector(10) detector->detect(image,keypoints); // extractor == cv::SurfDescriptorExtractor() extractor->compute(image,keypoints,descriptors); } </code></pre> If there is an unknown brick (the brick that i want to sort) its features also get calculated and matched with known ones. To find wrongly matched features I proceed as described in the book OpenCV 2 Cookbook: <ol> <li> with the matcher (=cv::BFMatcher(cv::NORM_L2)) the two nearest neighbours in both directions are searched <pre class="prettyprint"><code>matcher.knnMatch(descriptorsImage1, descriptorsImage2, matches1, 2); matcher.knnMatch(descriptorsImage2, descriptorsImage1, matches2, 2); </code></pre> </li> <li> I check the ratio between the distances of the found nearest neighbours. If the two distances are very similar it's likely that a false value is used. <pre class="prettyprint"><code>// loop for matches1 and matches2 for(iterator matchIterator over all matches) if( ((*matchIterator)[0].distance / (*matchIterator)[1].distance) > 0.65 ) throw away </code></pre> </li> <li> Finally only symmatrical match-pairs are accepted. These are matches in which not only n1 is the nearest neighbour to feature f1, but also f1 is the nearest neighbour to n1. <pre class="prettyprint"><code>for(iterator matchIterator1 over all matches) for(iterator matchIterator2 over all matches) if ((*matchIterator1)[0].queryIdx == (*matchIterator2)[0].trainIdx && (*matchIterator2)[0].queryIdx == (*matchIterator1)[0].trainIdx) // good Match </code></pre> </li> </ol> Now only pretty good matches remain. To filter out some more bad matches I check which matches fit the projection of img1 on img2 using the fundamental matrix. <pre class="prettyprint"><code>std::vector<uchar> inliers(points1.size(),0); cv::findFundamentalMat( cv::Mat(points1),cv::Mat(points2), // matching points inliers, CV_FM_RANSAC, 3, 0.99); std::vector<cv::DMatch> goodMatches // extract the surviving (inliers) matches std::vector<uchar>::const_iterator itIn= inliers.begin(); std::vector<cv::DMatch>::const_iterator itM= allMatches.begin(); // for all matches for ( ;itIn!= inliers.end(); ++itIn, ++itM) if (*itIn) // it is a valid match </code></pre> <img src="https://i.stack.imgur.com/Nvozq.jpg" alt="good matches"> The result is pretty good. But in cases of extreme alikeness faults still occur. In the picture above you can see that a similar brick is recognized well. <img src="https://i.stack.imgur.com/qymzR.jpg" alt="bad matches"> However in the second picture a wrong brick is recognized just as well. Now the question is how I could improve the matching. I had two different ideas: <img src="https://i.stack.imgur.com/iS6Ac.png" alt="All possible brick views"> <ul> <li>The matches in the second picture trace back to the features really fitting, but only if the visual field is intensely changed. To recognize a brick I have to compare it in many different positions anyway (at least as shown in figure three). This means I know that I am only allowed to minimally change the visual field. The information how intensely the visual field is changed should be hidden in the fundamental matrix. How can I read out of this matrix how far the position in the room has changed? Especially the rotation and strong scaling should be of interest; if the brick once is taped farer on the left side this shouldn't matter.</li> <li>Second idea: I calculated the fundamental matrix out of 2 pictures and filtered out features that don't fit the projections - shouldn't there be a way to do the same using three or more pictures? (keyword Trifocal tensor). This way the matching should become more stable. But I neither know how to do this using OpenCV nor could I find any information on this on google.</li> </ul>

I don't have a complete answer, but I have a few suggestions. On the image analysis side: <ul> <li>It looks like your camera setup is pretty constant. Easy to just separate the brick from the background. I also see your system finding features in the background. This is unnecessary. Set all non-brick pixels to black to remove them from the analysis.</li> <li>When you have located just the brick, your first step should be to just filter likely candidates based on the size (i.e. number of pixels) in the brick. That way the example faulty match you show is already less likely.</li> <li>You can take other features into account such as the aspect ratio of the bounding box of the brick, the major and minor axes (eigevectors of the covariance matrix of the central moments) of the brick etc.</li> </ul> These simpler features will give you a reasonable first filter to limit your search space. On the mechanical side: <ul> <li>If bricks are actually coming down a conveyor you should be able to "straighten" the bricks along a straight edge using something like a rod that lies at an angle to the direction of the conveyor across the belt so that the bricks arrive more uniformly at your camera like so.</li> <li>Similar to the previous point, you could use something like a very loose brush suspended across the belt to topple bricks standing up as they pass. </li> </ul> Again both these points will limit your search space.

distinguishing objects with opencv

Tags:

algorithm

image-processing

opencv

computer-vision

object-detection

I want to identify lego bricks for building a lego sorting machine (I use c++ with opencv). That means I have to distinguish between objects which look very similar.

The bricks are coming to my camera individually on a flat conveyer. But they might lay in any possible way: upside down, on the side or "normal".

My approach is to teach the sorting machine the bricks by taping them with the camera in lots of different positions and rotations. Features of each and every view are calculated by surf-algorythm.

void calculateFeatures(const cv::Mat& image,
        std::vector<cv::KeyPoint>& keypoints,
        cv::Mat& descriptors)
{
  // detector == cv::SurfFeatureDetector(10)
  detector->detect(image,keypoints);
  // extractor == cv::SurfDescriptorExtractor()
  extractor->compute(image,keypoints,descriptors);
}

If there is an unknown brick (the brick that i want to sort) its features also get calculated and matched with known ones. To find wrongly matched features I proceed as described in the book OpenCV 2 Cookbook:

with the matcher (=cv::BFMatcher(cv::NORM_L2)) the two nearest neighbours in both directions are searched

matcher.knnMatch(descriptorsImage1, descriptorsImage2,
  matches1,
      2);
matcher.knnMatch(descriptorsImage2, descriptorsImage1,
  matches2,
  2);

I check the ratio between the distances of the found nearest neighbours. If the two distances are very similar it's likely that a false value is used.

// loop for matches1 and matches2
for(iterator matchIterator over all matches)
  if( ((*matchIterator)[0].distance / (*matchIterator)[1].distance) > 0.65 )
    throw away

Finally only symmatrical match-pairs are accepted. These are matches in which not only n1 is the nearest neighbour to feature f1, but also f1 is the nearest neighbour to n1.

for(iterator matchIterator1 over all matches)
  for(iterator matchIterator2 over all matches)
    if ((*matchIterator1)[0].queryIdx == (*matchIterator2)[0].trainIdx  &&
    (*matchIterator2)[0].queryIdx == (*matchIterator1)[0].trainIdx)
      // good Match

Now only pretty good matches remain. To filter out some more bad matches I check which matches fit the projection of img1 on img2 using the fundamental matrix.

std::vector<uchar> inliers(points1.size(),0);
cv::findFundamentalMat(
    cv::Mat(points1),cv::Mat(points2), // matching points
    inliers,
    CV_FM_RANSAC,
    3,
    0.99);

std::vector<cv::DMatch> goodMatches
// extract the surviving (inliers) matches
std::vector<uchar>::const_iterator itIn= inliers.begin();
std::vector<cv::DMatch>::const_iterator itM= allMatches.begin();
// for all matches
for ( ;itIn!= inliers.end(); ++itIn, ++itM)

  if (*itIn)
    // it is a valid match

good matches The result is pretty good. But in cases of extreme alikeness faults still occur.
In the picture above you can see that a similar brick is recognized well.

bad matches However in the second picture a wrong brick is recognized just as well.

Now the question is how I could improve the matching.

I had two different ideas:

All possible brick views

The matches in the second picture trace back to the features really fitting, but only if the visual field is intensely changed. To recognize a brick I have to compare it in many different positions anyway (at least as shown in figure three). This means I know that I am only allowed to minimally change the visual field. The information how intensely the visual field is changed should be hidden in the fundamental matrix. How can I read out of this matrix how far the position in the room has changed? Especially the rotation and strong scaling should be of interest; if the brick once is taped farer on the left side this shouldn't matter.
Second idea:
I calculated the fundamental matrix out of 2 pictures and filtered out features that don't fit the projections - shouldn't there be a way to do the same using three or more pictures? (keyword Trifocal tensor). This way the matching should become more stable. But I neither know how to do this using OpenCV nor could I find any information on this on google.

502

asked Sep 24 '13 07:09

Ulrikop

1 Answers

I don't have a complete answer, but I have a few suggestions.

On the image analysis side:

It looks like your camera setup is pretty constant. Easy to just separate the brick from the background. I also see your system finding features in the background. This is unnecessary. Set all non-brick pixels to black to remove them from the analysis.
When you have located just the brick, your first step should be to just filter likely candidates based on the size (i.e. number of pixels) in the brick. That way the example faulty match you show is already less likely.
You can take other features into account such as the aspect ratio of the bounding box of the brick, the major and minor axes (eigevectors of the covariance matrix of the central moments) of the brick etc.

These simpler features will give you a reasonable first filter to limit your search space.

On the mechanical side:

If bricks are actually coming down a conveyor you should be able to "straighten" the bricks along a straight edge using something like a rod that lies at an angle to the direction of the conveyor across the belt so that the bricks arrive more uniformly at your camera like so.
Similar to the previous point, you could use something like a very loose brush suspended across the belt to topple bricks standing up as they pass.

Again both these points will limit your search space.

answered Sep 19 '22 00:09

jilles de wit

Related questions
                            
                                Bug when implement "check point inside triangle" algorithm
                            
                                Simple explanation of Frederickson's heap selection algorithm
                            
                                Card Shuffling (SPOJ / Interviewstreet)
                            
                                Java mergesort, should the "merge" step be done with queues or arrays?
                            
                                Finding the minimum cycle path in a dynamically directed graph
                            
                                Geometric pattern quality and filling
                            
                                Fast and efficient computation on arrays
                            
                                Longest common contiguous subsequence - algorithm
                            
                                Can't get clean output in my MATLAB implementation of Canny-Deriche
                            
                                Frequent Itemsets & Association Rules - Apriori Algorithm
                            
                                Enumerate graphs under edge and symmetry constraints
                            
                                LocalBridge of degree k in Graph
                            
                                Test case for Insertion Sort, MergeSort and Quick Sort
                            
                                Find available "number" in a 2d array
                            
                                Finding all pairs of sequences that differ at exactly one position
                            
                                Recognize a Matrix in a group of points
                            
                                How to fix my numberOfDigits function
                            
                                Finding seeds for a 5 byte PRNG
                            
                                Find medians in multiple sub ranges of a unordered list
                            
                                Nearest neighbor 1 dimensional data with a specified range

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With