
What happens if I give more inputs to estimateRigidTransform or getAffineTransform?

Tags:

opencv

I am using estimateRigidTransform with two vectors of about 100 points each, and it works fine. But somehow getAffineTransform doesn't work.

I know that findHomography finds the best matrix using RANSAC and getPerspectiveTransform needs only 4 points.

My question is: what happens if I give more inputs to estimateRigidTransform or getAffineTransform?

Does it take only 4 points from the input matrix, or does it do some kind of RANSAC?

asked Aug 19 '14 by nelya9227


People also ask

What does the warpAffine function do?

warpAffine() applies an affine transformation to an image. You just need to provide the transformation matrix (M). Below is a sample where the image is translated by (100, 50).
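For instance, a minimal version in C++ (the file name is a placeholder):

#include <opencv2/opencv.hpp>
using namespace cv;

int main()
{
    Mat src = imread("input.jpg");   // placeholder path

    // 2x3 matrix for a pure translation by (100, 50)
    Mat M = (Mat_<double>(2, 3) << 1, 0, 100,
                                   0, 1, 50);

    Mat dst;
    warpAffine(src, dst, M, src.size());

    imshow("translated", dst);
    waitKey(0);
    return 0;
}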

How do you calculate an affine transformation?

The affine transforms scale, rotate, and shear are actually linear transforms and can be represented by a matrix multiplication of a point represented as a vector:

    [x']   [ax + by]   [a b] [x]
    [y'] = [dx + ey] = [d e] [y]

or x' = Mx, where M is the matrix.

What is an affine transform in Python?

In an affine transformation, all parallel lines in the original image remain parallel in the output image. To find the transformation matrix, we need three points from the input image and their corresponding locations in the output image.
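As a concrete illustration (in C++, to match the answer's code below; the coordinates are made up), exactly three point pairs yield the 2x3 matrix:

#include <iostream>
#include <opencv2/opencv.hpp>
using namespace cv;

int main()
{
    // three points in the input image and their desired output locations (made-up values)
    Point2f src[3] = { Point2f(0, 0), Point2f(100, 0), Point2f(0, 100) };
    Point2f dst[3] = { Point2f(10, 20), Point2f(115, 25), Point2f(5, 130) };

    Mat M = getAffineTransform(src, dst);   // 2x3, CV_64F
    std::cout << M << std::endl;
    return 0;
}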


1 Answer

The functions you mentioned can be split into 3 different types:

Type 1: getAffineTransform and getPerspectiveTransform. Given 3 points on one plane and 3 matching points on another, you can calculate the affine transform between those planes; given 4 points, you can find the perspective transform. That is all getAffineTransform and getPerspectiveTransform can do: they require exactly 3 and 4 pairs of points respectively, no more, no less, and they calculate the corresponding transform. The one and only.
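For example (coordinates invented for illustration), the perspective variant takes exactly four pairs and returns a 3x3 matrix:

#include <iostream>
#include <opencv2/opencv.hpp>
using namespace cv;

int main()
{
    // exactly 4 point pairs: a rectangle and the quadrilateral it maps to (made-up values)
    Point2f src[4] = { Point2f(0, 0), Point2f(300, 0), Point2f(300, 200), Point2f(0, 200) };
    Point2f dst[4] = { Point2f(10, 15), Point2f(290, 5), Point2f(310, 210), Point2f(0, 190) };

    Mat P = getPerspectiveTransform(src, dst);   // 3x3, CV_64F
    std::cout << P << std::endl;
    return 0;
}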

Type 2: estimateRigidTransform. If you can't get your points with absolute precision (which is normally the case when you get them from an image), then you need more than 3 pairs of points to reduce the error. The more the merrier (i.e. better accuracy). There is more than one way to define the error you want to reduce, as well as more than one approach to finding the minimal error. estimateRigidTransform minimizes the least-squares error, probably the most popular error definition. It does this by solving a system of equations. If you provide exactly 3 points, then the result will of course be the same as the result of getAffineTransform. You may ask why we need getAffineTransform in OpenCV at all if estimateRigidTransform can do its work. Alas, this is not the only redundancy in OpenCV.
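One could call it with many point pairs like this (the data is synthetic; note that later OpenCV releases deprecated estimateRigidTransform in favor of estimateAffine2D, so this assumes a version that still ships it):

#include <iostream>
#include <vector>
#include <opencv2/opencv.hpp>
using namespace cv;

int main()
{
    // synthetic data: 100 points on a grid, mapped by a known affine transform
    std::vector<Point2f> src, dst;
    for (int i = 0; i < 100; i++)
    {
        Point2f p((i % 10) * 15.f, (i / 10) * 15.f);
        src.push_back(p);
        dst.push_back(Point2f(0.9f * p.x - 0.1f * p.y + 5.f,
                              0.1f * p.x + 0.9f * p.y - 3.f));
    }

    // fullAffine = true -> general 6-DOF affine; false -> rigid motion + uniform scale
    Mat M = estimateRigidTransform(src, dst, true);
    std::cout << M << std::endl;   // 2x3, CV_64F; empty if estimation failed
    return 0;
}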

Type 3: findHomography. This one is more advanced. Not only can it deal with errors in the locations of points, it can also deal with the presence of outliers. If there are some wrong matches between points, then using them for least-squares estimation will lead to very bad accuracy. It can use RANSAC or LMedS to test possible matches and eliminate the outliers. It works like multiple iterations of estimateRigidTransform: it finds the least-squares fit for different subsets of the points. If you know that no outliers are present, then you can set the 'method' argument to 0, and it will work like estimateRigidTransform, minimizing the least-squares error over all point matches.
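For instance (synthetic matches with two deliberate outliers; in OpenCV 2.x the flag is spelled CV_RANSAC rather than RANSAC):

#include <iostream>
#include <vector>
#include <opencv2/opencv.hpp>
using namespace cv;

int main()
{
    // synthetic matches: a 6x5 grid shifted by (7, -4), plus two wrong matches
    std::vector<Point2f> src, dst;
    for (int i = 0; i < 30; i++)
    {
        Point2f p((i % 6) * 20.f, (i / 6) * 20.f);
        src.push_back(p);
        dst.push_back(p + Point2f(7.f, -4.f));
    }
    dst[3]  = Point2f(500.f, 500.f);    // outlier
    dst[17] = Point2f(-200.f, 300.f);   // outlier

    // RANSAC with a 3-pixel reprojection threshold; mask marks the inliers
    Mat mask;
    Mat H = findHomography(src, dst, RANSAC, 3.0, mask);
    std::cout << H << "\ninliers: " << countNonZero(mask) << std::endl;
    return 0;
}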

Edit. Thanks to Micka for his comment.

I guess my memory was playing tricks on me. I remembered that estimateRigidTransform was implemented through a system of equations in OpenCV, but I have now checked it and see that Micka is right: it does use some hard-coded RANSAC internally. Sorry for misguiding you.

For those who are still interested in having a closed-form (least-squares) solution instead of RANSAC, here it is:

#include <cstring>   // memcpy
#include <vector>
#include <opencv2/opencv.hpp>
using namespace cv;
using namespace std;

// find the affine transformation between two point sets (least-squares fit)
static bool computeAffine(const vector<Point2d> &srcPoints, const vector<Point2d> &dstPoints, Mat &transf)
{
    // sanity check
    if ((srcPoints.size() < 3) || (srcPoints.size() != dstPoints.size()))
        return false;

    // container for output
    transf.create(2, 3, CV_64F);

    // fill the matrices
    const int n = (int)srcPoints.size(), m = 3;
    Mat A(n,m,CV_64F), xc(n,1,CV_64F), yc(n,1,CV_64F);
    for(int i=0; i<n; i++)
    {
        double x = srcPoints[i].x, y = srcPoints[i].y;
        double rowI[m] = {x, y, 1};
        Mat(1,m,CV_64F,rowI).copyTo(A.row(i));
        xc.at<double>(i,0) = dstPoints[i].x;
        yc.at<double>(i,0) = dstPoints[i].y;
    }

    // solve the normal equations (A^T A) x = A^T b, once for x and once for y
    Mat aTa, resX, resY;
    mulTransposed(A, aTa, true);
    solve(aTa, A.t()*xc, resX, DECOMP_CHOLESKY);
    solve(aTa, A.t()*yc, resY, DECOMP_CHOLESKY);

    // store result
    memcpy(transf.ptr<double>(0), resX.data, m*sizeof(double));
    memcpy(transf.ptr<double>(1), resY.data, m*sizeof(double));

    return true;
}
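A short usage sketch, assuming main() is appended to the same file as computeAffine above (the point data is hypothetical; getAffineTransform would refuse these five pairs, while the least-squares fit averages over all of them):

#include <iostream>

int main()
{
    // hypothetical noisy correspondences, roughly a translation by (5, 8)
    vector<Point2d> src, dst;
    src.push_back(Point2d(0, 0));     dst.push_back(Point2d(5, 8));
    src.push_back(Point2d(100, 0));   dst.push_back(Point2d(106, 7));
    src.push_back(Point2d(0, 100));   dst.push_back(Point2d(4, 109));
    src.push_back(Point2d(100, 100)); dst.push_back(Point2d(105, 108));
    src.push_back(Point2d(50, 50));   dst.push_back(Point2d(56, 57));

    Mat M;
    if (computeAffine(src, dst, M))
        cout << M << endl;   // 2x3 least-squares affine
    return 0;
}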
answered Sep 27 '22 by Michael Burdinov