What is the difference between solvePnP and calibrateCamera in opencv?

Tags:

opencv

calibrateCamera() provides rvec, tvec, distCoeff and cameraMatrix whereas solvePnP() takes cameraMatrix, distCoeff as input and provides rvec, tvec as output. What is the difference between these two functions?

asked Feb 13 '15 04:02

Shashank

3 Answers

calibrateCamera (doc) estimates the intrinsic parameters (i.e. the camera matrix and distortion coefficients) of a given camera. It takes as input N sets of 2D-3D correspondences, one set per image, from N images taken with the same camera from varying viewpoints (typically N=30, see this tutorial on this topic). The function returns the camera matrix and distortion coefficients of that camera. It also estimates the extrinsic parameters (i.e. position and orientation) for every view, although these are usually not used, so it returns one pair of rvec and tvec for each of the N input images.

solvePnP (doc) estimates the extrinsic parameters for a single camera image. It takes as input one set of 2D-3D correspondences from an image taken with a camera whose intrinsic parameters are known. The function returns a single pair of rvec and tvec, corresponding to the input image.

answered Oct 23 '22 23:10

BConic

`cv::calibrateCamera(...)`

The function estimates the following parameters of a monocular camera from several views of a calibration pattern whose geometry is known (e.g. a chessboard):

The linear intrinsic parameters: the focal lengths in pixels (these are basically scale factors), the principal point, which would ideally be at the center of the image, and sometimes a skew coefficient between the x and y axes (this is often zero).
The non-linear intrinsic parameters: the parameters above form the linear camera matrix, but the transformation from 3D camera coordinates to the 2D image plane also has non-linear parameters, namely the lens distortion.
The extrinsic parameters: the transformation matrix between the 3D world and 3D camera coordinate systems.
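For reference, the three groups of parameters can be written out as the usual pinhole model. The symbols are my own notation and do not appear in the question: $\gamma$ is the skew, $(R, t)$ are the extrinsics, and $(k_1, k_2, p_1, p_2, k_3)$ are the five distortion coefficients of OpenCV's default model.

```latex
% Linear part: extrinsics [R | t] followed by the camera matrix K
s \begin{bmatrix} u \\ v \\ 1 \end{bmatrix}
  = K \, [\, R \mid t \,]
    \begin{bmatrix} X \\ Y \\ Z \\ 1 \end{bmatrix},
\qquad
K = \begin{bmatrix} f_x & \gamma & c_x \\ 0 & f_y & c_y \\ 0 & 0 & 1 \end{bmatrix}

% Non-linear part: distortion applied to the normalized coordinates
% x' = X_c / Z_c, \; y' = Y_c / Z_c, \; r^2 = x'^2 + y'^2
x'' = x' (1 + k_1 r^2 + k_2 r^4 + k_3 r^6) + 2 p_1 x' y' + p_2 (r^2 + 2 x'^2)
y'' = y' (1 + k_1 r^2 + k_2 r^4 + k_3 r^6) + p_1 (r^2 + 2 y'^2) + 2 p_2 x' y'
```

With distortion, the pixel coordinates are obtained by applying $K$ to $(x'', y'', 1)^T$ instead of $(x', y', 1)^T$.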

The estimation of the above parameters is based on 2D-3D correspondences: 2D points are detected in each image (e.g. chessboard corners), and the corresponding 3D object points are specified from the known geometry of the pattern. In the simplest case the function performs the following steps (the details depend on the flags of cv::calibrateCamera(..., int flags, ...)):

Computes initial values for the linear intrinsic parameters and sets the non-linear ones to zero.
Estimates the initial camera pose (extrinsics) for each view based on the approximated intrinsics. This is done using cv::solvePnP(...).
Runs the Levenberg-Marquardt optimization algorithm to minimize the re-projection error between the detected 2D image points and the 2D projections of the 3D object points. The projections are computed as in cv::projectPoints(...).

`cv::solvePnP(...)`

At this point I have also implicitly answered what cv::solvePnP(...) is for, since it is a part of cv::calibrateCamera(...). Once you have the intrinsics of a camera, you can assume that they will not change (unless you change the optics or the zoom). The extrinsics, on the other hand, can change: you can rotate the camera or move it to another location. Changing an object's pose relative to the camera is an equivalent scenario, and this is what cv::solvePnP(...) is used for.

The function estimates the object pose given:

A set of 3D object points in a model coordinate system (can be the 3D world as well),
Their 2D projections on the image plane,
The linear and non-linear intrinsic parameters.

The output of cv::solvePnP(...) is given as a rotation vector (rvec) together with a translation vector (tvec) that bring the 3D object points from the model coordinate system to the 3D camera coordinate system.

answered Oct 24 '22 01:10

Kornel

calibrateCamera() provides rvec, tvec, distCoeff and cameraMatrix: distCoeff describes the distortion of the image, and cameraMatrix provides the center of the image (Cx and Cy, the projection center) and the focal lengths (Fx and Fy). These are called intrinsic parameters. Unless you change the aperture/focus of the camera, they will remain the same. [It also provides rvec and tvec, although I do not yet know what they could be used for. They give the position of the camera in the real world, and are also known as extrinsic parameters.]

solvePnP() takes cameraMatrix and distCoeff as input and provides rvec and tvec: using Cx, Cy, Fx and Fy, it can estimate the current position of the camera, i.e. the extrinsic parameters. In other words, first use calibrateCamera() to obtain cameraMatrix and distCoeff. Then pass them to solvePnP(), and it will tell you the rotation (rvec) and translation (tvec) of the camera as you move it with respect to your real-world object (which presumably carries some marker).

answered Oct 23 '22 23:10

SanD
