In a calibrated stereo-vision rig, how does one obtain the "camera matrices" needed for implementing a 3D triangulation algorithm?

Tags:

I am trying to implement the (relatively simple) linear homogeneous (DLT) 3D triangulation method from Hartley & Zisserman's "Multiple View Geometry" (sec 12.2), with the aim of implementing their full, "optimal algorithm" in the future. Right now, based on this question, I'm trying to get it to work in Matlab, and will later port it into C++ and OpenCV, testing for conformity along the way.

The problem is that I'm unsure how to use the data I have. I have calibrated my stereo rig, and obtained the two intrinsic camera matrices, two vectors of distortion coefficients, the rotation matrix and translation vector relating the two cameras, as well as the essential and fundamental matrices. I also have the 2D coordinates of two points that are supposed to be correspondences of a single 3D point in the coordinate systems of the two images (taken by the 1st and 2nd camera respectively).

The algorithm takes as input the two point coordinates and two 4x3 "camera matrices" P and P'. These aren't obviously the intrinsic camera matrices (M, M') obtained from the calibration, because for one they are 3x3, and also because projection using them alone puts a 3D point in two distinct coordinate systems, that is - the extrinsic (rotation/translation) data is missing.

The H&Z book contains information (chapter 9) on recovering the required matrices from either the fundamental or the essential matrix using SVD decomposition, but with additional problems of its own (e.g. scale ambiguity). I feel I don't need that, since I have the rotation and translation explicitly defined.

The question then is: would it be correct to use the first intrinsic matrix, with an extra column of zeros as the first "camera matrix" (P=[M|0]), and then multiply the second intrinsic matrix by a extrinsic matrix composed from the rotation matrix and the translation vector as an extra column to obtain the second required "camera matrix" (P'=M'*[R|t])? Or should it be done differently?

Thanks!

748

asked May 08 '12 16:05

neuviemeporte

1 Answers

I don't have my H&Z to hand - but their old CVPR tutorial on the subject is here (for anyone else to have a look at w.r.t this question).

Just for clarity (and to use their terminology) the projection matrix P maps from Euclidean 3-space point (X) to an image point (x) as:

x = PX

where:

P = K[ R | t ]

defined by the (3x3) camera calibration matrix K and the (3x3) rotation matrix R and translation vector (3x1) t.

The crux of the matter seems to be how to then perform triangulation using your two cameras P and P'.

I believe you are proposing that the world origin is located at a the first camera P, thus:

P = K [ I | 0]

and

P' = K' [ R | t ]

What we then seek for reconstruction in the Fundamental Matrix F such that:

x' F x = 0

The matrix F can of course be computed any number of ways (sometimes more commonly from uncalibrated images!) but here I think you might want to do it on the basis of your already calibrated camera matrices above as:

F = [P' C]_x P' pinv(P)

Where C = (0 1) is the centre of first camera and pinv(P) is the pseudo-inverse of P. The _x indicates the notation used in the literature for matrix multiplication to calculate the vector product.

You can then perform a factorization of the fundamental matrix F (performed via SVD or direct method).

F = [t]_x M

And hence, as you correctly state, we can then compute triangulation directly based on:

P = [ I | 0 ]

and

P' = [ M | t ]

Using these to perform triangulation should then be relatively straightforward (assuming good calibration, lack of noise, etc. etc.)

110

answered Sep 17 '22 02:09

timlukins

Related questions
                            
                                How to detect colors under different illumination conditions
                            
                                How to flip only one axis of transformation matrix?
                            
                                What is the meaning of rank 4 of data In the flow method of ImageDataGenerator (Keras) which has argument x
                            
                                How to remove extra whitespace from image in opencv? [duplicate]
                            
                                computer vision: extracting info about a shape given a contour (e.g. pointy, round...)
                            
                                Computing object statistics from the second central moments
                            
                                camera translation vector - relation to rotation matrix
                            
                                Need help on CvSVM
                            
                                How to get clear image after low frequency suppression of image?
                            
                                Computer vision to calculate the digit (finger) ratio
                            
                                Feature Detection in OpenCV Python Bindings
                            
                                Adaptive parameter for Canny Edge
                            
                                Does Convolutional Neural Network possess localization abilities on images?
                            
                                Point Cloud using iPhone camera
                            
                                expand MNIST - elastic deformations MATLAB
                            
                                How to extract paches from 3D image in python?
                            
                                How Yolo 3 is implemented in Yolo 4?
                            
                                Identify text areas on a Talmud page
                            
                                Moving from Wiimote to camera?
                            
                                ISampleGrabber deprecated?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

In a calibrated stereo-vision rig, how does one obtain the "camera matrices" needed for implementing a 3D triangulation algorithm?

Tags:

computer-vision

linear-algebra

stereo-3d

camera-calibration

triangulation

neuviemeporte

People also ask

1 Answers

timlukins

Recent Activity

Donate For Us