From what I've understood, tracking algorithms predict where a given object will be in the next frame (after object detection has already been performed). The object is then detected again in that next frame. What isn't clear to me is how the tracker knows to associate the object in the second frame with the one in the first, especially when there are multiple objects in the frame.
I've seen in a few places that a cost matrix is created from the Euclidean distances between each prediction and all detections, and the problem is then framed as an assignment problem, solved with the Hungarian algorithm.
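That cost-matrix-plus-assignment step can be sketched in a few lines of Python. The coordinates below are made up for illustration; `scipy.optimize.linear_sum_assignment` solves the same minimum-cost assignment problem the Hungarian algorithm solves:

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

# Predicted positions of 3 tracks (e.g. from a Kalman filter) and
# 3 detections in the new frame, as (x, y) coordinates.
predictions = np.array([[10.0, 12.0], [40.0, 40.0], [70.0, 15.0]])
detections = np.array([[41.0, 39.0], [11.0, 13.0], [69.0, 16.0]])

# Cost matrix: Euclidean distance between every prediction/detection pair.
cost = np.linalg.norm(predictions[:, None, :] - detections[None, :, :], axis=2)

# Minimum-cost one-to-one assignment (Hungarian-style matching).
track_idx, det_idx = linear_sum_assignment(cost)
for t, d in zip(track_idx, det_idx):
    print(f"track {t} -> detection {d} (distance {cost[t, d]:.2f})")
```

Each track is matched to the detection that minimizes the total cost across all pairs, which is exactly how the tracker decides that "this detection is the same object as that track."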
Is my understanding of tracking correct? Are there other ways of establishing that an object in one frame is the same as an object in the next frame?
In a typical tracking pipeline, the Kalman filter is used to predict and update the location and velocity of an object, given a video stream and detections on each of the frames. At some point, one may also want to account for camera tilt and pan by adjusting the object's predicted location according to the camera's angles of movement.
Robust extended Kalman filter: the extended Kalman filter arises by linearizing the signal model about the current state estimate and then using the linear Kalman filter to predict the next estimate.
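As a rough illustration of that linearization step (this sketch is not from the original text; the measurement model `h` and all noise values are made-up assumptions), the EKF computes the Jacobian of the nonlinear measurement function at the predicted state and then runs the standard linear update:

```python
import numpy as np

# Minimal extended-Kalman-filter step: state is (position, velocity) in 1D,
# the motion model is linear, and the measurement is a nonlinear function
# h(x) = sqrt(1 + pos^2) (e.g. range to a sensor at a fixed offset).
dt = 1.0
F = np.array([[1.0, dt], [0.0, 1.0]])   # constant-velocity transition
Q = 0.01 * np.eye(2)                    # process noise covariance (assumed)
R = np.array([[0.1]])                   # measurement noise covariance (assumed)

def h(x):                               # nonlinear measurement model
    return np.array([np.sqrt(1.0 + x[0] ** 2)])

def H_jacobian(x):                      # Jacobian of h, evaluated at x
    return np.array([[x[0] / np.sqrt(1.0 + x[0] ** 2), 0.0]])

def ekf_step(x, P, z):
    # Predict with the (linear) motion model.
    x = F @ x
    P = F @ P @ F.T + Q
    # Linearize the measurement model about the predicted state,
    # then apply the ordinary linear Kalman update.
    H = H_jacobian(x)
    S = H @ P @ H.T + R                 # innovation covariance
    K = P @ H.T @ np.linalg.inv(S)      # Kalman gain
    x = x + K @ (z - h(x))
    P = (np.eye(2) - K @ H) @ P
    return x, P

x, P = np.array([0.0, 1.0]), np.eye(2)
x, P = ekf_step(x, P, z=np.array([np.sqrt(2.0)]))
```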
Deep SORT[2] is a recent tracking algorithm that extends Simple Online and Realtime Tracking (SORT)[3] and has shown remarkable results on the Multiple Object Tracking (MOT) problem. In the MOT setting, each frame contains more than one object to track.
The Kalman Filter (KF) is a set of mathematical equations that together implement a predictor-corrector type of estimator, one that is optimal in the sense that it minimizes the estimated error covariance when certain presumed conditions are met.
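The predictor-corrector structure can be made concrete with a short sketch (the model matrices and noise values here are illustrative assumptions, not from the original text): a constant-velocity model tracks a 2-D position from noisy detections, alternating a predict step with a correct step per frame:

```python
import numpy as np

dt = 1.0
F = np.array([[1, 0, dt, 0],
              [0, 1, 0, dt],
              [0, 0, 1, 0],
              [0, 0, 0, 1]], dtype=float)   # state: [x, y, vx, vy]
H = np.array([[1, 0, 0, 0],
              [0, 1, 0, 0]], dtype=float)   # we only observe position
Q = 0.01 * np.eye(4)                        # process noise covariance (assumed)
R = 1.0 * np.eye(2)                         # measurement noise covariance (assumed)

def predict(x, P):
    """Predictor: propagate state and uncertainty through the motion model."""
    return F @ x, F @ P @ F.T + Q

def correct(x, P, z):
    """Corrector: blend the prediction with the detection z."""
    S = H @ P @ H.T + R                     # innovation covariance
    K = P @ H.T @ np.linalg.inv(S)          # Kalman gain
    x = x + K @ (z - H @ x)
    P = (np.eye(4) - K @ H) @ P
    return x, P

# Object moving diagonally at ~1 px/frame; large initial uncertainty.
x, P = np.zeros(4), 10.0 * np.eye(4)
for z in [np.array([1.0, 1.0]), np.array([2.0, 2.0]), np.array([3.0, 3.0])]:
    x, P = predict(x, P)
    x, P = correct(x, P, z)
```

After a few frames the estimated position converges toward the detections and the filter picks up the object's velocity, which is what makes the next-frame prediction useful for association.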
Your understanding is correct. You have described a simple cost function, which is likely to work well in many situations. However, there will be times when it fails.
Assuming you have the computational resources, you can make your tracker more robust by making the cost function more sophisticated.
The simplest thing you can do is take into account the error covariance of the Kalman filter, rather than just using the Euclidean distance. See the distance equation in the documentation for the vision.KalmanFilter object in MATLAB. Also see the Motion-based Multiple Object Tracking example.
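The idea behind such a covariance-aware distance can be sketched as follows (this is an illustration of the concept, not the actual MATLAB implementation): instead of the raw Euclidean distance, use the Mahalanobis distance, which weights each axis by how uncertain the filter is about it via the innovation covariance:

```python
import numpy as np

def mahalanobis_sq(z, z_pred, S):
    """Squared Mahalanobis distance between detection z and prediction z_pred,
    given innovation covariance S (= H P H' + R in Kalman filter terms)."""
    innovation = z - z_pred
    return float(innovation @ np.linalg.inv(S) @ innovation)

z_pred = np.array([10.0, 10.0])
# Suppose the filter is much more uncertain along x than along y.
S = np.array([[25.0, 0.0],
              [0.0,  1.0]])

a = np.array([15.0, 10.0])   # 5 px off along the uncertain x axis
b = np.array([10.0, 15.0])   # 5 px off along the confident y axis

# Same Euclidean distance, very different Mahalanobis distance:
d_a = mahalanobis_sq(a, z_pred, S)   # ~1.0  (5^2 / 25)
d_b = mahalanobis_sq(b, z_pred, S)   # ~25.0 (5^2 / 1)
```

A detection that deviates along a direction the filter is confident about is penalized much more heavily, which makes the assignment step far more discriminating than plain Euclidean distance.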
You can also include other information in the cost function. You could account for the fact that the size of the object should not change too much between frames, or that the object's appearance should stay the same. For example, you could compute color histograms of your detections, and define your cost function as a weighted sum of the "Kalman filter distance" and some distance between color histograms.
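A minimal sketch of such a weighted cost (the weights, histogram bins, and the histogram-intersection distance here are illustrative choices, not prescribed by the answer):

```python
import numpy as np

def hist_distance(h1, h2):
    """1 minus histogram intersection; 0 for identical normalized histograms."""
    return 1.0 - np.minimum(h1, h2).sum()

def combined_cost(motion_dist, h_track, h_det, w_motion=0.7, w_appear=0.3):
    """Weighted sum of a motion distance and an appearance distance."""
    return w_motion * motion_dist + w_appear * hist_distance(h_track, h_det)

# Two candidate detections at the SAME motion distance from a track;
# the appearance term breaks the tie.
h_track = np.array([0.5, 0.3, 0.2])   # track's stored color histogram
h_same  = np.array([0.5, 0.3, 0.2])   # detection with matching colors
h_diff  = np.array([0.1, 0.1, 0.8])   # detection with different colors

c_same = combined_cost(2.0, h_track, h_same)
c_diff = combined_cost(2.0, h_track, h_diff)
```

With motion alone the two candidates would be indistinguishable; the appearance term makes the color-consistent detection cheaper to assign, which is essentially how appearance cues resolve ambiguous associations.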