OCR: Difference between two frames

Tags:

I am trying to find an easy solution to implement the OCR algorithm from OPenCV. I am very new to Image Processing ! I am playing a video that is decoded with specific codec using RLE algorithm.

What I would like to do is that for each decoded frame, I would like to compare it with the previous one and store the pixels that have changed between the two frames.

Most of the existing solutions gives a difference between the two frames but I would like to just keep the new pixels that have changed and store it in a table and then be able to analyze every group of pixels that have changed instead of analyzing the whole image each time.

I planned to use the "blobs detection" algoritm mais I'm stuck before being able to implement it.

Today, I'm trying this:

char *prevFrame;
char *curFrame;
QVector DiffPixel<LONG>;

//for each frame
DiffPixel.push_back(curFrame-prevFrame);

enter image description here

I really want to have the "Only changed pixel result" solution. Could anyone give me some tips or correct me if I'm going to a wrong way ?

EDIT:

New question, what if there are multiple areas of changed pixels ? Will it be possible to have one table per blocs of changed pixels or will it be only one unique table ? Take the example below:

Multiple Areas Pixels

The best thing as a result would be to have 2 mat matrices. The first matrix with the first orange square and the second matrix with the second orange square. This way, it avoids having to "scan" almost the entire frame if we store the result in one matrix only with a resolution being almost the same as the full frame.

The main goal here is to minimize the area (aka the resolution) to analyze to find text.

679

asked Dec 01 '15 16:12

Robert Jones

1 Answers

After loading your images:

img1

enter image description here

img2

enter image description here

you can apply XOR operation to get the differences. The result has the same number of channels of the input images:

XOR

enter image description here

You can then create a binary mask OR-ing all channels:

mask

enter image description here

The you can copy the values of img2 that correspond to non-zero elements in the mask to a white image:

diff

enter image description here

UPDATE

If you have multiple areas where pixel changed, like this:

enter image description here

You'll find a difference mask (after binarization all non-zero pixels are set to 255) like:

enter image description here

You can then extract connected components and draw each connected component on a new black-initialized mask:

enter image description here

Then, as before, you can copy the values of img2 that correspond to non-zero elements in each mask to a white image.

enter image description here

The complete code for reference. Note that this is the code for the updated version of the answer. You can find the original code in the revision history.

#include <opencv2\opencv.hpp>
#include <vector>
using namespace cv;
using namespace std;

int main()
{
    // Load the images
    Mat img1 = imread("path_to_img1");
    Mat img2 = imread("path_to_img2");

    imshow("Img1", img1);
    imshow("Img2", img2);

    // Apply XOR operation, results in a N = img1.channels() image
    Mat maskNch = (img1 ^ img2);

    imshow("XOR", maskNch);

    // Create a binary mask

    // Split each channel
    vector<Mat1b> masks;
    split(maskNch, masks);

    // Create a black mask
    Mat1b mask(maskNch.rows, maskNch.cols, uchar(0));

    // OR with each channel of the N channels mask
    for (int i = 0; i < masks.size(); ++i)
    {
        mask |= masks[i];
    }

    // Binarize mask
    mask = mask > 0;

    imshow("Mask", mask);

    // Find connected components
    vector<vector<Point>> contours;
    findContours(mask.clone(), contours, RETR_LIST, CHAIN_APPROX_SIMPLE);

    for (int i = 0; i < contours.size(); ++i)
    {
        // Create a black mask
        Mat1b mask_i(mask.rows, mask.cols, uchar(0));
        // Draw the i-th connected component
        drawContours(mask_i, contours, i, Scalar(255), CV_FILLED);

        // Create a black image
        Mat diff_i(img2.rows, img2.cols, img2.type());
        diff_i.setTo(255);

        // Copy into diff only different pixels
        img2.copyTo(diff_i, mask_i);

        imshow("Mask " + to_string(i), mask_i);
        imshow("Diff " + to_string(i), diff_i);
    }

    waitKey();
    return 0;
}

177

answered Oct 05 '22 23:10

Miki

Related questions
                            
                                Closing the listening socket after a fork()
                            
                                using boost::karma to format latitude/longitude strings
                            
                                calling virtual method without pointing to an object?
                            
                                Strange error trying to do a shared_ptr swap()
                            
                                Give nullptr a type for template deduction
                            
                                Lambda capture reference by copy and decltype
                            
                                Global formatting options for floating point numbers
                            
                                Scope of `using namespace` within another namespace [duplicate]
                            
                                Why must unused virtual functions be defined?
                            
                                Creating a new C++ Project in Eclipse CDT with the same settings as another project
                            
                                What image formats other than "Y800" does zbar::Image::Image() accept?
                            
                                Doubling buffering in CUDA so the CPU can operate on data produced by a persistent kernel
                            
                                What's the difference between auto a = A(3) and A a(3)?
                            
                                Does this code provide memory leaks?
                            
                                decltype error C2440 cannot convert from 'int *' to 'int *&'
                            
                                Why is there no definition for std::regex_traits<char32_t> (and thus no std::basic_regex<char32_t>) provided?
                            
                                Best practices coding c++ [closed]
                            
                                Double comparison - numeric limits
                            
                                Change string by index
                            
                                Qt and Android - Get path from image in gallery

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

OCR: Difference between two frames

Tags:

c++

image-processing

opencv

blob

Robert Jones

People also ask

1 Answers

Miki

Recent Activity

Donate For Us