I am working on a program in C++ which should detect faces from webcam stream, than crop them using face landmarks and swap them. I programmed face detection using OpenCV and Viola-Jones face detection. Works fine. Than I searched for how to segment just face from ROI. I tried few skin detection implementations but none was successful. Than I found dlib face landmarks. I decided to try it. Just in beginning I faced problems because I had to convert <code>cv::Mat</code> to <code>cv_image</code>, Rect to rectangle etc. So I tried to do it just with dlib. I just get stream using <code>cv::VideoCapture</code> and than I wanted to show what is captured using dlib <code>image_window</code>. But here was the problem it was reeeealy slow. Down is used code. Commented lines are lines which do that same but using OpenCV. OpenCV is much more faster, smooth, continuous than code which is not commented whis is like 5 FPS. That's horrible. I can't imagine how slow it will be when I apply face detection and face landmarks. Am I doing something wrong? How can I make it faster? Or should I use OpenCV for video capture and showing? <pre class="prettyprint"><code>cv::VideoCapture cap; image_window output_frame; if (!cap.open(0)) { cout << "ERROR: Opening video device 0 FAILED." << endl; return -1; } cv::Mat cap_frame; //HWND hwnd; do { cap >> cap_frame; if (!cap_frame.empty()) { cv_image<bgr_pixel> dlib_frame(cap_frame); output_frame.set_image(dlib_frame); //cv::imshow("output",dlib::toMat(dlib_frame)); } //if (27 == char(cv::waitKey(10))) //{ // return 0; //} //hwnd = FindWindowA(NULL, "output"); } while(!output_frame.is_closed())//while (hwnd != NULL); </code></pre> EDIT: After switching to Release mode showing capured frames becomes fine. But I go on and tried to do face detection and shape prediction with dlib just like in example here http://dlib.net/face_landmark_detection_ex.cpp.html. It was quite laggy. So I turned off shape prediction. Still "laggy. So I assumed face detection is slowing it down. So I tried face detection using OpenCV because it was significantly better than dlib detector. I needed to convert detected cv::Rect to dlib::rectangle. I used this. <pre class="prettyprint"><code>std::vector<dlib::rectangle> dlib_rois; long l, t, r, b; for (int i = cv_rois.size() - 1; i >= 0; i--) { l = cv_rois[i].x; t = cv_rois[i].y; r = cv_rois[i].x + cv_rois[i].width; b = cv_rois[i].y + cv_rois[i].height; dlib_rois.push_back(dlib::rectangle(l, t, r, b)); } </code></pre> But this combination of OpenCV face detection and dlib shape prediction become brutal laggy. It takes about 4s to process single frame. I can't figure out why. OpenCV face detection was absolutely fine, dlib shape prediction doesn't seem to be hard to process. Can somebody help me with?

You can take several actions to make Dlib run faster, before assuming that it is slow. You only have to read more documentation and try. <ul> <li>Dlib is capable of detecting faces in very small areas (80x80 pixels). You are probably sending raw WebCam frames at approximately 1280x720 resolution, which is not necessary. I recommend from my experience to reduce the frames about a quarter of the original resolution. Yes, 320x180 is fine for Dlib. In consequence you will get 4x speed.</li> <li>As mentioned in the comments, by turning on the compilation optimizations while building Dlib, you will get significantly improvement in speed.</li> <li>Dlib works faster with grayscale images. You do not need the color on the webcam frame. You can use OpenCV to convert into grayscale the previously reduced in size frame.</li> <li>Dlib takes its time finding faces but is extremely fast finding landmarks on faces. Only if your Webcam provides a high framerate (24-30fps), you could skip some frames because faces normally doesn't move so much.</li> </ul> Given that optimizations, I am confident you will get at least 12x faster detection.

Dlib webcam capture with face detection and shape prediction is slow

Tags:

c++

opencv

dlib

webcam-capture

I am working on a program in C++ which should detect faces from webcam stream, than crop them using face landmarks and swap them.

I programmed face detection using OpenCV and Viola-Jones face detection. Works fine. Than I searched for how to segment just face from ROI. I tried few skin detection implementations but none was successful.

Than I found dlib face landmarks. I decided to try it. Just in beginning I faced problems because I had to convert cv::Mat to cv_image, Rect to rectangle etc. So I tried to do it just with dlib. I just get stream using cv::VideoCapture and than I wanted to show what is captured using dlib image_window. But here was the problem it was reeeealy slow. Down is used code. Commented lines are lines which do that same but using OpenCV. OpenCV is much more faster, smooth, continuous than code which is not commented whis is like 5 FPS. That's horrible. I can't imagine how slow it will be when I apply face detection and face landmarks.

Am I doing something wrong? How can I make it faster? Or should I use OpenCV for video capture and showing?

cv::VideoCapture cap;
image_window output_frame;

if (!cap.open(0))
{
    cout << "ERROR: Opening video device 0 FAILED." << endl;
    return -1;
}

cv::Mat cap_frame;
//HWND hwnd;
do
{
    cap >> cap_frame;

    if (!cap_frame.empty())
    {
        cv_image<bgr_pixel> dlib_frame(cap_frame);
        output_frame.set_image(dlib_frame);
        //cv::imshow("output",dlib::toMat(dlib_frame));
    }

    //if (27 == char(cv::waitKey(10)))
    //{
    //  return 0;
    //}

    //hwnd = FindWindowA(NULL, "output");
} while(!output_frame.is_closed())//while (hwnd != NULL);

EDIT: After switching to Release mode showing capured frames becomes fine. But I go on and tried to do face detection and shape prediction with dlib just like in example here http://dlib.net/face_landmark_detection_ex.cpp.html. It was quite laggy. So I turned off shape prediction. Still "laggy.

So I assumed face detection is slowing it down. So I tried face detection using OpenCV because it was significantly better than dlib detector. I needed to convert detected cv::Rect to dlib::rectangle. I used this.

std::vector<dlib::rectangle> dlib_rois;
long l, t, r, b;

for (int i = cv_rois.size() - 1; i >= 0; i--)
{
    l = cv_rois[i].x;
    t = cv_rois[i].y;
    r = cv_rois[i].x + cv_rois[i].width;
    b = cv_rois[i].y + cv_rois[i].height;
    dlib_rois.push_back(dlib::rectangle(l, t, r, b));
}

But this combination of OpenCV face detection and dlib shape prediction become brutal laggy. It takes about 4s to process single frame.

I can't figure out why. OpenCV face detection was absolutely fine, dlib shape prediction doesn't seem to be hard to process. Can somebody help me with?

505

asked Mar 27 '16 10:03

Gondil

1 Answers

You can take several actions to make Dlib run faster, before assuming that it is slow. You only have to read more documentation and try.

Dlib is capable of detecting faces in very small areas (80x80 pixels). You are probably sending raw WebCam frames at approximately 1280x720 resolution, which is not necessary. I recommend from my experience to reduce the frames about a quarter of the original resolution. Yes, 320x180 is fine for Dlib. In consequence you will get 4x speed.
As mentioned in the comments, by turning on the compilation optimizations while building Dlib, you will get significantly improvement in speed.
Dlib works faster with grayscale images. You do not need the color on the webcam frame. You can use OpenCV to convert into grayscale the previously reduced in size frame.
Dlib takes its time finding faces but is extremely fast finding landmarks on faces. Only if your Webcam provides a high framerate (24-30fps), you could skip some frames because faces normally doesn't move so much.

Given that optimizations, I am confident you will get at least 12x faster detection.

112

answered Oct 02 '22 00:10

Ezequiel Adrian

Related questions
                            
                                No op delete for unique_ptr
                            
                                what defines a recursive function?
                            
                                Will web assembly (wasm) have its own syntax?
                            
                                N-body algorithm: why is this slower in parallel?
                            
                                higher precision floating point using boost lib (higher then 16 digits)
                            
                                Return value optimization: ho can I avoid copy construction of huge STL containers.
                            
                                IsWindows10OrGreater() is failing on Windows 10
                            
                                undefined reference to `vtable for MainWindow'
                            
                                Using auto in output parameter
                            
                                What is the difference between warpPerspective and perspectiveTransform?
                            
                                Why does my Arduino Class Constructor require an argument?
                            
                                Dynamically Find the Edge of a Rectangle
                            
                                How to sort two vectors simultaneously in c++ without using boost or creating templates?
                            
                                Variable name same as function name giving compiler error... Why?
                            
                                Does order of method declarations in a class matter to the compiler?
                            
                                I want to create something like a python dictionary in C++
                            
                                Arduino reading SD file line by line C++
                            
                                How to get Position, Width and Height of Mac OS X Dock? Cocoa/Carbon/C++/Qt
                            
                                What is the need for enable_shared_from_this? [duplicate]
                            
                                Do macros in C++ improve performance?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With