Python OpenCV: Detecting a general direction of movement?

Tags:

I'm still hacking together a book scanning script, and for now, all I need is to be able to automagically detect a page turn. The book fills up 90% of the screen (I'm using a cruddy webcam for the motion detection), so when I turn a page, the direction of motion is basically in that same direction.

I have modified a motion-tracking script, but derivatives are getting me nowhere:

#!/usr/bin/env python

import cv, numpy

class Target:
    def __init__(self):
        self.capture = cv.CaptureFromCAM(0)
        cv.NamedWindow("Target", 1)

    def run(self):
        # Capture first frame to get size
        frame = cv.QueryFrame(self.capture)
        frame_size = cv.GetSize(frame)
        grey_image = cv.CreateImage(cv.GetSize(frame), cv.IPL_DEPTH_8U, 1)
        moving_average = cv.CreateImage(cv.GetSize(frame), cv.IPL_DEPTH_32F, 3)
        difference = None
        movement = []

        while True:
            # Capture frame from webcam
            color_image = cv.QueryFrame(self.capture)

            # Smooth to get rid of false positives
            cv.Smooth(color_image, color_image, cv.CV_GAUSSIAN, 3, 0)

            if not difference:
                # Initialize
                difference = cv.CloneImage(color_image)
                temp = cv.CloneImage(color_image)
                cv.ConvertScale(color_image, moving_average, 1.0, 0.0)
            else:
                cv.RunningAvg(color_image, moving_average, 0.020, None)

            # Convert the scale of the moving average.
            cv.ConvertScale(moving_average, temp, 1.0, 0.0)

            # Minus the current frame from the moving average.
            cv.AbsDiff(color_image, temp, difference)

            # Convert the image to grayscale.
            cv.CvtColor(difference, grey_image, cv.CV_RGB2GRAY)

            # Convert the image to black and white.
            cv.Threshold(grey_image, grey_image, 70, 255, cv.CV_THRESH_BINARY)

            # Dilate and erode to get object blobs
            cv.Dilate(grey_image, grey_image, None, 18)
            cv.Erode(grey_image, grey_image, None, 10)

            # Calculate movements
            storage = cv.CreateMemStorage(0)
            contour = cv.FindContours(grey_image, storage, cv.CV_RETR_CCOMP, cv.CV_CHAIN_APPROX_SIMPLE)
            points = []

            while contour:
                # Draw rectangles
                bound_rect = cv.BoundingRect(list(contour))
                contour = contour.h_next()

                pt1 = (bound_rect[0], bound_rect[1])
                pt2 = (bound_rect[0] + bound_rect[2], bound_rect[1] + bound_rect[3])
                points.append(pt1)
                points.append(pt2)
                cv.Rectangle(color_image, pt1, pt2, cv.CV_RGB(255,0,0), 1)

            num_points = len(points)

            if num_points:
                x = 0
                for point in points:
                    x += point[0]
                x /= num_points

                movement.append(x)

            if len(movement) > 0 and numpy.average(numpy.diff(movement[-30:-1])) > 0:
              print 'Left'
            else:
              print 'Right'

            # Display frame to user
            cv.ShowImage("Target", color_image)

            # Listen for ESC or ENTER key
            c = cv.WaitKey(7) % 0x100
            if c == 27 or c == 10:
                break

if __name__=="__main__":
    t = Target()
    t.run()

It detects the average motion of the average center of all of the boxes, which is extremely inefficient. How would I go about detecting such motions quickly and accurately (i.e. within a threshold)?

I'm using Python, and I plan to stick with it, as my whole framework is based on Python.

And help is appreciated, so thank you all in advance. Cheers.

605

asked Dec 21 '10 03:12

Blender

2 Answers

I haven't used OpenCV in Python before, just a bit in C++ with openframeworks.

For this I presume OpticalFlow's velx,vely properties would work.

For more on how Optical Flow works check out this paper.

HTH

answered Oct 04 '22 19:10

George Profenza

why don't you use cv.GoodFeaturesToTrack ? it may solve the script runtime ... and shorten the code ...

answered Oct 04 '22 18:10

Conex

Related questions
                            
                                Hermetic / Non Hermetic Packages in Python
                            
                                Python 3 requests how to force use a new connection for each request?
                            
                                guvectorize Not resolving types in nopython mode
                            
                                NumPy memory leak in np.ones?
                            
                                Data Synchronization framework / algorithm for server<->device?
                            
                                Can I view the doc string of a function in Python using VIM?
                            
                                Django serialization of inherited model
                            
                                Separate Admin/User authentication system in Django
                            
                                multiprocessing Pool hangs when there is a exception in any of the thread
                            
                                Is there an IntelliJ Python plugin for the community edition?
                            
                                Detect user logout / shutdown in Python / GTK under Linux - SIGTERM/HUP not received
                            
                                Pushing data once a URL is requested
                            
                                Opinions about Dabo
                            
                                The anatomy of a Python web project: development, packaging, deployment
                            
                                Optimising memory usage in numpy
                            
                                Python cProfile: how to filter out specific calls from the profiling data?
                            
                                Interactive debugging with nosetests in PyDev
                            
                                How to solve Python memory leak when using urrlib2?
                            
                                Using SlopeOne algorithm to predict if a gamer can complete a level in a Game?
                            
                                Python subprocess using import subprocess

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python OpenCV: Detecting a general direction of movement?

Tags:

python

opencv

motion-detection

Blender

People also ask

2 Answers

George Profenza

Conex

Recent Activity

Donate For Us