How do I programmatically find the pixel locations of specific features in an image?

I'm building an automated electricity / gas meter reader using OpenCV and Python. I've got as far as taking shots with a webcam:

[image: webcam shot of the meter]

I can then use an affine transform to unwarp the image (an adaptation of this example):

import cv2
import numpy as np

def unwarp_image(img):
    rows, cols = img.shape[:2]

    # Source points: hard-coded pixel coordinates in the webcam image
    left_top = 12
    left_bottom = left_top + 2
    top_left = 24
    top_right = 13
    bottom = 47
    right = 180
    srcTri = np.array([(left_top, top_left), (right, top_right), (left_bottom, bottom)], np.float32)

    # Corresponding destination points. Both sets must be of float32 type
    dst_height = 30
    dstTri = np.array([(0, 0), (cols - 1, 0), (0, dst_height)], np.float32)

    # Affine transformation: build the 2x3 matrix, then warp the image
    warp_mat = cv2.getAffineTransform(srcTri, dstTri)
    # The output size is given as (width, height), not (rows, cols)
    dst = cv2.warpAffine(img, warp_mat, (cols, dst_height))

    #cv2.imshow("crop_img", dst)
    #cv2.waitKey(0)

    return dst

...which gives me an image something like this:

[image: the unwarped result]

I still need to extract the text using some sort of OCR routine, but first I'd like to automate the part that identifies which pixel locations to apply the affine transform to, so that the software keeps working if someone knocks the webcam.

asked Jun 23 '13 09:06

Jon Cage

1 Answer

Since your image is pretty much planar, you can look into finding the homography between the image you get from the webcam and the desired image (in the upright position).

Edit: The homography will rotate the image into the upright position. Once you've registered your image (brought it into the upright position), you can compute row-wise and column-wise projections: sum all the pixels down each column to get one vector, and sum all the pixels along each row to get another. Use these vectors to find where the intensity jumps, and crop there.
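A minimal sketch of the projection idea, assuming a grayscale, already registered image in which the region of interest is brighter than its surroundings. The function names and the midpoint threshold are illustrative choices; for a dark display on a light background you would invert the image first.

```python
import numpy as np


def band_limits(gray, axis, rel_threshold=0.5):
    """Return (start, stop) of the bright band in a projection profile.

    axis=0 sums down the columns (one value per column);
    axis=1 sums along the rows (one value per row).
    """
    profile = gray.astype(np.float64).sum(axis=axis)
    lo, hi = profile.min(), profile.max()
    if hi == lo:
        return None  # flat profile: no jump to find
    # The jump is where the profile crosses a level between min and max
    idx = np.flatnonzero(profile > lo + rel_threshold * (hi - lo))
    return int(idx[0]), int(idx[-1]) + 1


def crop_to_bright_region(gray, rel_threshold=0.5):
    """Crop to the rows and columns whose projections exceed the threshold."""
    rows = band_limits(gray, axis=1, rel_threshold=rel_threshold)
    cols = band_limits(gray, axis=0, rel_threshold=rel_threshold)
    if rows is None or cols is None:
        return gray
    return gray[rows[0]:rows[1], cols[0]:cols[1]]
```

Thresholding the profile is the simplest way to locate the jump; looking at the largest values of `np.diff(profile)` is an alternative when the background is uneven.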

Alternatively, you can use the Hough transform, which finds the straight lines in an image. You can probably get away with not registering the image if you do this.

answered Oct 19 '22 06:10

Diana

Tags:

python

image-processing

opencv
