I'm trying to perform a skew on an image, like one shown here <img src="https://i.stack.imgur.com/FPYpx.png" alt=""> (source: microsoft.com) . I have an array of pixels representing my image and am unsure of what to do with them.

A much better way to do this is by inverse mapping. Essentially, you want to "warp" the image, right? Which means every pixel in the source image goes to a predefined point - the predefinition is a transformation matrix which tells you how to rotate, scale, translate, shear, etc. the image which is essentially taking some coordinate <code>(x,y)</code> on your image and saying that, "Ok, the new position for this pixel is <code>(f(x),g(y))</code>. That's essentially what "warping" does. Now, think about scaling an image ... say, to ten times the size. So that means, the pixel at <code>(1,1)</code> becomes the pixel at <code>(10,10)</code> - and then the next pixel, <code>(1,2)</code> becomes the pixel <code>(10,20)</code> in the new image. But if you keep doing this, you will have no values for a pixel, <code>(13,13)</code> because, <code>(1.3,1.3)</code> is not defined in your original image and you will have a bunch of holes in your new image - you'll have to interpolate for that value using the four pixels around it in the new image, i.e. <code>(10,10) , (10,20), (20,10), (200,2)</code> - this is called bilinear interpolation. But here's another problem, suppose your transformation wasn't simple scaling and was affine (like the sample image you've posted)- then <code>(1,1)</code> would become something like <code>(2.34,4.21)</code> and then you'd have to round them in the output image to <code>(2,4)</code> and then you'd have to do bilinear interpolation on the new image to fill in the holes or more complicated interpolation - messy right? Now, there's no way to get out of interpolation, but we can get away with doing bilinear interpolation, just once. How? Simple, inverse mapping. Instead of looking at it as the source image going to the new image, think of where the data for the new image will come from in the source image! So, <code>(1,1)</code> in the new image will come from some reverse mapping in the source image, say, <code>(3.4, 2.1)</code> and then do bilinear interpolation on the source image to figure out the corresponding value! <h3>Transformation matrix</h3> Ok, so how do you define a transformation matrix for an affine transformation? This website tells you how to do it by compositing different transformation matrices for rotation, shearing, etc. <h3>Transformations:</h3> <img src="https://people.gnome.org/%7Emathieu/libart/art-affines.png" alt="alt text"> <h3>Compositing:</h3> <img src="https://people.gnome.org/%7Emathieu/libart/art-affine-matrix.png" alt="alt text"> The final matrix can be achieved by compositing each matrix in the order and you invert it to get the the inverse mapping - use this compute the positions of the pixels in the source image and interpolate.

Skewing an image using Perspective Transforms

2 Answers

A much better way to do this is by inverse mapping.

Essentially, you want to "warp" the image, right? Which means every pixel in the source image goes to a predefined point - the predefinition is a transformation matrix which tells you how to rotate, scale, translate, shear, etc. the image which is essentially taking some coordinate (x,y) on your image and saying that, "Ok, the new position for this pixel is (f(x),g(y)).

That's essentially what "warping" does.

Now, think about scaling an image ... say, to ten times the size. So that means, the pixel at (1,1) becomes the pixel at (10,10) - and then the next pixel, (1,2) becomes the pixel (10,20) in the new image. But if you keep doing this, you will have no values for a pixel, (13,13) because, (1.3,1.3) is not defined in your original image and you will have a bunch of holes in your new image - you'll have to interpolate for that value using the four pixels around it in the new image, i.e. (10,10) , (10,20), (20,10), (200,2) - this is called bilinear interpolation.

But here's another problem, suppose your transformation wasn't simple scaling and was affine (like the sample image you've posted)- then (1,1) would become something like (2.34,4.21) and then you'd have to round them in the output image to (2,4) and then you'd have to do bilinear interpolation on the new image to fill in the holes or more complicated interpolation - messy right?

Now, there's no way to get out of interpolation, but we can get away with doing bilinear interpolation, just once. How? Simple, inverse mapping.

Instead of looking at it as the source image going to the new image, think of where the data for the new image will come from in the source image! So, (1,1) in the new image will come from some reverse mapping in the source image, say, (3.4, 2.1) and then do bilinear interpolation on the source image to figure out the corresponding value!

Transformation matrix

Ok, so how do you define a transformation matrix for an affine transformation? This website tells you how to do it by compositing different transformation matrices for rotation, shearing, etc.

Transformations:

alt text

Compositing:

alt text

The final matrix can be achieved by compositing each matrix in the order and you invert it to get the the inverse mapping - use this compute the positions of the pixels in the source image and interpolate.

136

answered Sep 28 '22 04:09

Jacob

If you don't feel like re-inventing the wheel, check out the OpenCV library. It implements many useful image processing functions including perspective transformations. Check out the cvWarpPerspective which I've used to accomplish this task quite easily.

answered Sep 28 '22 04:09

jeff7

Related questions
                            
                                Full width image with fixed height
                            
                                Java - Image encoding in XML
                            
                                How to make image hover in css?
                            
                                IE6 - can't load a normal JPG
                            
                                Detect if specified url is an image in Android?
                            
                                How to Convert System.Drawing.Image to Byte Array?
                            
                                Easy way to display a continuously updating image in C/Linux
                            
                                CSS transparent background image using "data:"
                            
                                Flutter How to move file
                            
                                How to detect if the jpg jpeg image file is corrupted(incomplete)?
                            
                                How do I set a background image for a grouped table view?
                            
                                Magento images not showing on front end
                            
                                Android and Facebook: How to get picture of logged in User
                            
                                Resize bitmap image
                            
                                How to create image with rounded corners in C#?
                            
                                asp.net display image from byte array
                            
                                Using image sprites on android
                            
                                How to parse data-uri in python?
                            
                                Programmatically change drawableLeft of Button
                            
                                PHP Uploading files - image only checking

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Skewing an image using Perspective Transforms

Tags:

image

image-processing

computer-vision

transform

perspective

user293895

People also ask