PIL's <code>Image.transform</code> has a perspective-mode which requires an 8-tuple of data but I can't figure out how to convert let's say a right tilt of 30 degrees to that tuple. Can anyone explain it?

To apply a perspective transformation you first have to know four points in a plane A that will be mapped to four points in a plane B. With those points, you can derive the homographic transform. By doing this, you obtain your 8 coefficients and the transformation can take place. The site http://xenia.media.mit.edu/~cwren/interpolator/ (mirror: WebArchive), as well as many other texts, describes how those coefficients can be determined. To make things easy, here is a direct implementation according from the mentioned link: <pre class="prettyprint"><code>import numpy def find_coeffs(pa, pb): matrix = [] for p1, p2 in zip(pa, pb): matrix.append([p1[0], p1[1], 1, 0, 0, 0, -p2[0]*p1[0], -p2[0]*p1[1]]) matrix.append([0, 0, 0, p1[0], p1[1], 1, -p2[1]*p1[0], -p2[1]*p1[1]]) A = numpy.matrix(matrix, dtype=numpy.float) B = numpy.array(pb).reshape(8) res = numpy.dot(numpy.linalg.inv(A.T * A) * A.T, B) return numpy.array(res).reshape(8) </code></pre> where <code>pb</code> is the four vertices in the current plane, and <code>pa</code> contains four vertices in the resulting plane. So, suppose we transform an image as in: <pre class="prettyprint"><code>import sys from PIL import Image img = Image.open(sys.argv[1]) width, height = img.size m = -0.5 xshift = abs(m) * width new_width = width + int(round(xshift)) img = img.transform((new_width, height), Image.AFFINE, (1, m, -xshift if m > 0 else 0, 0, 1, 0), Image.BICUBIC) img.save(sys.argv[2]) </code></pre> Here is a sample input and output with the code above: <img src="https://i.stack.imgur.com/dHGcB.png" alt="enter image description here"><img src="https://i.stack.imgur.com/EOwht.png" alt="enter image description here"> We can continue on the last code and perform a perspective transformation to revert the shear: <pre class="prettyprint"><code>coeffs = find_coeffs( [(0, 0), (256, 0), (256, 256), (0, 256)], [(0, 0), (256, 0), (new_width, height), (xshift, height)]) img.transform((width, height), Image.PERSPECTIVE, coeffs, Image.BICUBIC).save(sys.argv[3]) </code></pre> Resulting in: <img src="https://i.stack.imgur.com/wY6iQ.png" alt="enter image description here"> You can also have some fun with the destination points: <img src="https://i.stack.imgur.com/GicNm.png" alt="enter image description here"><img src="https://i.stack.imgur.com/tYwvt.png" alt="enter image description here">

I'm going to hijack this question just a tiny bit because it's the only thing on Google pertaining to perspective transformations in Python. Here is some slightly more general code based on the above which creates a perspective transform matrix and generates a function which will run that transform on arbitrary points: <pre class="prettyprint"><code>import numpy as np def create_perspective_transform_matrix(src, dst): """ Creates a perspective transformation matrix which transforms points in quadrilateral ``src`` to the corresponding points on quadrilateral ``dst``. Will raise a ``np.linalg.LinAlgError`` on invalid input. """ # See: # * http://xenia.media.mit.edu/~cwren/interpolator/ # * http://stackoverflow.com/a/14178717/71522 in_matrix = [] for (x, y), (X, Y) in zip(src, dst): in_matrix.extend([ [x, y, 1, 0, 0, 0, -X * x, -X * y], [0, 0, 0, x, y, 1, -Y * x, -Y * y], ]) A = np.matrix(in_matrix, dtype=np.float) B = np.array(dst).reshape(8) af = np.dot(np.linalg.inv(A.T * A) * A.T, B) return np.append(np.array(af).reshape(8), 1).reshape((3, 3)) def create_perspective_transform(src, dst, round=False, splat_args=False): """ Returns a function which will transform points in quadrilateral ``src`` to the corresponding points on quadrilateral ``dst``:: >>> transform = create_perspective_transform( ... [(0, 0), (10, 0), (10, 10), (0, 10)], ... [(50, 50), (100, 50), (100, 100), (50, 100)], ... ) >>> transform((5, 5)) (74.99999999999639, 74.999999999999957) If ``round`` is ``True`` then points will be rounded to the nearest integer and integer values will be returned. >>> transform = create_perspective_transform( ... [(0, 0), (10, 0), (10, 10), (0, 10)], ... [(50, 50), (100, 50), (100, 100), (50, 100)], ... round=True, ... ) >>> transform((5, 5)) (75, 75) If ``splat_args`` is ``True`` the function will accept two arguments instead of a tuple. >>> transform = create_perspective_transform( ... [(0, 0), (10, 0), (10, 10), (0, 10)], ... [(50, 50), (100, 50), (100, 100), (50, 100)], ... splat_args=True, ... ) >>> transform(5, 5) (74.99999999999639, 74.999999999999957) If the input values yield an invalid transformation matrix an identity function will be returned and the ``error`` attribute will be set to a description of the error:: >>> tranform = create_perspective_transform( ... np.zeros((4, 2)), ... np.zeros((4, 2)), ... ) >>> transform((5, 5)) (5.0, 5.0) >>> transform.error 'invalid input quads (...): Singular matrix """ try: transform_matrix = create_perspective_transform_matrix(src, dst) error = None except np.linalg.LinAlgError as e: transform_matrix = np.identity(3, dtype=np.float) error = "invalid input quads (%s and %s): %s" %(src, dst, e) error = error.replace("\n", "") to_eval = "def perspective_transform(%s):\n" %( splat_args and "*pt" or "pt", ) to_eval += " res = np.dot(transform_matrix, ((pt[0], ), (pt[1], ), (1, )))\n" to_eval += " res = res / res[2]\n" if round: to_eval += " return (int(round(res[0][0])), int(round(res[1][0])))\n" else: to_eval += " return (res[0][0], res[1][0])\n" locals = { "transform_matrix": transform_matrix, } locals.update(globals()) exec to_eval in locals, locals res = locals["perspective_transform"] res.matrix = transform_matrix res.error = error return res </code></pre>

How does perspective transformation work in PIL?

2 Answers

To apply a perspective transformation you first have to know four points in a plane A that will be mapped to four points in a plane B. With those points, you can derive the homographic transform. By doing this, you obtain your 8 coefficients and the transformation can take place.

The site http://xenia.media.mit.edu/~cwren/interpolator/ (mirror: WebArchive), as well as many other texts, describes how those coefficients can be determined. To make things easy, here is a direct implementation according from the mentioned link:

import numpy  def find_coeffs(pa, pb):     matrix = []     for p1, p2 in zip(pa, pb):         matrix.append([p1[0], p1[1], 1, 0, 0, 0, -p2[0]*p1[0], -p2[0]*p1[1]])         matrix.append([0, 0, 0, p1[0], p1[1], 1, -p2[1]*p1[0], -p2[1]*p1[1]])      A = numpy.matrix(matrix, dtype=numpy.float)     B = numpy.array(pb).reshape(8)      res = numpy.dot(numpy.linalg.inv(A.T * A) * A.T, B)     return numpy.array(res).reshape(8)

where pb is the four vertices in the current plane, and pa contains four vertices in the resulting plane.

So, suppose we transform an image as in:

import sys from PIL import Image  img = Image.open(sys.argv[1]) width, height = img.size m = -0.5 xshift = abs(m) * width new_width = width + int(round(xshift)) img = img.transform((new_width, height), Image.AFFINE,         (1, m, -xshift if m > 0 else 0, 0, 1, 0), Image.BICUBIC) img.save(sys.argv[2])

Here is a sample input and output with the code above:

enter image description here

We can continue on the last code and perform a perspective transformation to revert the shear:

coeffs = find_coeffs(         [(0, 0), (256, 0), (256, 256), (0, 256)],         [(0, 0), (256, 0), (new_width, height), (xshift, height)])  img.transform((width, height), Image.PERSPECTIVE, coeffs,         Image.BICUBIC).save(sys.argv[3])

Resulting in:

enter image description here

You can also have some fun with the destination points:

enter image description here

179

answered Sep 19 '22 03:09

mmgp

I'm going to hijack this question just a tiny bit because it's the only thing on Google pertaining to perspective transformations in Python. Here is some slightly more general code based on the above which creates a perspective transform matrix and generates a function which will run that transform on arbitrary points:

import numpy as np  def create_perspective_transform_matrix(src, dst):     """ Creates a perspective transformation matrix which transforms points         in quadrilateral ``src`` to the corresponding points on quadrilateral         ``dst``.          Will raise a ``np.linalg.LinAlgError`` on invalid input.         """     # See:     # * http://xenia.media.mit.edu/~cwren/interpolator/     # * http://stackoverflow.com/a/14178717/71522     in_matrix = []     for (x, y), (X, Y) in zip(src, dst):         in_matrix.extend([             [x, y, 1, 0, 0, 0, -X * x, -X * y],             [0, 0, 0, x, y, 1, -Y * x, -Y * y],         ])      A = np.matrix(in_matrix, dtype=np.float)     B = np.array(dst).reshape(8)     af = np.dot(np.linalg.inv(A.T * A) * A.T, B)     return np.append(np.array(af).reshape(8), 1).reshape((3, 3))   def create_perspective_transform(src, dst, round=False, splat_args=False):     """ Returns a function which will transform points in quadrilateral         ``src`` to the corresponding points on quadrilateral ``dst``::              >>> transform = create_perspective_transform(             ...     [(0, 0), (10, 0), (10, 10), (0, 10)],             ...     [(50, 50), (100, 50), (100, 100), (50, 100)],             ... )             >>> transform((5, 5))             (74.99999999999639, 74.999999999999957)          If ``round`` is ``True`` then points will be rounded to the nearest         integer and integer values will be returned.              >>> transform = create_perspective_transform(             ...     [(0, 0), (10, 0), (10, 10), (0, 10)],             ...     [(50, 50), (100, 50), (100, 100), (50, 100)],             ...     round=True,             ... )             >>> transform((5, 5))             (75, 75)          If ``splat_args`` is ``True`` the function will accept two arguments         instead of a tuple.              >>> transform = create_perspective_transform(             ...     [(0, 0), (10, 0), (10, 10), (0, 10)],             ...     [(50, 50), (100, 50), (100, 100), (50, 100)],             ...     splat_args=True,             ... )             >>> transform(5, 5)             (74.99999999999639, 74.999999999999957)          If the input values yield an invalid transformation matrix an identity         function will be returned and the ``error`` attribute will be set to a         description of the error::              >>> tranform = create_perspective_transform(             ...     np.zeros((4, 2)),             ...     np.zeros((4, 2)),             ... )             >>> transform((5, 5))             (5.0, 5.0)             >>> transform.error             'invalid input quads (...): Singular matrix         """     try:         transform_matrix = create_perspective_transform_matrix(src, dst)         error = None     except np.linalg.LinAlgError as e:         transform_matrix = np.identity(3, dtype=np.float)         error = "invalid input quads (%s and %s): %s" %(src, dst, e)         error = error.replace("\n", "")      to_eval = "def perspective_transform(%s):\n" %(         splat_args and "*pt" or "pt",     )     to_eval += "  res = np.dot(transform_matrix, ((pt[0], ), (pt[1], ), (1, )))\n"     to_eval += "  res = res / res[2]\n"     if round:         to_eval += "  return (int(round(res[0][0])), int(round(res[1][0])))\n"     else:         to_eval += "  return (res[0][0], res[1][0])\n"     locals = {         "transform_matrix": transform_matrix,     }     locals.update(globals())     exec to_eval in locals, locals     res = locals["perspective_transform"]     res.matrix = transform_matrix     res.error = error     return res

answered Sep 23 '22 03:09

David Wolever

Related questions
                            
                                Asynchronous context manager
                            
                                Using subprocess to run Python script on Windows
                            
                                Django edit form based on add form?
                            
                                How to import from config file in Flask?
                            
                                How to concatenate element-wise two lists in Python
                            
                                Python readlines() usage and efficient practice for reading
                            
                                Python 3 Get and parse JSON API
                            
                                Anaconda version with Python 3.5
                            
                                connect to a DB using psycopg2 without password
                            
                                Why does Python installed via Homebrew not include Tkinter
                            
                                Set specific DNS server using dns.resolver (pythondns)
                            
                                Range with step of type float [duplicate]
                            
                                range in jinja2 inside a for loop
                            
                                python + SQLAlchemy: deleting with the Session object
                            
                                Boolean Indexing with multiple conditions [duplicate]
                            
                                Filtering a Pyspark DataFrame with SQL-like IN clause
                            
                                Why does integer division round down in many scripting languages?
                            
                                Python k-means algorithm
                            
                                How to get pdf filename with Python requests?
                            
                                Extracting the first day of month of a datetime type column in pandas

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does perspective transformation work in PIL?

Tags:

python

python-imaging-library

perspective

Hedge

People also ask

2 Answers

mmgp

David Wolever

Recent Activity

Donate For Us