For a class, I need to transform RGB image into YIQ. We have been told that the conversion can be made by: <img src="https://i.stack.imgur.com/R22jh.png" alt="transforming rgb to yiq"> I started to write a messy code with loops to have the matrix multiplication and then I found out a function <pre class="prettyprint"><code> skimage.color.yiq2rgb(imYIQ) </code></pre> and when I looked inside to see what they were doing I saw the following (I'm copying stuff so it will be more clear): <pre class="prettyprint"><code>yiq_from_rgb = yiq_from_rgb = np.array([[0.299, 0.587, 0.114], [0.59590059, -0.27455667, -0.32134392], [0.21153661, -0.52273617, 0.31119955]]) return np.dot(arr, yiq_from_rgb.T.copy()) </code></pre> when <code>arr</code> is just the RGB pic as a matrix I'm trying to understand why this works? why do they take the Transpose matrix? (.T) And how exactly does the dot product work when the <code>arr</code> shape is different than the yiq_from_rgb?

In your reference figure containing the matrix for the conversion, the transformation matrix is on the left of the RGB channels. So, for the first pixel in your RGB image, let's call it <code>(p1r, p1g, p1b)</code> corresponding to R, G, B channels respectively, we need to multiply with the transformation matrix and sum the results like: <pre class="prettyprint"><code>y1y = (0.299*p1r + 0.587*p1g + 0.114*p1b) y1i = (0.596*p1r - 0.275*p1g - 0.321*p1b) y1q = (0.212*p1r - 0.523*p1g + 0.311*p1b) </code></pre> where <code>(y1y,y1i,y1q)</code> is the value for the first pixel in the resulting YIQ image, after rounding/taking <code>int</code>. We do the same kind of multiplication for all the pixels in the whole RGB image and obtain the desired YIQ image. Now, since they do this whole implementation using <code>np.dot(arr, yiq_from_rgb.T)</code>, to have the weighting work out correctly the transformation matrix needs to be transposed. And <code>copy</code> is just to have a dedicated of the transposed transformation matrix for the purpose of this conversion. Also, notice that contrary to your figure, in <code>np.dot()</code> the RGB array is on the left of transformation matrix.

numpy transforming RGB image to YIQ color space

Tags:

python

multidimensional-array

image-processing

matrix

numpy

For a class, I need to transform RGB image into YIQ. We have been told that the conversion can be made by:

transforming rgb to yiq

I started to write a messy code with loops to have the matrix multiplication and then I found out a function

 skimage.color.yiq2rgb(imYIQ)

and when I looked inside to see what they were doing I saw the following (I'm copying stuff so it will be more clear):

yiq_from_rgb = yiq_from_rgb = np.array([[0.299,      0.587,        0.114],
                                 [0.59590059, -0.27455667, -0.32134392],
                                 [0.21153661, -0.52273617, 0.31119955]])
return np.dot(arr, yiq_from_rgb.T.copy())

when arr is just the RGB pic as a matrix

I'm trying to understand why this works? why do they take the Transpose matrix? (.T) And how exactly does the dot product work when the arr shape is different than the yiq_from_rgb?

576

asked Oct 28 '17 14:10

Dvir Itzko

1 Answers

In your reference figure containing the matrix for the conversion, the transformation matrix is on the left of the RGB channels. So, for the first pixel in your RGB image, let's call it (p1r, p1g, p1b) corresponding to R, G, B channels respectively, we need to multiply with the transformation matrix and sum the results like:

y1y = (0.299*p1r + 0.587*p1g + 0.114*p1b)
y1i = (0.596*p1r - 0.275*p1g - 0.321*p1b)
y1q = (0.212*p1r - 0.523*p1g + 0.311*p1b)

where (y1y,y1i,y1q) is the value for the first pixel in the resulting YIQ image, after rounding/taking int. We do the same kind of multiplication for all the pixels in the whole RGB image and obtain the desired YIQ image.

Now, since they do this whole implementation using np.dot(arr, yiq_from_rgb.T), to have the weighting work out correctly the transformation matrix needs to be transposed. And copy is just to have a dedicated of the transposed transformation matrix for the purpose of this conversion.

Also, notice that contrary to your figure, in np.dot() the RGB array is on the left of transformation matrix.

108

answered Oct 05 '22 03:10

kmario23

Related questions
                            
                                Run Makefile on pip install
                            
                                Pandas DataFrame eval with space in column names [duplicate]
                            
                                Dynamodb get_item and put_item without data types in python
                            
                                Python: Find running median with Max-Heap and Min-Heap
                            
                                legacy_init_op in TensorFlow Serving
                            
                                Multidimensional Input to Keras
                            
                                Does using print() too much cause it to fail?
                            
                                SnowballStemmer for Russian words list
                            
                                Flask-Migrate hangs on table modificiation
                            
                                How to plot date data evenly along x-axis?
                            
                                How to implement readinto() method
                            
                                pandas- changing the start and end date of resampled timeseries
                            
                                Replacement of dict type for numba as parameters of a python function
                            
                                wxPython installation on ubuntu 16.04 taking very long time
                            
                                Why does localhost:5000 not work in Flask?
                            
                                Pandas Select last 20 days of data.
                            
                                tf.GraphKeys.TRAINABLE_VARIABLES on output_graph.pb resulting in empty list
                            
                                Very large numpy array doesn't throw memory error. Where does it live? [duplicate]
                            
                                TypeError: must be str, not list in Python 3 when calling a function
                            
                                No module error in Python 3.6 with Click library

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With