I have a three dimensional numpy array of images (CIFAR-10 dataset). The image array shape is like below: <pre class="prettyprint"><code>a = np.random.rand(32, 32, 3) </code></pre> Before I do any deep learning, I want to normalize the data to get better result. With a 1D array, I know we can do min max normalization like this: <pre class="prettyprint"><code>v = np.random.rand(6) (v - v.min())/(v.max() - v.min()) Out[68]: array([ 0.89502294, 0. , 1. , 0.65069468, 0.63657915, 0.08932196]) </code></pre> However, when it comes to a 3D array, I am totally lost. Specifically, I have the following questions: <ol> <li>Along which axis do we take the min and max?</li> <li>How do we implement this with the 3D array?</li> </ol> I appreciate your help! <hr> EDIT: It turns out I need to work with a 4D Numpy array with shape <code>(202, 32, 32, 3)</code>, so the first dimension would be the index for the image, and the last 3 dimensions are the actual image. It'll be great if someone can provide me with the code to normalize such a 4D array. Thanks! <hr> EDIT 2: Thanks to @Eric's code below, I've figured it out: <pre class="prettyprint"><code>x_min = x.min(axis=(1, 2), keepdims=True) x_max = x.max(axis=(1, 2), keepdims=True) x = (x - x_min)/(x_max-x_min) </code></pre>

Assuming you're working with image data of shape <code>(W, H, 3)</code>, you should probably normalize over each channel (<code>axis=2</code>) separately, as mentioned in the other answer. You can do this with: <pre class="prettyprint"><code># keepdims makes the result shape (1, 1, 3) instead of (3,). This doesn't matter here, but # would matter if you wanted to normalize over a different axis. v_min = v.min(axis=(0, 1), keepdims=True) v_max = v.max(axis=(0, 1), keepdims=True) (v - v_min)/(v_max - v_min) </code></pre>

How to normalize a 4D numpy array?

Tags:

python

arrays

numpy

deep-learning

I have a three dimensional numpy array of images (CIFAR-10 dataset). The image array shape is like below:

a = np.random.rand(32, 32, 3)

Before I do any deep learning, I want to normalize the data to get better result. With a 1D array, I know we can do min max normalization like this:

v = np.random.rand(6)
(v - v.min())/(v.max() - v.min())

Out[68]:
array([ 0.89502294,  0.        ,  1.        ,  0.65069468,  0.63657915,
        0.08932196])

However, when it comes to a 3D array, I am totally lost. Specifically, I have the following questions:

Along which axis do we take the min and max?
How do we implement this with the 3D array?

I appreciate your help!

EDIT: It turns out I need to work with a 4D Numpy array with shape (202, 32, 32, 3), so the first dimension would be the index for the image, and the last 3 dimensions are the actual image. It'll be great if someone can provide me with the code to normalize such a 4D array. Thanks!

EDIT 2: Thanks to @Eric's code below, I've figured it out:

x_min = x.min(axis=(1, 2), keepdims=True)
x_max = x.max(axis=(1, 2), keepdims=True)

x = (x - x_min)/(x_max-x_min)

562

asked Feb 25 '17 18:02

George Liu

1 Answers

Assuming you're working with image data of shape (W, H, 3), you should probably normalize over each channel (axis=2) separately, as mentioned in the other answer.

You can do this with:

# keepdims makes the result shape (1, 1, 3) instead of (3,). This doesn't matter here, but
# would matter if you wanted to normalize over a different axis.
v_min = v.min(axis=(0, 1), keepdims=True)
v_max = v.max(axis=(0, 1), keepdims=True)
(v - v_min)/(v_max - v_min)

101

answered Sep 18 '22 08:09

Eric

Related questions
                            
                                cannot use current_user in jinja2 macro?
                            
                                How to concatenate videos in moviepy?
                            
                                sklearn dumping model using joblib, dumps multiple files. Which one is the correct model?
                            
                                Writing a parallel loop
                            
                                Checking whether function has been called multiple times with different parameters
                            
                                Share Python code when handling multiple exceptions
                            
                                Subtract every column in dataframe with the mean of that column with Python
                            
                                Change size/alpha of markers in the legend box of matplotlib
                            
                                To_CSV unique values of a pandas column [duplicate]
                            
                                Best way to handle a keyerror in a dict
                            
                                Python - NameError: name itemgetter not defined
                            
                                How to decide the size of layers in Keras' Dense method?
                            
                                BeautifulSoup extract top-level tags only [duplicate]
                            
                                hackerrank new year chaos code optimization
                            
                                What does sys.exit really do with multiple threads?
                            
                                time complexity of random access in deque in Python [duplicate]
                            
                                Using pip on Windows installed with both python 2.7 and 3.5
                            
                                Python kernel dies for second run of PyQt5 GUI
                            
                                Use Numpy to convert rgb pixel array into grayscale [duplicate]
                            
                                Single instance of class in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With