Remove mean from numpy matrix

Tags:

I have a numpy matrix A where the data is organised column-vector-vise i.e A[:,0] is the first data vector, A[:,1] is the second and so on. I wanted to know whether there was a more elegant way to zero out the mean from this data. I am currently doing it via a for loop:

Click to copy

mean=A.mean(axis=1)
for k in range(A.shape[1]):
    A[:,k]=A[:,k]-mean

So does numpy provide a function to do this? Or can it be done more efficiently another way?

855

asked Dec 07 '11 21:12

pratikm

4 Answers

As is typical, you can do this a number of ways. Each of the approaches below works by adding a dimension to the mean vector, making it a 4 x 1 array, and then NumPy's broadcasting takes care of the rest. Each approach creates a view of mean, rather than a deep copy. The first approach (i.e., using newaxis) is likely preferred by most, but the other methods are included for the record.

In addition to the approaches below, see also ovgolovin's answer, which uses a NumPy matrix to avoid the need to reshape mean altogether.

For the methods below, we start with the following code and example array A.

Click to copy

import numpy as np

A = np.array([[1,2,3], [4,5,6], [7,8,9], [10, 11, 12]])
mean = A.mean(axis=1)

Using `numpy.newaxis`

Click to copy

>>> A - mean[:, np.newaxis]
array([[-1.,  0.,  1.],
       [-1.,  0.,  1.],
       [-1.,  0.,  1.],
       [-1.,  0.,  1.]])

Using `None`

The documentation states that None can be used instead of newaxis. This is because

Click to copy

>>> np.newaxis is None
True

Therefore, the following accomplishes the task.

Click to copy

>>> A - mean[:, None]
array([[-1.,  0.,  1.],
       [-1.,  0.,  1.],
       [-1.,  0.,  1.],
       [-1.,  0.,  1.]])

That said, newaxis is clearer and should be preferred. Also, a case can be made that newaxis is more future proof. See also: Numpy: Should I use newaxis or None?

Using `ndarray.reshape`

Click to copy

>>> A - mean.reshape((mean.shape[0]), 1)
array([[-1.,  0.,  1.],
       [-1.,  0.,  1.],
       [-1.,  0.,  1.],
       [-1.,  0.,  1.]])

Changing `ndarray.shape` directly

You can alternatively change the shape of mean directly.

Click to copy

>>> mean.shape = (mean.shape[0], 1)
>>> A - mean
array([[-1.,  0.,  1.],
       [-1.,  0.,  1.],
       [-1.,  0.,  1.],
       [-1.,  0.,  1.]])

100

answered Oct 19 '22 18:10

David Alber

You can also use matrix instead of array. Then you won't need to reshape:

Click to copy

>>> A = np.matrix([[1,2,3], [4,5,6], [7,8,9], [10, 11, 12]])
>>> m = A.mean(axis=1)
>>> A - m
matrix([[-1.,  0.,  1.],
        [-1.,  0.,  1.],
        [-1.,  0.,  1.],
        [-1.,  0.,  1.]])

answered Oct 19 '22 18:10

ovgolovin

Yes. pylab.demean:

Click to copy

In [1]: X = scipy.rand(2,3)

In [2]: X.mean(axis=1)
Out[2]: array([ 0.42654669,  0.65216704])

In [3]: Y = pylab.demean(X, axis=1)

In [4]: Y.mean(axis=1)
Out[4]: array([  1.85037171e-17,   0.00000000e+00])

Source:

Click to copy

In [5]: pylab.demean??
Type:           function
Base Class:     <type 'function'>
String Form:    <function demean at 0x38492a8>
Namespace:      Interactive
File:           /usr/lib/pymodules/python2.7/matplotlib/mlab.py
Definition:     pylab.demean(x, axis=0)
Source:
def demean(x, axis=0):
    "Return x minus its mean along the specified axis"
    x = np.asarray(x)
    if axis == 0 or axis is None or x.ndim <= 1:
        return x - x.mean(axis)
    ind = [slice(None)] * x.ndim
    ind[axis] = np.newaxis
    return x - x.mean(axis)[ind]

answered Oct 19 '22 18:10

Steve Tjoa

Looks like some of these answers are pretty old, I just tested this on numpy 1.13.3:

Click to copy

>>> import numpy as np
>>> a = np.array([[1,1,3],[1,0,4],[1,2,2]])
>>> a
array([[1, 1, 3],
       [1, 0, 4],
       [1, 2, 2]])
>>> a = a - a.mean(axis=0)
>>> a
array([[ 0.,  0.,  0.],
       [ 0., -1.,  1.],
       [ 0.,  1., -1.]])

I think this is much cleaner and simpler. Have a try and let me know if this is somehow inferior than the other answers.

answered Oct 19 '22 18:10

nnaj20

Related questions
                            
                                Java Delay/Wait
                            
                                Wrapping calls to method on a class with a standard try/catch
                            
                                Copying raw file into SDCard?
                            
                                Persisting SCSS variables in rails asset pipeline?
                            
                                Is it possible to override the required attribute on a property in a model?
                            
                                Why was the constructor interface of std::vector changed with C++11?
                            
                                How do I change Eclipse to use spaces instead of tabs in Javascript editor?
                            
                                Render Highcharts canvas as a PNG on the page
                            
                                How to "validate" a NSTimer after invalidating it?
                            
                                Resize text size of a label when the text gets longer than the label size?
                            
                                why i cannot use the auto keyword in the last version of gcc
                            
                                Alter SQL table - allow NULL column value

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Remove mean from numpy matrix

Tags:

python

numpy

pratikm

People also ask

4 Answers

Using `numpy.newaxis`

Using `None`

Using `ndarray.reshape`

Changing `ndarray.shape` directly

David Alber

ovgolovin

Steve Tjoa

nnaj20

Recent Activity

Donate For Us

Remove mean from numpy matrix

Tags:

python

numpy

pratikm

People also ask

4 Answers

Using numpy.newaxis

Using None

Using ndarray.reshape

Changing ndarray.shape directly

David Alber

ovgolovin

Steve Tjoa

nnaj20

Related questions

Recent Activity

Donate For Us

Using `numpy.newaxis`

Using `None`

Using `ndarray.reshape`

Changing `ndarray.shape` directly