I have seen a couple of codes using <code>numpy.apply_along_axis</code> and I always have to test the codes to see how this works 'cause I didn't understand the <code>axis</code> idea in Python yet. For example, I tested this simple codes from the reference. I can see that for the first case it was took the first column of each row of the matrix, and in the second case, the row itself was considered. So I build an example to test how this works with an array of matrices (the problem that took me to this axis question), which can also be seen as a 3d matrix, where each row is a matrix, right? <pre class="prettyprint"><code>a = [[[1,2,3],[2,3,4]],[[4,5,6],[9,8,7]]] import numpy data = numpy.array([b for b in a]) def my_func(x): return (x[0] + x[-1]) * 0.5 b = numpy.apply_along_axis(my_func, 0, data) b = numpy.apply_along_axis(my_func, 1, data) </code></pre> Which gave me: <pre class="prettyprint"><code>array([[ 2.5, 3.5, 4.5], [ 5.5, 5.5, 5.5]]) </code></pre> And: <pre class="prettyprint"><code>array([[ 1.5, 2.5, 3.5], [ 6.5, 6.5, 6.5]]) </code></pre> For the first result I got what I expected. But for the second one, I though I would receive: <pre class="prettyprint"><code>array([[ 2., 3.], [ 5., 8.]]) </code></pre> Then I though that maybe should be an <code>axis=2</code> and I got the previous result testing it. So, I'm wondering how this works to work it properly. Thank you.

First, <code>data=numpy.array(a)</code> is already enough, no need to use <code>numpy.array([b for b in a])</code>. <code>data</code> is now a 3D <code>ndarray</code> with the shape <code>(2,2,3)</code>, and has 3 axes <code>0, 1, 2</code>. The first axis has a length of 2, the second axis's length is also 2 and the third axis's length is 3. Therefore both <code>numpy.apply_along_axis(my_func, 0, data)</code> and <code>numpy.apply_along_axis(my_func, 1, data)</code> will result in a 2D array of shape <code>(2,3)</code>. In both cases the shape is <code>(2,3)</code>, those of the remaining axes, 2nd and 3rd or 1st and 3rd. <code>numpy.apply_along_axis(my_func, 2, data)</code> returns the <code>(2,2)</code> shape array you showed, where <code>(2,2)</code> is the shape of the first 2 axes, as you <code>apply</code> along the 3rd axis (by giving index <code>2</code>). The way to understand it is whichever axis you apply along will be 'collapsed' into the shape of your <code>my_func</code>, which in this case returns a single value. The order and shape of the remaining axis will remain unchanged. The alternative way to think of it is: <code>apply_along_axis</code> means apply that function to the values on that axis, for each combination of the remaining axis/axes. Fetch the result, and organize them back into the shape of the remaining axis/axes. So, if <code>my_func</code> returns a <code>tuple</code> of 4 values: <pre class="prettyprint"><code>def my_func(x): return (x[0] + x[-1]) * 2,1,1,1 </code></pre> we will expect <code>numpy.apply_along_axis(my_func, 0, data).shape</code> to be <code>(4,2,3)</code>. <ul> <li>See also <code>numpy.apply_over_axes</code> for applying a function repeatedly over multiple axes</li> </ul>

Let there be an <code>array</code> of <code>shape (2,2,3)</code>. It can be seen that <code>axis 0</code>, <code>axis 1</code>, <code>axis 2</code> has 2 ,2, 3 data values respectively. These are the indexes of the elements of the array <pre class="prettyprint"><code>[ [ [(0,0,0) (0,0,1), (0,0,2)], [(0,1,0) (0,1,1), (0,1,2)] ], [ [(1,0,0) (1,0,1), (1,0,2)], [(1,1,0) (1,1,1), (1,1,2)] ] ] </code></pre> Now if you apply some operation along some axis, then vary the indexes along this axis only keeping the indices along the two other axis constant. Example: If we apply some operation F along <code>axis 0</code>, then the elements of the result would be <pre class="prettyprint"><code>[ [F((0,0,0),(1,0,0)), F((0,0,1),(1,0,1)), F((0,0,2),(1,0,2))], [F((0,1,0),(1,1,0)), F((0,1,1),(1,1,1)), F((0,1,2),(1,1,2))] ] </code></pre> Along <code>axis 1</code>: <pre class="prettyprint"><code>[ [F((0,0,0),(0,1,0)), F((0,0,1),(0,1,1)), F((0,0,2),(0,1,2))], [F((0,1,0),(1,1,0)), F((0,1,1),(1,1,1)), F((0,1,2),(1,1,2))] ] </code></pre> Along <code>axis 2</code>: <pre class="prettyprint"><code>[ [F((0,0,0),(0,0,1),(0,0,2)), F((0,1,0),(0,1,1),(0,1,2))], [F((1,0,0),(1,0,1),(1,0,2)), F((1,1,0),(1,1,1),(1,1,2))] ] </code></pre> Also the shape of the resulting array can be inferred by omitting the given axis in the shape of given data.

Understanding axis in Python

Tags:

python

matrix

numpy

axis

I have seen a couple of codes using numpy.apply_along_axis and I always have to test the codes to see how this works 'cause I didn't understand the axis idea in Python yet.

For example, I tested this simple codes from the reference.

I can see that for the first case it was took the first column of each row of the matrix, and in the second case, the row itself was considered.

So I build an example to test how this works with an array of matrices (the problem that took me to this axis question), which can also be seen as a 3d matrix, where each row is a matrix, right?

Click to copy

a = [[[1,2,3],[2,3,4]],[[4,5,6],[9,8,7]]]

import numpy
data = numpy.array([b for b in a])

def my_func(x):
    return (x[0] + x[-1]) * 0.5

b = numpy.apply_along_axis(my_func, 0, data)
b = numpy.apply_along_axis(my_func, 1, data)

Which gave me:

Click to copy

array([[ 2.5,  3.5,  4.5],
       [ 5.5,  5.5,  5.5]])

And:

Click to copy

array([[ 1.5,  2.5,  3.5],
       [ 6.5,  6.5,  6.5]])

For the first result I got what I expected. But for the second one, I though I would receive:

Click to copy

array([[ 2.,  3.],
       [ 5.,  8.]])

Then I though that maybe should be an axis=2 and I got the previous result testing it. So, I'm wondering how this works to work it properly.

Thank you.

713

asked Apr 28 '14 18:04

pceccon

2 Answers

First, data=numpy.array(a) is already enough, no need to use numpy.array([b for b in a]).

data is now a 3D ndarray with the shape (2,2,3), and has 3 axes 0, 1, 2. The first axis has a length of 2, the second axis's length is also 2 and the third axis's length is 3.

Therefore both numpy.apply_along_axis(my_func, 0, data) and numpy.apply_along_axis(my_func, 1, data) will result in a 2D array of shape (2,3). In both cases the shape is (2,3), those of the remaining axes, 2nd and 3rd or 1st and 3rd.

numpy.apply_along_axis(my_func, 2, data) returns the (2,2) shape array you showed, where (2,2) is the shape of the first 2 axes, as you apply along the 3rd axis (by giving index 2).

The way to understand it is whichever axis you apply along will be 'collapsed' into the shape of your my_func, which in this case returns a single value. The order and shape of the remaining axis will remain unchanged.

The alternative way to think of it is: apply_along_axis means apply that function to the values on that axis, for each combination of the remaining axis/axes. Fetch the result, and organize them back into the shape of the remaining axis/axes. So, if my_func returns a tuple of 4 values:

Click to copy

def my_func(x):
    return (x[0] + x[-1]) * 2,1,1,1

we will expect numpy.apply_along_axis(my_func, 0, data).shape to be (4,2,3).

See also numpy.apply_over_axes for applying a function repeatedly over multiple axes

134

answered Sep 17 '22 23:09

CT Zhu

Let there be an array of shape (2,2,3). It can be seen that axis 0, axis 1, axis 2 has 2 ,2, 3 data values respectively.

These are the indexes of the elements of the array

Click to copy

[
    [
        [(0,0,0) (0,0,1), (0,0,2)],
        [(0,1,0) (0,1,1), (0,1,2)]
    ],
    [
        [(1,0,0) (1,0,1), (1,0,2)],
        [(1,1,0) (1,1,1), (1,1,2)]
    ]
]

Now if you apply some operation along some axis, then vary the indexes along this axis only keeping the indices along the two other axis constant.

Example: If we apply some operation F along axis 0, then the elements of the result would be

Click to copy

[
    [F((0,0,0),(1,0,0)), F((0,0,1),(1,0,1)), F((0,0,2),(1,0,2))],
    [F((0,1,0),(1,1,0)), F((0,1,1),(1,1,1)), F((0,1,2),(1,1,2))]
]

Along axis 1:

Click to copy

[
    [F((0,0,0),(0,1,0)), F((0,0,1),(0,1,1)), F((0,0,2),(0,1,2))],
    [F((0,1,0),(1,1,0)), F((0,1,1),(1,1,1)), F((0,1,2),(1,1,2))]
]

Along axis 2:

Click to copy

[
    [F((0,0,0),(0,0,1),(0,0,2)), F((0,1,0),(0,1,1),(0,1,2))],
    [F((1,0,0),(1,0,1),(1,0,2)), F((1,1,0),(1,1,1),(1,1,2))]
]

Also the shape of the resulting array can be inferred by omitting the given axis in the shape of given data.

answered Sep 17 '22 23:09

Gaurav Gupta

Related questions
                            
                                KeyError: 0 using multiprocessing in python
                            
                                Force selenium to use the portable firefox application
                            
                                Controller classes in Flask
                            
                                Check for binary content with Python requests library
                            
                                Can I use one route for multiple functions?
                            
                                How to get pip to point to newer version of Python
                            
                                Getting task by name from taskqueue
                            
                                Saving many arrays of different lengths
                            
                                Python mixin to extend class property
                            
                                Go subprocess communication
                            
                                Porting pyMC2 Bayesian A/B testing example to pyMC3
                            
                                Listing attributes of namedtuple subclass
                            
                                Tkinter canvas resizing automatically
                            
                                Why is PyQt executing my actions three times?
                            
                                Comparing pandas.Series for equality when they are in different orders
                            
                                Animating pngs in matplotlib using ArtistAnimation
                            
                                Python 3.4 asyncio task doesn't get fully executed
                            
                                Wrapping a LAPACKE function using Cython
                            
                                How to get a list of most popular pages from Google Analytics in Python (Django)?
                            
                                With PyQt, what is the preferred (efficient) method for monitoring window size and adjusting layouts?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Understanding axis in Python

Tags:

python

matrix

numpy

axis

pceccon

People also ask

2 Answers

CT Zhu

Gaurav Gupta

Recent Activity

Donate For Us