I am learning Python, and have encountered <code>numpy.sum</code>. It has an optional parameter <code>axis</code>. This parameter is used to get either column-wise summation or row-wise summation. When <code>axis = 0</code> we imply to sum it over columns only. For example, <pre class="prettyprint"><code>a = np.array([[1, 2, 3], [4, 5, 6]]) np.sum(a, axis = 0) </code></pre> This snippet of code produces output: <code>array([5, 7, 9])</code>, fine. But if I do: <pre class="prettyprint"><code>a = np.array([1, 2, 3]) np.sum(a, axis = 0) </code></pre> I get result: <code>6</code>, why is that? Shouldn't I get <code>array([1, 2, 3])</code>?

If someone need this visual description: <img src="https://i.stack.imgur.com/Z29Nn.jpg" alt="numpy axis 0 and axis 1">

All that is going on is that numpy is summing across the first (0th) and only axis. Consider the following: <pre class="prettyprint"><code>In [2]: a = np.array([1, 2, 3]) In [3]: a.shape Out[3]: (3,) In [4]: len(a.shape) # number of dimensions Out[4]: 1 In [5]: a1 = a.reshape(3,1) In [6]: a2 = a.reshape(1,3) In [7]: a1 Out[7]: array([[1], [2], [3]]) In [8]: a2 Out[8]: array([[1, 2, 3]]) In [9]: a1.sum(axis=1) Out[9]: array([1, 2, 3]) In [10]: a1.sum(axis=0) Out[10]: array([6]) In [11]: a2.sum(axis=1) Out[11]: array([6]) In [12]: a2.sum(axis=0) Out[12]: array([1, 2, 3]) </code></pre> So, to be more explicit: <pre class="prettyprint"><code>In [15]: a1.shape Out[15]: (3, 1) </code></pre> <code>a1</code> is 2-dimensional, the "long" axis being the first. <pre class="prettyprint"><code>In [16]: a1[:,0] # give me everything in the first axis, and the first part of the second Out[16]: array([1, 2, 3]) </code></pre> Now, sum along the first axis: <pre class="prettyprint"><code>In [17]: a1.sum(axis=0) Out[17]: array([6]) </code></pre> Now, consider a less trivial two-dimensional case: <pre class="prettyprint"><code>In [20]: b = np.array([[1,2,3],[4,5,6]]) In [21]: b Out[21]: array([[1, 2, 3], [4, 5, 6]]) In [22]: b.shape Out[22]: (2, 3) </code></pre> The first axis is the "rows". Sum along the rows: <pre class="prettyprint"><code>In [23]: b.sum(axis=0) Out[23]: array([5, 7, 9]) </code></pre> The second axis are the "columns". Sum along the columns: <pre class="prettyprint"><code>In [24]: b.sum(axis=1) Out[24]: array([ 6, 15]) </code></pre>

What does axis = 0 do in Numpy's sum function?

Tags:

python

arrays

numpy

I am learning Python, and have encountered numpy.sum. It has an optional parameter axis. This parameter is used to get either column-wise summation or row-wise summation. When axis = 0 we imply to sum it over columns only. For example,

a = np.array([[1, 2, 3], [4, 5, 6]]) np.sum(a, axis = 0)

This snippet of code produces output: array([5, 7, 9]), fine. But if I do:

a = np.array([1, 2, 3]) np.sum(a, axis = 0)

I get result: 6, why is that? Shouldn't I get array([1, 2, 3])?

394

asked Oct 23 '16 06:10

Bishwajit Purkaystha

2 Answers

If someone need this visual description:

numpy axis 0 and axis 1

answered Oct 06 '22 12:10

debaonline4u

All that is going on is that numpy is summing across the first (0th) and only axis. Consider the following:

In [2]: a = np.array([1, 2, 3])  In [3]: a.shape Out[3]: (3,)  In [4]: len(a.shape) # number of dimensions Out[4]: 1  In [5]: a1 = a.reshape(3,1)  In [6]: a2 = a.reshape(1,3)  In [7]: a1 Out[7]:  array([[1],        [2],        [3]])  In [8]: a2 Out[8]: array([[1, 2, 3]])  In [9]: a1.sum(axis=1) Out[9]: array([1, 2, 3])  In [10]: a1.sum(axis=0) Out[10]: array([6])  In [11]: a2.sum(axis=1) Out[11]: array([6])  In [12]: a2.sum(axis=0) Out[12]: array([1, 2, 3])

So, to be more explicit:

In [15]: a1.shape Out[15]: (3, 1)

a1 is 2-dimensional, the "long" axis being the first.

In [16]: a1[:,0] # give me everything in the first axis, and the first part of the second Out[16]: array([1, 2, 3])

Now, sum along the first axis:

In [17]: a1.sum(axis=0) Out[17]: array([6])

Now, consider a less trivial two-dimensional case:

In [20]: b = np.array([[1,2,3],[4,5,6]])  In [21]: b Out[21]:  array([[1, 2, 3],        [4, 5, 6]])  In [22]: b.shape Out[22]: (2, 3)

The first axis is the "rows". Sum along the rows:

In [23]: b.sum(axis=0) Out[23]: array([5, 7, 9])

The second axis are the "columns". Sum along the columns:

In [24]: b.sum(axis=1) Out[24]: array([ 6, 15])

answered Oct 06 '22 13:10

juanpa.arrivillaga

Related questions
                            
                                Get local timezone in django
                            
                                Writing json to file in s3 bucket
                            
                                How to check if file exists in Google Cloud Storage?
                            
                                Check unread count of Gmail messages with Python
                            
                                Detect tap with pyaudio from live mic
                            
                                Clamping floating numbers in Python? [duplicate]
                            
                                Python: unsigned 32 bit bitwise arithmetic
                            
                                Display the date, like "May 5th", using pythons strftime? [duplicate]
                            
                                Python to JSON Serialization fails on Decimal [duplicate]
                            
                                Python requests module sends JSON string instead of x-www-form-urlencoded param string
                            
                                How to safely get the file extension from a URL?
                            
                                How do I modify the session in the Django test framework
                            
                                Read a file line by line from S3 using boto?
                            
                                foreignkey (user) in models
                            
                                Flask-SQLAlchemy Constructor
                            
                                How to fix AttributeError: partially initialized module?
                            
                                execute *.sql file with python MySQLdb
                            
                                Django Templates First element of a List
                            
                                How to assign a variable in an IF condition, and then return it?
                            
                                Convert decimal mark when reading numbers as input

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With