How can I divide a numpy array row by the sum of all values in this row? This is one example. But I'm pretty sure there is a fancy and much more efficient way of doing this: <pre class="prettyprint"><code>import numpy as np e = np.array([[0., 1.],[2., 4.],[1., 5.]]) for row in xrange(e.shape[0]): e[row] /= np.sum(e[row]) </code></pre> Result: <pre class="prettyprint"><code>array([[ 0. , 1. ], [ 0.33333333, 0.66666667], [ 0.16666667, 0.83333333]]) </code></pre>

You can do it mathematically as <img src="https://i.stack.imgur.com/RhQZn.gif" alt="enter image description here">. Here, <code>E</code> is your original matrix and <code>D</code> is a diagonal matrix where each entry is the sum of the corresponding row in <code>E</code>. If you're lucky enough to have an invertible <code>D</code>, this is a pretty mathematically convenient way to do things. In numpy: <pre class="prettyprint"><code>import numpy as np diagonal_entries = [sum(e[row]) for row in range(e.shape[0])] D = np.diag(diagonal_entries) D_inv = np.linalg.inv(D) e = np.dot(e, D_inv) </code></pre>

numpy divide row by row sum

Tags:

python

multidimensional-array

numpy

How can I divide a numpy array row by the sum of all values in this row?

This is one example. But I'm pretty sure there is a fancy and much more efficient way of doing this:

import numpy as np e = np.array([[0., 1.],[2., 4.],[1., 5.]]) for row in xrange(e.shape[0]):     e[row] /= np.sum(e[row])

Result:

array([[ 0.        ,  1.        ],        [ 0.33333333,  0.66666667],        [ 0.16666667,  0.83333333]])

992

asked Apr 24 '13 21:04

Stefan Profanter

2 Answers

Method #1: use None (or np.newaxis) to add an extra dimension so that broadcasting will behave:

>>> e array([[ 0.,  1.],        [ 2.,  4.],        [ 1.,  5.]]) >>> e/e.sum(axis=1)[:,None] array([[ 0.        ,  1.        ],        [ 0.33333333,  0.66666667],        [ 0.16666667,  0.83333333]])

Method #2: go transpose-happy:

>>> (e.T/e.sum(axis=1)).T array([[ 0.        ,  1.        ],        [ 0.33333333,  0.66666667],        [ 0.16666667,  0.83333333]])

(You can drop the axis= part for conciseness, if you want.)

Method #3: (promoted from Jaime's comment)

Use the keepdims argument on sum to preserve the dimension:

>>> e/e.sum(axis=1, keepdims=True) array([[ 0.        ,  1.        ],        [ 0.33333333,  0.66666667],        [ 0.16666667,  0.83333333]])

answered Oct 19 '22 09:10

DSM

You can do it mathematically as enter image description here .

Here, E is your original matrix and D is a diagonal matrix where each entry is the sum of the corresponding row in E. If you're lucky enough to have an invertible D, this is a pretty mathematically convenient way to do things.

In numpy:

import numpy as np  diagonal_entries = [sum(e[row]) for row in range(e.shape[0])] D = np.diag(diagonal_entries) D_inv = np.linalg.inv(D) e = np.dot(e, D_inv)

answered Oct 19 '22 08:10

Ali

Related questions
                            
                                Which JSON module can I use in Python 2.5?
                            
                                Flask - Accessing the config variable in the template
                            
                                NameError: name 'datetime' is not defined
                            
                                Running a Dash app within a Flask app
                            
                                Get HTTP Error code from requests.exceptions.HTTPError
                            
                                R and Python in one Jupyter notebook
                            
                                What's the purpose of the + (pos) unary operator in Python?
                            
                                Python Date Comparisons
                            
                                How to use C++ classes with ctypes?
                            
                                In a matplotlib plot, can I highlight specific x-value ranges?
                            
                                ImportError: No module named concurrent.futures.process
                            
                                Reading named command arguments
                            
                                Slice 2d array into smaller 2d arrays
                            
                                ImportError: cannot import name HTTPSHandler using PIP
                            
                                Upgrade package without upgrading dependencies using pip?
                            
                                Proxying to another web service with Flask
                            
                                How can I use Django OAuth Toolkit with Python Social Auth?
                            
                                How to store a dictionary on a Django Model?
                            
                                Command line execution in different folder
                            
                                Python decorator handling docstrings

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With