I have a dataframe of shape (4, 3) as following: <pre class="prettyprint"><code>In [1]: import pandas as pd In [2]: import numpy as np In [3]: x = pd.DataFrame(np.random.randn(4, 3), index=np.arange(4)) In [4]: x Out[4]: 0 1 2 0 0.959322 0.099360 1.116337 1 -0.211405 -2.563658 -0.561851 2 0.616312 -1.643927 -0.483673 3 0.235971 0.023823 1.146727 </code></pre> I want to multiply each column of the dataframe with a numpy array of shape (4,): <pre class="prettyprint"><code>In [9]: y = np.random.randn(4) In [10]: y Out[10]: array([-0.34125522, 1.21567883, -0.12909408, 0.64727577]) </code></pre> In numpy, the following broadcasting trick works: <pre class="prettyprint"><code>In [12]: x.values * y[:, None] Out[12]: array([[-0.32737369, -0.03390716, -0.38095588], [-0.25700028, -3.11658448, -0.68303043], [-0.07956223, 0.21222123, 0.06243928], [ 0.15273815, 0.01541983, 0.74224861]]) </code></pre> However, it doesn't work in the case of pandas dataframe, I get the following error: <pre class="prettyprint"><code>In [13]: x * y[:, None] --------------------------------------------------------------------------- ValueError Traceback (most recent call last) <ipython-input-13-21d033742c49> in <module>() ----> 1 x * y[:, None] ... ValueError: Shape of passed values is (1, 4), indices imply (3, 4) </code></pre> Any suggestions? Thanks!

I find an alternative way to do the multiplication between pandas dataframe and numpy array. <pre class="prettyprint"><code>In [14]: x.multiply(y, axis=0) Out[14]: 0 1 2 0 0.195346 0.443061 1.219465 1 0.194664 0.242829 0.180010 2 0.803349 0.091412 0.098843 3 0.365711 -0.388115 0.018941 </code></pre>

how to multiply pandas dataframe with numpy array with broadcasting

Tags:

python

pandas

numpy

array-broadcasting

I have a dataframe of shape (4, 3) as following:

In [1]: import pandas as pd

In [2]: import numpy as np

In [3]: x = pd.DataFrame(np.random.randn(4, 3), index=np.arange(4))

In [4]: x
Out[4]: 
          0         1         2
0  0.959322  0.099360  1.116337
1 -0.211405 -2.563658 -0.561851
2  0.616312 -1.643927 -0.483673
3  0.235971  0.023823  1.146727

I want to multiply each column of the dataframe with a numpy array of shape (4,):

In [9]: y = np.random.randn(4)

In [10]: y
Out[10]: array([-0.34125522,  1.21567883, -0.12909408,  0.64727577])

In numpy, the following broadcasting trick works:

In [12]: x.values * y[:, None]
Out[12]: 
array([[-0.32737369, -0.03390716, -0.38095588],
       [-0.25700028, -3.11658448, -0.68303043],
       [-0.07956223,  0.21222123,  0.06243928],
       [ 0.15273815,  0.01541983,  0.74224861]])

However, it doesn't work in the case of pandas dataframe, I get the following error:

In [13]: x * y[:, None]
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-13-21d033742c49> in <module>()
----> 1 x * y[:, None]
...
ValueError: Shape of passed values is (1, 4), indices imply (3, 4)

Any suggestions?

Thanks!

378

asked Aug 12 '15 16:08

Wei Li

1 Answers

I find an alternative way to do the multiplication between pandas dataframe and numpy array.

In [14]: x.multiply(y, axis=0)
Out[14]: 
          0         1         2
0  0.195346  0.443061  1.219465
1  0.194664  0.242829  0.180010
2  0.803349  0.091412  0.098843
3  0.365711 -0.388115  0.018941

125

answered Sep 25 '22 02:09

Wei Li

Related questions
                            
                                Plot multiple boxplot in one graph in pandas or matplotlib?
                            
                                AttributeError: 'Pool' object has no attribute '__exit__'
                            
                                Python QuickSort maximum recursion depth
                            
                                Printing lists in python without spaces
                            
                                Python: How to find two equal/closest values between two separate arrays?
                            
                                Sympy Simplification with Square Root
                            
                                How to convert a dictionary into a flat list?
                            
                                selenium move_to_element does not always mouse-hover
                            
                                Python: Munging data with '.join' (TypeError: sequence item 0: expected string, tuple found)
                            
                                How do I inspect one specific object in IPython
                            
                                Visualize Optical Flow with color model
                            
                                Convert Bitstring (String of 1 and 0s) to numpy array
                            
                                Django: extending user model vs creating user profile model
                            
                                '400 Bad Request' when post json in Flask
                            
                                Python pandas summary table plot
                            
                                How to set bandwidth on Mininet custom topology?
                            
                                Serialize Objects with One-to-One Relationship Django
                            
                                Beautifulsoup split text in tag by <br/>
                            
                                Linear programming with scipy.optimize.linprog
                            
                                dtype changes when using DataFrame.to_dict

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how to multiply pandas dataframe with numpy array with broadcasting

Tags:

python

pandas

numpy

array-broadcasting

Wei Li

People also ask

1 Answers

Wei Li

Recent Activity

Donate For Us