How can I multiply a n*m DataFrame with a 1*m DataFrame in pandas?

I have two pandas DataFrames that I want to multiply:

frame_score: 
   Score1  Score2
0     100      80
1    -150      20
2    -110      70
3     180      99
4     125      20

frame_weights: 
   Score1  Score2
0     0.6     0.4

I tried:

import pandas as pd
import numpy as np

frame_score = pd.DataFrame({'Score1': [100, -150, -110, 180, 125],
                            'Score2': [80, 20, 70, 99, 20]})

frame_weights = pd.DataFrame({'Score1': [0.6], 'Score2': [0.4]})

print('frame_score: \n{0}'.format(frame_score))
print('\nframe_weights: \n{0}'.format(frame_weights))

# Each of the following alternatives yields the same results
frame_score_weighted = frame_score.mul(frame_weights, axis=0)
frame_score_weighted = frame_score * frame_weights
frame_score_weighted = frame_score.multiply(frame_weights, axis=1)

print('\nframe_score_weighted: \n{0}'.format(frame_score_weighted))

returns:

frame_score_weighted:   
    Score1  Score2
0    60.0    32.0
1     NaN     NaN
2     NaN     NaN
3     NaN     NaN
4     NaN     NaN

Rows 1 to 4 are NaN. How can I avoid that? For example, row 1 should be -90 and 8 (-90 = -150 * 0.6; 8 = 20 * 0.4).

NumPy, for comparison, would broadcast the 1*m array to match the dimensions of the n*m array.

asked Jul 31 '17 17:07

2 Answers

Edit: for arbitrary dimensions, use .values to work on the underlying NumPy arrays, which broadcast the 1*m weights across all n rows:

# element-wise multiplication; NumPy broadcasts the (1, m) weights over the (n, m) scores
frame_score_weighted = frame_score.values * frame_weights.values

# convert back to a pandas DataFrame, reusing the original row and column labels
frame_score_weighted = pd.DataFrame(data=frame_score_weighted, index=frame_score.index, columns=frame_score.columns)

#Out: 
   Score1  Score2
0    60.0    32.0
1   -90.0     8.0
2   -66.0    28.0
3   108.0    39.6
4    75.0     8.0
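
As a rough, self-contained sketch of the broadcasting step above: the version below uses to_numpy() in place of .values (an assumption on my part; both return the underlying array) and prints the shapes so the (5, 2) * (1, 2) broadcast is visible. Note that this route ignores labels entirely, so it assumes both frames list their columns in the same order.

```python
import numpy as np
import pandas as pd

frame_score = pd.DataFrame({'Score1': [100, -150, -110, 180, 125],
                            'Score2': [80, 20, 70, 99, 20]})
frame_weights = pd.DataFrame({'Score1': [0.6], 'Score2': [0.4]})

scores = frame_score.to_numpy()     # plain array, labels dropped
weights = frame_weights.to_numpy()  # single-row array
print(scores.shape, weights.shape)

# NumPy stretches the single weights row across every score row.
product = scores * weights

# Rebuild a DataFrame, reusing the original labels instead of hard-coding them.
frame_score_weighted = pd.DataFrame(product,
                                    index=frame_score.index,
                                    columns=frame_score.columns)
print(frame_score_weighted)
```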

Alternatively, use some additional indexing to extract each weight as a scalar before multiplying. Note that this overwrites the columns of frame_score.

# .iloc[0] takes the first (and only) weight by position
frame_score['Score1'] = frame_score['Score1'] * frame_weights['Score1'].iloc[0]
frame_score['Score2'] = frame_score['Score2'] * frame_weights['Score2'].iloc[0]

frame_score
#Out: 
   Score1  Score2
0    60.0    32.0
1   -90.0     8.0
2   -66.0    28.0
3   108.0    39.6
4    75.0     8.0
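
If there are more than two columns, the same scalar-extraction idea can be written as a loop. This is only a sketch: it assumes every column of frame_score also exists in frame_weights, and it works on a copy (my choice, not part of the answer above) so the original scores are left untouched.

```python
import pandas as pd

frame_score = pd.DataFrame({'Score1': [100, -150, -110, 180, 125],
                            'Score2': [80, 20, 70, 99, 20]})
frame_weights = pd.DataFrame({'Score1': [0.6], 'Score2': [0.4]})

# Work on a copy so frame_score keeps its original values.
frame_score_weighted = frame_score.copy()

for column in frame_score_weighted.columns:
    # Pull the single weight for this column out as a scalar.
    weight = frame_weights[column].iloc[0]
    frame_score_weighted[column] = frame_score_weighted[column] * weight

print(frame_score_weighted)
```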

answered Oct 20 '22 10:10

By default, when a pd.DataFrame is multiplied by a pd.Series, pandas aligns the index of the pd.Series with the columns of the pd.DataFrame. So we can get the relevant pd.Series from frame_weights by selecting just its first row:

frame_score * frame_weights.loc[0]

   Score1  Score2
0    60.0    32.0
1   -90.0     8.0
2   -66.0    28.0
3   108.0    39.6
4    75.0     8.0

You can also modify frame_score in place:

frame_score *= frame_weights.loc[0]
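
To make the alignment behaviour concrete, here is a small sketch contrasting the two cases. The weights are deliberately listed in a different column order (my own variation, not from the question) to show that the Series route matches on labels rather than position. mul(..., axis='columns') is the explicit spelling of what the * operator does with a Series.

```python
import pandas as pd

frame_score = pd.DataFrame({'Score1': [100, -150, -110, 180, 125],
                            'Score2': [80, 20, 70, 99, 20]})
# Same weights as in the question, but with the columns swapped.
frame_weights = pd.DataFrame({'Score2': [0.4], 'Score1': [0.6]})

# DataFrame * DataFrame aligns on row labels as well as column labels.
# frame_weights only has row 0, so rows 1 to 4 have no partner and become NaN.
nan_result = frame_score * frame_weights
print(nan_result)

# A Series has no row labels to align; its index is matched to the columns.
weights = frame_weights.loc[0]
aligned = frame_score.mul(weights, axis='columns')
print(aligned)
```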

answered Oct 20 '22 08:10

piRSquared


Tags: python, pandas, dataframe, multiplication

Asked by Franck Dernoncourt

Answered by hausdork and piRSquared
