Multiply two dataframes conditional on another column

Tags:

pandas

dataframe

I have two dataframes: df1 and df2.

df1

Index date       | X1 | X2 
 0    01-01-2020 | H  | 10   
 1    01-02-2020 | H  | 30   
 2    01-03-2020 | Y  | 15    
 3    01-04-2020 | Y  | 20

df2

Index | X1 | X2 
 0    | H  | 5   
 1    | Y  | 10

I want to multiply the X2 values of df1 by the X2 values of df2 wherever the values in column X1 match.

Desired result:

Index date       | X1 | X2 
 0    01-01-2020 | H  | 50   
 1    01-02-2020 | H  | 150   
 2    01-03-2020 | Y  | 150    
 3    01-04-2020 | Y  | 200

asked Sep 03 '20 05:09

user2512443

2 Answers

Use Series.map to match on X1, then multiply by X2:

df1['X2'] *= df1['X1'].map(df2.set_index('X1')['X2'])
print(df1)
         date X1   X2
0  01-01-2020  H   50
1  01-02-2020  H  150
2  01-03-2020  Y  150
3  01-04-2020  Y  200
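
For anyone who wants to reproduce this, here is a minimal self-contained sketch of the map approach. The frames are rebuilt from the tables in the question; the intermediate name `factors` is only illustrative and not part of the original answer.

```python
import pandas as pd

# Frames rebuilt from the question's tables.
df1 = pd.DataFrame({
    "date": ["01-01-2020", "01-02-2020", "01-03-2020", "01-04-2020"],
    "X1": ["H", "H", "Y", "Y"],
    "X2": [10, 30, 15, 20],
})
df2 = pd.DataFrame({"X1": ["H", "Y"], "X2": [5, 10]})

# Lookup Series: the index holds the keys (X1), the values hold the multipliers (X2).
factors = df2.set_index("X1")["X2"]

# map() looks up each X1 value of df1 in that Series.
# A key that is missing from df2 would come back as NaN.
df1["X2"] = df1["X2"] * df1["X1"].map(factors)

print(df1)
```

Since map works row by row on df1, the result stays in df1's row order regardless of how df2 is sorted. It does need the X1 values in df2 to be unique.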

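Or use DataFrame.merge with a left join from df1, which keeps df1's row order (this assumes df1 has the default 0..n-1 index, because the merged frame gets a fresh one):

df1['X2'] *= df1.merge(df2, on='X1', how='left')['X2_y']
print(df1)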
         date X1   X2
0  01-01-2020  H   50
1  01-02-2020  H  150
2  01-03-2020  Y  150
3  01-04-2020  Y  200
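
If df1 is not sorted by X1, or contains keys that df2 lacks, merging from df1's side is one way to keep the rows aligned. The following is a sketch under assumptions: the unsorted data and the extra key "Z" are made up for illustration, and treating an unmatched key as "multiply by 1" is a choice, not something the question specifies.

```python
import pandas as pd

# Hypothetical data: rows are not sorted by X1 and "Z" has no match in df2.
df1 = pd.DataFrame({
    "date": ["01-01-2020", "01-02-2020", "01-03-2020", "01-04-2020"],
    "X1": ["Y", "H", "Z", "H"],
    "X2": [15, 10, 7, 30],
})
df2 = pd.DataFrame({"X1": ["H", "Y"], "X2": [5, 10]})

# A left join from df1 keeps every df1 row in its original order.
# df2's X2 arrives as X2_factor; it is NaN where X1 has no match.
merged = df1.merge(df2, on="X1", how="left", suffixes=("", "_factor"))

# Assumption: rows without a matching key are left unchanged (factor 1).
merged["X2"] = merged["X2"] * merged["X2_factor"].fillna(1)

result = merged.drop(columns="X2_factor")
print(result)
```

Note that X2 becomes a float column here, because the unmatched row introduces a NaN before fillna is applied.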

answered Oct 23 '22 06:10

jezrael

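You can set X1 as the index on both dataframes, multiply the aligned X2 columns, and assign the resulting array back to df1. Because .array drops the index, the assignment is positional, so this relies on the product coming back in df1's row order:

df1["X2"] = df1.set_index("X1").X2.mul(df2.set_index("X1").X2).array

df1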

        date    X1  X2
0   01-01-2020  H   50
1   01-02-2020  H   150
2   01-03-2020  Y   150
3   01-04-2020  Y   200

answered Oct 23 '22 05:10

sammywemmy
