Usually when using the <code>.apply()</code> method, one passes a function that takes exactly one argument. <pre class="prettyprint"><code>def somefunction(group): group['ColumnC'] == group['ColumnC']**2 return group df.groupby(['ColumnA', 'ColumnB']).apply(somefunction) </code></pre> Here <code>somefunction</code> is applied for each <code>group</code>, which is then returned. Basically I'm using this example here. I want to have the ability to not specify the column name <code>ColumnC</code> beforehand. Passing it along as an argument of <code>somefunction</code> would make the code more flexible. <pre class="prettyprint"><code>def somefunction(group, column_name): group[column_name] == group[column_name]**2 return group df.groupby(['ColumnA', 'ColumnB']).apply(somefunction) </code></pre> Is there any way to make this work? I can't pass <code>group</code> to <code>somefunction</code>, because that is magically done by <code>.apply()</code> in the background.

you can pass key word arguments through <code>apply</code> <pre class="prettyprint"><code>df.groupby(['ColumnA', 'ColumnB']).apply(somefunction, column_name='col') </code></pre> <hr> MCVE <pre class="prettyprint"><code>df = pd.DataFrame(dict(A=list(range(2)) * 5, B=range(10)[::-1])) def f(df, arg1): return df * arg1 df.groupby('A').apply(f, arg1=3) A B 0 0 27 1 3 24 2 0 21 3 3 18 4 0 15 5 3 12 6 0 9 7 3 6 8 0 3 9 3 0 </code></pre>

Pandas GroupBy: apply a function with two arguments

Tags:

python

pandas

Usually when using the .apply() method, one passes a function that takes exactly one argument.

def somefunction(group):
    group['ColumnC'] == group['ColumnC']**2
    return group

df.groupby(['ColumnA', 'ColumnB']).apply(somefunction)

Here somefunction is applied for each group, which is then returned. Basically I'm using this example here.

I want to have the ability to not specify the column name ColumnC beforehand. Passing it along as an argument of somefunction would make the code more flexible.

def somefunction(group, column_name):
    group[column_name] == group[column_name]**2
    return group

df.groupby(['ColumnA', 'ColumnB']).apply(somefunction)

Is there any way to make this work? I can't pass group to somefunction, because that is magically done by .apply() in the background.

865

asked Apr 25 '17 16:04

Michael Gecht

1 Answers

you can pass key word arguments through apply

df.groupby(['ColumnA', 'ColumnB']).apply(somefunction, column_name='col')

MCVE

df = pd.DataFrame(dict(A=list(range(2)) * 5, B=range(10)[::-1]))

def f(df, arg1):
    return df * arg1

df.groupby('A').apply(f, arg1=3)

   A   B
0  0  27
1  3  24
2  0  21
3  3  18
4  0  15
5  3  12
6  0   9
7  3   6
8  0   3
9  3   0

198

answered Sep 24 '22 16:09

piRSquared

Related questions
                            
                                Serving interactive bokeh figure on heroku
                            
                                Data Conversion Error while applying a function to each row in pandas Python
                            
                                Pandas: Is there a way to use something like 'droplevel' and in process, rename the other level using the dropped level labels as prefix/suffix?
                            
                                How to debug external .py functions run from Jupyter/IPython notebook
                            
                                How to use a complex type from a WSDL with zeep in Python
                            
                                Replace duplicate values across columns in Pandas
                            
                                airflow startup failed due to gunicorn
                            
                                How to check if a CSV has a header using Python?
                            
                                Convert a numpy array of lists to a numpy array
                            
                                Select data when specific columns have null value in pandas
                            
                                How does one enter a Python virtualenv when executing a bashscript?
                            
                                How to drop the index column while writing the DataFrame in a .csv file in Pandas? [duplicate]
                            
                                Using url_for in tests
                            
                                Find string within JSON with Python
                            
                                Pandas use and operator in LOC function
                            
                                How should we pad text sequence in keras using pad_sequences?
                            
                                How to detect current keyboard language in python
                            
                                How can I see the formulas of an excel spreadsheet in pandas / python?
                            
                                Why we need python packaging (e.g. egg)? [duplicate]
                            
                                How can I create a DataFrame slice object piece by piece?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With