I have the following data frame: <pre class="prettyprint"><code>import pandas as pd df = pd.DataFrame({'AAA' : ['w','x','y','z'], 'BBB' : [10,20,30,40],'CCC' : [100,50,-30,-50]}) </code></pre> Which looks like this: <pre class="prettyprint"><code>In [32]: df Out[32]: AAA BBB CCC 0 w 10 100 1 x 20 50 2 y 30 -30 3 z 40 -50 </code></pre> What I want to do is to perform function operation on every row for every column except those with non-numerical value (in this case <code>AAA</code>). In the real case the non-numerical case is always on first column, and the rest (could be greater than 2 columns) are always numerical. The final desired output is: <pre class="prettyprint"><code> AAA BBB CCC Score 0 w 10 100 110 1 x 20 50 70 2 y 30 -30 0 3 z 40 -50 -10 </code></pre> I tried this but failed: <pre class="prettyprint"><code>import numpy as np df["Score"] = df.apply(np.sum, axis=1) </code></pre> What's the right way to do it? Update2: This is the code that give <code>SettingWithCopyWarning</code>. Please fresh start the ipython for testing. <pre class="prettyprint"><code>import pandas as pd import numpy as np def cvscore(fclist): sd = np.std(fclist) mean = np.mean(fclist) cv = sd/mean return cv def calc_cvscore_on_df(df): df["CV"] = df.iloc[:,1:].apply(cvscore, axis=1) return df df3 = pd.DataFrame(np.random.randn(1000, 3), columns=['a', 'b', 'c']) calc_cvscore_on_df(df3[["a","b"]]) </code></pre>

To select everything but the first column, you could use <code>df.iloc[:, 1:]</code>: <pre class="prettyprint"><code>In [371]: df['Score'] = df.iloc[:, 1:].sum(axis=1) In [372]: df Out[372]: AAA BBB CCC Score 0 w 10 100 110 1 x 20 50 70 2 y 30 -30 0 3 z 40 -50 -10 </code></pre> To apply an arbitrary function, <code>func</code>, to each row: <pre class="prettyprint"><code>df.iloc[:, 1:].apply(func, axis=1) </code></pre> <hr> For example, <pre class="prettyprint"><code>import numpy as np import pandas as pd def cvscore(fclist): sd = np.std(fclist) mean = np.mean(fclist) cv = sd/mean return cv df = pd.DataFrame({'AAA' : ['w','x','y','z'], 'BBB' : [10,20,30,40], 'CCC' : [100,50,-30,-50]}) df['Score'] = df.iloc[:, 1:].apply(cvscore, axis=1) print(df) </code></pre> yields <pre class="prettyprint"><code> AAA BBB CCC Score 0 w 10 100 1.211386 1 x 20 50 0.868377 2 y 30 -30 NaN 3 z 40 -50 -5.809058 </code></pre>

Apply function row wise on pandas data frame on columns with numerical values

Tags:

python

pandas

I have the following data frame:

Click to copy

import pandas as pd
df = pd.DataFrame({'AAA' : ['w','x','y','z'], 'BBB' : [10,20,30,40],'CCC' : [100,50,-30,-50]})

Which looks like this:

Click to copy

In [32]: df
Out[32]:
  AAA  BBB  CCC
0   w   10  100
1   x   20   50
2   y   30  -30
3   z   40  -50

What I want to do is to perform function operation on every row for every column except those with non-numerical value (in this case AAA). In the real case the non-numerical case is always on first column, and the rest (could be greater than 2 columns) are always numerical.

The final desired output is:

Click to copy

  AAA  BBB  CCC  Score
0   w   10  100  110
1   x   20   50   70
2   y   30  -30    0
3   z   40  -50  -10

I tried this but failed:

Click to copy

import numpy as np
df["Score"] = df.apply(np.sum, axis=1)

What's the right way to do it?

Update2:

This is the code that give SettingWithCopyWarning. Please fresh start the ipython for testing.

Click to copy

import pandas as pd
import numpy as np 
def cvscore(fclist):
    sd = np.std(fclist)
    mean = np.mean(fclist)
    cv = sd/mean
    return cv

def calc_cvscore_on_df(df):
    df["CV"] = df.iloc[:,1:].apply(cvscore, axis=1)
    return df

df3 = pd.DataFrame(np.random.randn(1000, 3), columns=['a', 'b', 'c'])
calc_cvscore_on_df(df3[["a","b"]])

484

asked Mar 27 '15 02:03

pdubois

1 Answers

To select everything but the first column, you could use df.iloc[:, 1:]:

Click to copy

In [371]: df['Score'] = df.iloc[:, 1:].sum(axis=1)

In [372]: df
Out[372]: 
  AAA  BBB  CCC  Score
0   w   10  100    110
1   x   20   50     70
2   y   30  -30      0
3   z   40  -50    -10

To apply an arbitrary function, func, to each row:

Click to copy

df.iloc[:, 1:].apply(func, axis=1)

For example,

Click to copy

import numpy as np
import pandas as pd

def cvscore(fclist):
    sd = np.std(fclist)
    mean = np.mean(fclist)
    cv = sd/mean
    return cv

df = pd.DataFrame({'AAA' : ['w','x','y','z'], 'BBB' : [10,20,30,40],
                   'CCC' : [100,50,-30,-50]})

df['Score'] = df.iloc[:, 1:].apply(cvscore, axis=1)
print(df)

yields

Click to copy

  AAA  BBB  CCC     Score
0   w   10  100  1.211386
1   x   20   50  0.868377
2   y   30  -30       NaN
3   z   40  -50 -5.809058

answered Nov 14 '22 23:11

unutbu

Related questions
                            
                                Bitwise Rotate Right
                            
                                Including a compiled module in module that is wrapped with f2py (Minimum working example)?
                            
                                Remove matplotlib text plot border
                            
                                Open a text file with accents in python
                            
                                ValueError: dictionary update sequence element #0 has length 1; 2 is required
                            
                                Algorithm to solve for water accumulation given building heights
                            
                                Do scrapers need to be written for every site they target?
                            
                                Executing shell mail command using python
                            
                                How to iterate over a dictionary - n key-value pairs at a time
                            
                                How can I integrate Tkinter with Python log in screen?
                            
                                Python Format Best Practices
                            
                                How to Multiply Decimals in Python
                            
                                Invalid block tag: 'bootstrap_icon', expected 'endblock'
                            
                                How to turn a list/tuple into a space separated string in python using a single line?
                            
                                Log Normal Random Variables with Scipy
                            
                                Loading global data for server using Flask and gunicorn
                            
                                PyPI API - How to get stable package version
                            
                                How can I format a float with given precision and zero padding?
                            
                                how to Count the number of non zero pixels of the canny image in my python program
                            
                                Distinguish matches in pyparsing

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Apply function row wise on pandas data frame on columns with numerical values

Tags:

python

pandas

pdubois

People also ask

1 Answers

unutbu

Recent Activity

Donate For Us