Given the following data frame: <pre class="prettyprint"><code>import pandas as pd import numpy as np DF = pd.DataFrame({'COL1': ['a','b','b'], 'COL2' : [0,np.nan,1],}) DF COL1 COL2 0 a 0 1 b NaN 2 b 1 </code></pre> I want to be able to assign a new column <code>COL3</code> that has a value of <code>2</code> for every row where <code>COL1</code> is <code>b</code> and <code>COL2</code> is not null. The desired result is as follows: <pre class="prettyprint"><code> COL1 COL2 COL3 0 a 0 0 1 b NaN 0 2 b 1 2 </code></pre> Thanks in advance!

Define a function to return your value based on other columns. <pre class="prettyprint"><code>def value_handle (row): if row['COL1'] == 'b' and not pd.isnull(row['COL2']) : return 2 else: return 0 </code></pre> Then call the new function when introducing the new column. <pre class="prettyprint"><code>DF['COL3'] = DF.apply (lambda row: value_handle (row),axis=1) </code></pre>

This can be achieved using the apply method on the DataFrame. You'll need to pass in a function to apply to each row and set the axis to <code>1</code> to set it to the correct mode (apply for each row, instead of for each column). Here's a working example: <pre class="prettyprint"><code>def row_handler(row): if row['COL1'] == 'b' and not np.isnan(row['COL2']): return 2 return 0 DF['COL3'] = DF.apply(row_handler, axis=1) </code></pre> Which returns this: <pre class="prettyprint"><code>>> print DF COL1 COL2 COL3 0 a 0 0 1 b NaN 0 2 b 1 2 </code></pre>

You can use <code>numpy.where</code> with <code>isin</code> and <code>notnull</code>: <pre class="prettyprint"><code>DF['COL3'] = np.where((DF['COL1'].isin(['b'])) &(DF['COL2'].notnull()), 2, 0) print DF COL1 COL2 COL3 0 a 0 0 1 b NaN 0 2 b 1 2 </code></pre>

Pandas assign value to cell based on values of other cells in row

Tags:

python

python-3.x

pandas

dataframe

Given the following data frame:

import pandas as pd
import numpy as np
DF = pd.DataFrame({'COL1': ['a','b','b'], 
                   'COL2' : [0,np.nan,1],})

DF

    COL1    COL2
0    a        0      
1    b       NaN     
2    b        1

I want to be able to assign a new column COL3 that has a value of 2 for every row where COL1 is b and COL2 is not null.

The desired result is as follows:

    COL1    COL2    COL3
0    a        0      0
1    b       NaN     0
2    b        1      2

Thanks in advance!

495

asked Jan 17 '16 06:01

Dance Party

3 Answers

Define a function to return your value based on other columns.

def value_handle (row):
    if row['COL1'] == 'b' and not pd.isnull(row['COL2']) :
        return 2
    else:
        return 0

Then call the new function when introducing the new column.

DF['COL3'] = DF.apply (lambda row: value_handle (row),axis=1)

answered Oct 30 '22 07:10

madawa

This can be achieved using the apply method on the DataFrame. You'll need to pass in a function to apply to each row and set the axis to 1 to set it to the correct mode (apply for each row, instead of for each column).

Here's a working example:

def row_handler(row):
    if row['COL1'] == 'b' and not np.isnan(row['COL2']):
        return 2
    return 0

DF['COL3'] = DF.apply(row_handler, axis=1)

Which returns this:

>> print DF
  COL1  COL2  COL3
0    a     0     0
1    b   NaN     0
2    b     1     2

200

answered Oct 30 '22 07:10

lextoumbourou

You can use numpy.where with isin and notnull:

DF['COL3'] = np.where((DF['COL1'].isin(['b'])) &(DF['COL2'].notnull()), 2, 0)
print DF 


  COL1  COL2  COL3
0    a     0     0
1    b   NaN     0
2    b     1     2

answered Oct 30 '22 08:10

jezrael

Related questions
                            
                                Google Authenticator code does not match server generated code
                            
                                How to install igraph for python on windows
                            
                                Cannot load main class from JAR file in Spark Submit
                            
                                How to run Odoo ORM methods in the python console?
                            
                                Find all occurrences of integer within text in Python
                            
                                Where to implement python classes in Django?
                            
                                Can't terminate a sudo process created with python, in Ubuntu 15.10
                            
                                Copying and renaming excel files with Python [duplicate]
                            
                                Group items of a list with a step size python?
                            
                                How do I set the output of exec to variable python?
                            
                                Python logging: can dictConfig be read from a file?
                            
                                Mock returning an ImportError when patching a module that's imported
                            
                                How to gather results from multiprocesses?
                            
                                Implementing XorShift the same in Java and Python
                            
                                Which special characters must be escaped when using Python regex module re?
                            
                                2D gaussian distribution does not sum to one?
                            
                                Scikit-learn: How to normalize row values horizontally?
                            
                                re.finditer() returning same value for start and end methods
                            
                                Advanced custom sort
                            
                                Should this method be a classmethod and why can't it access vars?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With