I have two columns as below: <pre class="prettyprint"><code>id, colA, colB 0, a, 13 1, a, 52 2, b, 16 3, a, 34 4, b, 946 etc... </code></pre> I am trying to create a third column, <code>colC</code>, that is <code>colB</code> if <code>colA == a</code>, otherwise <code>0</code>. This is what I was thinking, but it does not work: <pre class="prettyprint"><code>data[data['colA']=='a']['colC'] = data[data['colA']=='a']['colB'] </code></pre> I was also thinking about using <code>np.where()</code>, but I don't think that would work here. Any thoughts?

Use <code>loc</code> with a mask to assign: <pre class="prettyprint"><code>In [300]: df.loc[df['colA'] == 'a', 'colC'] = df['colB'] df['colC'] = df['colC'].fillna(0) df Out[300]: id colA colB colC 0 0 a 13 13 1 1 a 52 52 2 2 b 16 0 3 3 a 34 34 4 4 b 946 0 </code></pre> EDIT or use <code>np.where</code>: <pre class="prettyprint"><code>In [296]: df['colC'] = np.where(df['colA'] == 'a', df['colC'],0) df Out[296]: id colA colB colC 0 0 a 13 13 1 1 a 52 52 2 2 b 16 0 3 3 a 34 34 4 4 b 946 0 </code></pre>

pandas create one column equal to another if condition is satisfied

id, colA, colB
0, a, 13
1, a, 52
2, b, 16
3, a, 34
4, b, 946
etc...

I am trying to create a third column, colC, that is colB if colA == a, otherwise 0.

This is what I was thinking, but it does not work:

data[data['colA']=='a']['colC'] = data[data['colA']=='a']['colB']

I was also thinking about using np.where(), but I don't think that would work here.

Any thoughts?

953

asked Nov 12 '15 16:11

As3adTintin

1 Answers

Use loc with a mask to assign:

In [300]:
df.loc[df['colA'] == 'a', 'colC'] = df['colB']
df['colC'] = df['colC'].fillna(0)
df

Out[300]:
   id colA  colB  colC
0   0    a    13    13
1   1    a    52    52
2   2    b    16     0
3   3    a    34    34
4   4    b   946     0

EDIT

or use np.where:

In [296]:
df['colC'] = np.where(df['colA'] == 'a', df['colC'],0)
df

Out[296]:
   id colA  colB  colC
0   0    a    13    13
1   1    a    52    52
2   2    b    16     0
3   3    a    34    34
4   4    b   946     0

165

answered Oct 19 '22 16:10

EdChum

Related questions
                            
                                Combinations of MultiIndex levels which occur in a DataFrame
                            
                                Accessing serializer instances in nested serializer's field
                            
                                Getting the date of the last day of this [week/month/quarter/year]
                            
                                How to use psycopg2 connection string with variables?
                            
                                Assign value to a list using slice notation with assignee [duplicate]
                            
                                Round off floating point values in dict
                            
                                Python 3.4 lxml.etree: Start tag expected, '<' not found, line 1, column 1
                            
                                how Python cvxopt solvers qp basically works
                            
                                Is there a python construct that is a dummy function?
                            
                                Plot semi transparent contour plot over image file using matplotlib
                            
                                Comparing first element of the consecutive lists of tuples in Python
                            
                                pandas how to convert all the string value to float
                            
                                Removing first elements of tuples in a list
                            
                                retrieve intermediate features from a pipeline in Scikit (Python)
                            
                                VisibleDeprecationWarning: boolean index did not match indexed array along dimension 1; dimension is 2 but corresponding boolean dimension is 1
                            
                                Django how to use the ``receiver`` decorator on a class instead on a function
                            
                                Seaborn PairGrid: show axes labels for each subplot
                            
                                Pyspark .toPandas() results in object column where expected numeric one
                            
                                How to create a very simple DNS server using Python?
                            
                                efficient concatenation of lists in pandas series

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pandas create one column equal to another if condition is satisfied

Tags:

python

pandas

if-statement

As3adTintin

People also ask

1 Answers

EdChum

Recent Activity

Donate For Us