I have a dataframe like this: <pre class="prettyprint"><code> match_id inn1 bat bowl runs1 inn2 runs2 is_score_chased 1 1 KKR RCB 222 2 82 1 2 1 CSK KXIP 240 2 207 1 8 1 CSK MI 208 2 202 1 9 1 DC RR 214 2 217 1 33 1 KKR DC 204 2 181 1 </code></pre> Now i want to change the values in is_score_chased column by comparing the values in runs1 and runs2 . If runs1>runs2, then the corresponding value in the row should be 'yes' else it should be no. I tried the following code: <pre class="prettyprint"><code>for i in (high_scores1): if(high_scores1['runs1']>=high_scores1['runs2']): high_scores1['is_score_chased']='yes' else: high_scores1['is_score_chased']='no' </code></pre> But it didn't work. How do i change the values in the column?

You can more easily use <code>np.where</code>. <pre class="prettyprint"><code>high_scores1['is_score_chased'] = np.where(high_scores1['runs1']>=high_scores1['runs2'], 'yes', 'no') </code></pre> Typically, if you find yourself trying to iterate explicitly as you were to set a column, there is an abstraction like <code>apply</code> or <code>where</code> which will be both faster and more concise.

How to compare two columns of the same dataframe?

Tags:

python

pandas

dataframe

I have a dataframe like this:

 match_id inn1  bat  bowl  runs1 inn2   runs2   is_score_chased
    1     1     KKR  RCB    222  2      82          1
    2     1     CSK  KXIP   240  2      207         1
    8     1     CSK  MI     208  2      202         1
    9     1     DC   RR     214  2      217         1
   33     1     KKR  DC     204  2      181         1

Now i want to change the values in is_score_chased column by comparing the values in runs1 and runs2 . If runs1>runs2, then the corresponding value in the row should be 'yes' else it should be no. I tried the following code:

for i in (high_scores1):
  if(high_scores1['runs1']>=high_scores1['runs2']):
      high_scores1['is_score_chased']='yes'
  else:
      high_scores1['is_score_chased']='no'

But it didn't work. How do i change the values in the column?

902

asked Feb 23 '17 01:02

user517696

1 Answers

You can more easily use np.where.

high_scores1['is_score_chased'] = np.where(high_scores1['runs1']>=high_scores1['runs2'], 
                                           'yes', 'no')

Typically, if you find yourself trying to iterate explicitly as you were to set a column, there is an abstraction like apply or where which will be both faster and more concise.

answered Oct 13 '22 17:10

miradulo

Related questions
                            
                                pandas qcut not putting equal number of observations into each bin
                            
                                Is there any method to get the number of rows and columns present in .xlsx sheet using openpyxl?
                            
                                Short way to serialize datetime with marshmallow
                            
                                GitPython create and push tags
                            
                                How to get value by multi-index with python pandas?
                            
                                Density map (heatmaps) in matplotlib
                            
                                pandas equivalent for R dcast
                            
                                Logging to python file doesn't overwrite file when using the mode='w' argument to FileHandler
                            
                                Wait for complete deletion of a DynamoDB table using boto3
                            
                                Imputer on some Dataframe columns in Python
                            
                                How to get around this memoryview error in numpy?
                            
                                Compress/Zip numpy arrays in Memory
                            
                                Build 2 lists in one go while reading from file, pythonically
                            
                                How does "tf.train.replica_device_setter" work?
                            
                                How to schedule and cancel tasks with asyncio
                            
                                Numpy: how I can determine if all elements of numpy array are equal to a number
                            
                                Django migrate error : TypeError expected string or bytes-like object
                            
                                Retrieve list of training features names from classifier
                            
                                What is the difference between import numpy and import math [duplicate]
                            
                                pytest monkeypatch.setattr() inside of test class method

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With