I have a Dataframe like so: <pre class="prettyprint"><code> p_rel y_BET sq_resid 1 0.069370 41.184996 0.292942 2 0.116405 43.101090 0.010953 3 0.173409 44.727748 0.036832 4 0.225629 46.681293 0.540616 5 0.250682 46.980616 0.128191 6 0.294650 47.446113 0.132367 7 0.322530 48.078038 0.235047 </code></pre> How do I get rid of the fourth row because it has the max value of sq_resid? note: the max will change from dataset to dataset so just removing the 4th row isn't enough. I have tried several things such as I can remove the max value which leaves the dataframe like below but haven't been able to remove the whole row. <pre class="prettyprint"><code> p_rel y_BET sq_resid 1 0.069370 41.184996 0.292942 2 0.116405 43.101090 0.010953 3 0.173409 44.727748 0.036832 4 0.225629 46.681293 Nan 5 0.250682 46.980616 0.128191 6 0.294650 47.446113 0.132367 7 0.322530 48.078038 0.235047 </code></pre>

You could just filter the df like so: <pre class="prettyprint"><code>In [255]: df.loc[df['sq_resid']!=df['sq_resid'].max()] Out[255]: p_rel y_BET sq_resid 1 0.069370 41.184996 0.292942 2 0.116405 43.101090 0.010953 3 0.173409 44.727748 0.036832 5 0.250682 46.980616 0.128191 6 0.294650 47.446113 0.132367 </code></pre> or <code>drop</code> using <code>idxmax</code> which will return the label row of the max value: <pre class="prettyprint"><code>In [257]: df.drop(df['sq_resid'].idxmax()) Out[257]: p_rel y_BET sq_resid 1 0.069370 41.184996 0.292942 2 0.116405 43.101090 0.010953 3 0.173409 44.727748 0.036832 5 0.250682 46.980616 0.128191 6 0.294650 47.446113 0.132367 7 0.322530 48.078038 0.235047 </code></pre>

Drop pandas dataframe row based on max value of a column

Tags:

python

pandas

dataframe

numpy

I have a Dataframe like so:

      p_rel      y_BET  sq_resid
1  0.069370  41.184996  0.292942
2  0.116405  43.101090  0.010953
3  0.173409  44.727748  0.036832
4  0.225629  46.681293  0.540616
5  0.250682  46.980616  0.128191
6  0.294650  47.446113  0.132367
7  0.322530  48.078038  0.235047

How do I get rid of the fourth row because it has the max value of sq_resid? note: the max will change from dataset to dataset so just removing the 4th row isn't enough.

I have tried several things such as I can remove the max value which leaves the dataframe like below but haven't been able to remove the whole row.

  p_rel      y_BET  sq_resid
1  0.069370  41.184996  0.292942
2  0.116405  43.101090  0.010953
3  0.173409  44.727748  0.036832
4  0.225629  46.681293  Nan
5  0.250682  46.980616  0.128191
6  0.294650  47.446113  0.132367
7  0.322530  48.078038  0.235047

818

asked Jan 29 '16 15:01

Fungie

1 Answers

You could just filter the df like so:

In [255]:
df.loc[df['sq_resid']!=df['sq_resid'].max()]

Out[255]:
      p_rel      y_BET  sq_resid
1  0.069370  41.184996  0.292942
2  0.116405  43.101090  0.010953
3  0.173409  44.727748  0.036832
5  0.250682  46.980616  0.128191
6  0.294650  47.446113  0.132367

or drop using idxmax which will return the label row of the max value:

In [257]:
df.drop(df['sq_resid'].idxmax())

Out[257]:
      p_rel      y_BET  sq_resid
1  0.069370  41.184996  0.292942
2  0.116405  43.101090  0.010953
3  0.173409  44.727748  0.036832
5  0.250682  46.980616  0.128191
6  0.294650  47.446113  0.132367
7  0.322530  48.078038  0.235047

121

answered Oct 30 '22 21:10

EdChum

Related questions
                            
                                how to cast a variable in xpath python
                            
                                Django: Dynamically set SITE_ID in settings.py based on URL?
                            
                                Would it be pythonic to use `or`, similar to how PHP would use `or die()`?
                            
                                Partial function application with the original docstring in Python?
                            
                                How to produce a "Callable function"
                            
                                Python Multiply tuples of equal length
                            
                                Running command line commands within PyCharm
                            
                                Extract substring from string in dataframe
                            
                                Better way to add constant column to pandas data frame
                            
                                How does the 'with' statement work in Flask (Jinja2)?
                            
                                How is it possible to evaluate +5 in Python?
                            
                                What is a reliable isnumeric() function for python 3?
                            
                                Add object to start of dictionary
                            
                                scipy sparse matrix: remove the rows whose all elements are zero
                            
                                Using IFNULL in sqlalchemy core
                            
                                Sum multiple values for same key in lists using python
                            
                                How to install SIP & PyQT on windows 7
                            
                                unsafe use of relative rpath libboost.dylib when making boost.python helloword demo?
                            
                                Add a dictionary to a `set()` with union
                            
                                passing arguments to functions in python using argv

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With