Median of pandas dataframe column

Tags:

I have a DataFrame df:

name   count    
aaaa   2000    
bbbb   1900    
cccc    900    
dddd    500    
eeee    100

I would like to look at the rows that are within a factor of 10 from the median of the count column.

I tried df['count'].median() and got the median. But don't know how to proceed further. Can you suggest how I could use pandas/numpy for this.

Expected Output :

name count distance from median

aaaa  2000   *****

I can use any measure as the distance from median (absolute deviation from median, quantiles etc.).

327

asked Apr 21 '15 16:04

Ssank

2 Answers

If you're looking for how to calculate the Median Absolute Deviation -

In [1]: df['dist'] = abs(df['count'] - df['count'].median())

In [2]: df
Out[2]:
   name  count  dist
0  aaaa   2000  1100
1  bbbb   1900  1000
2  cccc    900     0
3  dddd    500   400
4  eeee    100   800

In [3]: df['dist'].median()
Out[3]: 800.0

140

answered Sep 28 '22 08:09

ComputerFellow

If you want to see the median, you can use df.describe(). The 50% value is the median.

answered Sep 28 '22 10:09

Marjan Alavi

Related questions
                            
                                plot vlines with matplotlib.pyplot
                            
                                Matplotlib: show labels for minor ticks also
                            
                                OpenCV cv2.fillPoly vs. cv2.fillConvexPoly: expected data type for array of polygon vertices?
                            
                                python BeautifulSoup get select.value not text
                            
                                What is the difference if I don't use stdout=subprocess.PIPE in subprocess.Popen()?
                            
                                What is the best way to save image metadata alongside a tif?
                            
                                Flask view raises TypeError: 'bool' object is not callable
                            
                                Improve current implementation of a setInterval
                            
                                Formatting datetime xlabels in matplotlib (pandas df.plot() method)
                            
                                Getting captured group in one line
                            
                                Create Contour Plot from Pandas Groupby Dataframe
                            
                                Django Admin without Authentication
                            
                                How can I override the static file handler in Flask?
                            
                                How to use colormaps to color plots of Pandas DataFrames
                            
                                text/event-stream recognised as a download
                            
                                How to restore python on OS X Yosemite after I've deleted something?
                            
                                Psycopg2 "copy_from" command, possible to ignore delimiter in quote (getting error)?
                            
                                scrapy error :exceptions.ValueError: Missing scheme in request url:
                            
                                Python Selenium: How to check whether the WebDriver did quit()?
                            
                                Subprocess in Python: File Name too long

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Median of pandas dataframe column

Tags:

python

pandas

r

numpy

Ssank

People also ask

2 Answers

ComputerFellow

Marjan Alavi

Recent Activity

Donate For Us