Get first and second highest values in pandas columns

Tags:

I am using pandas to analyse some election results. I have a DF, Results, which has a row for each constituency and columns representing the votes for the various parties (over 100 of them):

In[60]: Results.columns Out[60]:  Index(['Constituency', 'Region', 'Country', 'ID', 'Type', 'Electorate',        'Total', 'Unnamed: 9', '30-50', 'Above',        ...        'WP', 'WRP', 'WVPTFP', 'Yorks', 'Young', 'Zeb', 'Party', 'Votes',        'Share', 'Turnout'],       dtype='object', length=147)

So...

In[63]: Results.head() Out[63]:                           Constituency    Region   Country         ID    Type  \ PAID                                                                            1                            Aberavon     Wales     Wales  W07000049  County    2                           Aberconwy     Wales     Wales  W07000058  County    3                      Aberdeen North  Scotland  Scotland  S14000001   Burgh    4                      Aberdeen South  Scotland  Scotland  S14000002   Burgh    5     Aberdeenshire West & Kincardine  Scotland  Scotland  S14000058  County           Electorate  Total  Unnamed: 9  30-50  Above    ...     WP  WRP  WVPTFP  \ PAID                                                 ...                        1          49821  31523         NaN    NaN    NaN    ...    NaN  NaN     NaN    2          45525  30148         NaN    NaN    NaN    ...    NaN  NaN     NaN    3          67745  43936         NaN    NaN    NaN    ...    NaN  NaN     NaN    4          68056  48551         NaN    NaN    NaN    ...    NaN  NaN     NaN    5          73445  55196         NaN    NaN    NaN    ...    NaN  NaN     NaN           Yorks  Young  Zeb  Party  Votes     Share   Turnout   PAID                                                        1       NaN    NaN  NaN    Lab  15416  0.489040  0.632725   2       NaN    NaN  NaN    Con  12513  0.415052  0.662230   3       NaN    NaN  NaN    SNP  24793  0.564298  0.648550   4       NaN    NaN  NaN    SNP  20221  0.416490  0.713398   5       NaN    NaN  NaN    SNP  22949  0.415773  0.751528    [5 rows x 147 columns]

The per-constituency results for each party are given in the columns Results.ix[:, 'Unnamed: 9': 'Zeb']

I can find the winning party (i.e. the party which polled highest number of votes) and the number of votes it polled using:

RawResults = Results.ix[:, 'Unnamed: 9': 'Zeb'] Results['Party'] = RawResults.idxmax(axis=1) Results['Votes'] = RawResults.max(axis=1).astype(int)

But, I also need to know how many votes the second-place party got (and ideally its index/name). So is there any way in pandas to return the second highest value/index in a set of columns for each row?

544

asked Aug 21 '16 16:08

TimGJ

1 Answers

To get the highest values of a column, you can use nlargest() :

df['High'].nlargest(2)

The above will give you the 2 highest values of column High.

You can also use nsmallest() to get the lowest values.

answered Sep 21 '22 03:09

Pedro Lobito

Related questions
                            
                                Django default=timezone.now() saves records using "old" time
                            
                                ./xx.py: line 1: import: command not found
                            
                                Find coordinate of the closest point on polygon in Shapely
                            
                                Count and summation of positive and negative number sequences
                            
                                python - django: why am I getting this error: AttributeError: 'method_descriptor' object has no attribute 'today'?
                            
                                Quickest way to find the nth largest value in a numpy Matrix
                            
                                Get the value of a checkbox in Flask
                            
                                How do you alias a type in Python?
                            
                                How to generate all combination from values in dict of lists in Python
                            
                                How to get the localStorage with Python and Selenium WebDriver
                            
                                Python's bz2 module not compiled by default
                            
                                Why am I getting this error in python ? (httplib)
                            
                                Finding the length of an mp3 file
                            
                                Bits list to integer in Python
                            
                                Display message when hovering over something with mouse cursor in Python
                            
                                How can I overwrite/print over the current line in Windows command line?
                            
                                How to parse a RFC 2822 date/time into a Python datetime?
                            
                                Checking on a thread / remove from list
                            
                                How to get the Tkinter Label text?
                            
                                PyCharm can not resolve PyGObject 3.0, but code runs fine

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Get first and second highest values in pandas columns

Tags:

python

pandas

dataframe

numpy

TimGJ

People also ask

1 Answers

Pedro Lobito

Recent Activity

Donate For Us