I have a Pandas dataframe in Python. The contents of the dataframe are from here. I modified the case of the first alphabet in the "Single" column slightly. Here is what I have: <pre class="prettyprint"><code>import pandas as pd df = pd.read_csv('test.csv') print df Position Artist Single Year Weeks 1 Frankie Laine I Believe 1953 18 weeks 2 Bryan Adams I Do It for You 1991 16 weeks 3 Wet Wet Wet love Is All Around 1994 15 weeks 4 Drake (feat. Wizkid & Kyla) One Dance 2016 15 weeks 5 Queen bohemian Rhapsody 1975/76 & 1991/92 14 weeks 6 Slim Whitman Rose Marie 1955 11 weeks 7 Whitney Houston i Will Always Love You 1992 10 weeks </code></pre> I would like to sort by the Single column in ascending order (a to z). When I run <pre class="prettyprint"><code>df.sort_values(by='Single',inplace=True) </code></pre> it seems that the sort is not able to combine upper and lowercase. Here is what I get: <pre class="prettyprint"><code>Position Artist Single Year Weeks 1 Frankie Laine I Believe 1953 18 weeks 2 Bryan Adams I Do It for You 1991 16 weeks 4 Drake (feat. Wizkid & Kyla) One Dance 2016 15 weeks 6 Slim Whitman Rose Marie 1955 11 weeks 5 Queen bohemian Rhapsody 1975/76 & 1991/92 14 weeks 7 Whitney Houston i Will Always Love You 1992 10 weeks 3 Wet Wet Wet love Is All Around 1994 15 weeks </code></pre> So, it is sorting by uppercase first and then performing a separate sort by lower case. I want a combined sort, regardless of the case of the starting alphabet in the Single column. The row with "bohemian Rhapsody" is in the wrong location after sorting. It should be first; instead it is appearing as the 5th row after the sort. Is there a way to do sort a Pandas DataFrame while ignoring the case of the text in the Single column?

You can convert all strings to upper/lower case and then call <code>argsort()</code> which gives the index value to reorder the data frame by Single ignoring the case: <pre class="prettyprint"><code>df.iloc[df.Single.str.lower().argsort()] </code></pre> <img src="https://i.stack.imgur.com/0S2VS.jpg" alt="enter image description here">

Pandas 1.1.0 introduced the <code>key</code> argument as a more intuitive way to achieve this: <pre class="prettyprint lang-py prettyprint-override"><code>df.sort_values(by='Single', inplace=True, key=lambda col: col.str.lower()) </code></pre>

Pandas DataFrame sort ignoring the case

Tags:

I have a Pandas dataframe in Python. The contents of the dataframe are from here. I modified the case of the first alphabet in the "Single" column slightly. Here is what I have:

import pandas as pd df = pd.read_csv('test.csv') print df  Position                       Artist                  Single               Year     Weeks        1                Frankie Laine               I Believe               1953  18 weeks        2                  Bryan Adams         I Do It for You               1991  16 weeks        3                  Wet Wet Wet      love Is All Around               1994  15 weeks        4  Drake (feat. Wizkid & Kyla)               One Dance               2016  15 weeks        5                        Queen       bohemian Rhapsody  1975/76 & 1991/92  14 weeks        6                 Slim Whitman              Rose Marie               1955  11 weeks        7              Whitney Houston  i Will Always Love You               1992  10 weeks

I would like to sort by the Single column in ascending order (a to z). When I run

df.sort_values(by='Single',inplace=True)

it seems that the sort is not able to combine upper and lowercase. Here is what I get:

Position                       Artist                  Single               Year     Weeks        1                Frankie Laine               I Believe               1953  18 weeks        2                  Bryan Adams         I Do It for You               1991  16 weeks        4  Drake (feat. Wizkid & Kyla)               One Dance               2016  15 weeks        6                 Slim Whitman              Rose Marie               1955  11 weeks        5                        Queen       bohemian Rhapsody  1975/76 & 1991/92  14 weeks        7              Whitney Houston  i Will Always Love You               1992  10 weeks        3                  Wet Wet Wet      love Is All Around               1994  15 weeks

So, it is sorting by uppercase first and then performing a separate sort by lower case. I want a combined sort, regardless of the case of the starting alphabet in the Single column. The row with "bohemian Rhapsody" is in the wrong location after sorting. It should be first; instead it is appearing as the 5th row after the sort.

Is there a way to do sort a Pandas DataFrame while ignoring the case of the text in the Single column?

602

asked Jan 15 '17 00:01

edesz

2 Answers

You can convert all strings to upper/lower case and then call argsort() which gives the index value to reorder the data frame by Single ignoring the case:

df.iloc[df.Single.str.lower().argsort()]

enter image description here

173

answered Nov 14 '22 04:11

Psidom

Pandas 1.1.0 introduced the key argument as a more intuitive way to achieve this:

df.sort_values(by='Single', inplace=True, key=lambda col: col.str.lower())

answered Nov 14 '22 02:11

RafG

Related questions
                            
                                CSS - How to have swiper slider arrows outside of slider that takes up 12 column row
                            
                                How to run asp.net mvc 4.5 in visual studio code editor?
                            
                                React & Enzyme: why isn't beforeEach() working?
                            
                                What's the difference between elb health check and ec2 health check?
                            
                                Convert JSON data from Request into Pandas DataFrame
                            
                                How to use warm_start
                            
                                Bash: usage of `true`
                            
                                How to change fragment with the Bottom Navigation Activity?
                            
                                RxJava 2 overriding IO scheduler in unit test
                            
                                'dotnet build' specify main method
                            
                                Vue ignore custom component tag
                            
                                C++ project with Bazel and GTest

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With