If I do <pre class="prettyprint"><code>mt = mobile.PattLen.value_counts() # sort True by default </code></pre> I get <pre class="prettyprint"><code>4 2831 3 2555 5 1561 [...] </code></pre> If I do <pre class="prettyprint"><code>mt = mobile.PattLen.value_counts(sort=False) </code></pre> I get <pre class="prettyprint"><code>8 225 9 120 2 1234 [...] </code></pre> What I am trying to do is get the output in 2, 3, 4 ascending order (the left numeric column). Can I change value_counts somehow or do I need to use a different function.

As hinted by normanius’ comment under jezrael’s answer : <pre class="prettyprint"><code>>>> df = pd.DataFrame({"a":[1,1,2,6,6,7,7,7,7,8]}) >>> df.a.value_counts()[df.a.unique()] 1 2 2 1 6 2 7 4 8 1 Name: a, dtype: int64 </code></pre> one can sort by any order by providing a custom index explicitely : <pre class="prettyprint"><code>>>> df.a.value_counts()[[8,7,6,2,1]] 8 1 7 4 6 2 2 1 1 2 Name: a, dtype: int64 >>> df.a.value_counts()[[1,8,6,2,7]] 1 2 8 1 6 2 2 1 7 4 Name: a, dtype: int64 </code></pre> This is of particular interest for plotting categorical data : <pre class="prettyprint"><code>>>> df.a.value_counts()[['hourly','daily','weekly','monthly']].plot(type="bar") </code></pre> Anecdotically, it can be used to remove some entries or to make others appear several times : <pre class="prettyprint"><code>>>> df.a.value_counts()[[1,1,1,8]] 1 2 1 2 1 2 8 1 Name: a, dtype: int64 </code></pre>

changing sort in value_counts

Tags:

python

pandas

dataframe

If I do

mt = mobile.PattLen.value_counts()   # sort True by default

I get

4    2831 3    2555  5    1561 [...]

If I do

mt = mobile.PattLen.value_counts(sort=False)

I get

8    225 9    120 2   1234  [...]

What I am trying to do is get the output in 2, 3, 4 ascending order (the left numeric column). Can I change value_counts somehow or do I need to use a different function.

278

asked May 08 '17 19:05

Mark Ginsburg

2 Answers

I think you need sort_index, because the left column is called index. The full command would be mt = mobile.PattLen.value_counts().sort_index(). For example:

mobile = pd.DataFrame({'PattLen':[1,1,2,6,6,7,7,7,7,8]}) print (mobile)    PattLen 0        1 1        1 2        2 3        6 4        6 5        7 6        7 7        7 8        7 9        8  print (mobile.PattLen.value_counts()) 7    4 6    2 1    2 8    1 2    1 Name: PattLen, dtype: int64   mt = mobile.PattLen.value_counts().sort_index() print (mt) 1    2 2    1 6    2 7    4 8    1 Name: PattLen, dtype: int64

answered Nov 16 '22 01:11

jezrael

As hinted by normanius’ comment under jezrael’s answer :

>>> df = pd.DataFrame({"a":[1,1,2,6,6,7,7,7,7,8]}) >>> df.a.value_counts()[df.a.unique()] 1    2 2    1 6    2 7    4 8    1 Name: a, dtype: int64

one can sort by any order by providing a custom index explicitely :

>>> df.a.value_counts()[[8,7,6,2,1]] 8    1 7    4 6    2 2    1 1    2 Name: a, dtype: int64 >>> df.a.value_counts()[[1,8,6,2,7]] 1    2 8    1 6    2 2    1 7    4 Name: a, dtype: int64

This is of particular interest for plotting categorical data :

>>> df.a.value_counts()[['hourly','daily','weekly','monthly']].plot(type="bar")

Anecdotically, it can be used to remove some entries or to make others appear several times :

>>> df.a.value_counts()[[1,1,1,8]] 1    2 1    2 1    2 8    1 Name: a, dtype: int64

answered Nov 16 '22 01:11

Skippy le Grand Gourou

Related questions
                            
                                Why are NumPy arrays so fast?
                            
                                Using Django database layer outside of Django?
                            
                                Could not find library geos_c or load any of its variants
                            
                                How to create a fix size list in python?
                            
                                WTForms: Install 'email_validator' for email validation support
                            
                                How to read datetime back from sqlite as a datetime instead of string in Python?
                            
                                Concatenate two NumPy arrays vertically
                            
                                Selenium Webdriver finding an element in a sub-element
                            
                                Python, TypeError: unhashable type: 'list'
                            
                                Pandas Plotting with Multi-Index
                            
                                What does the c underscore expression `c_` do exactly?
                            
                                How do I run tox in a project that has no setup.py?
                            
                                Slice Pandas dataframe by index values that are (not) in a list
                            
                                Regular expression: match start or whitespace
                            
                                Keep other columns when doing groupby
                            
                                How to change index of a for loop?
                            
                                install beautiful soup using pip [duplicate]
                            
                                datetime from string in Python, best-guessing string format
                            
                                How can I access global variable inside class in Python
                            
                                Barchart with vertical labels in python/matplotlib

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With