In <code>Python</code>, one can get the counts of values in a list by using <code>Series.value_counts()</code>: <pre class="prettyprint"><code>import pandas as pd df = pd.DataFrame() df['x'] = ['a','b','b','c','c','d'] df['y'] = list(range(1,7)) df['x'].value_counts() c 2 b 2 a 1 d 1 Name: x, dtype: int64 </code></pre> In <code>R</code>, I have to use three separate commands. <pre class="prettyprint"><code>df <- tibble(x=c('a','b','b','c','c','d'), y=1:6) df %>% group_by(x) %>% summarise(n=n()) %>% arrange(desc(n)) x n b 2 c 2 a 1 d 1 </code></pre> Is there a shorter / more idiomatic way of doing this in R? Or am I better off writing a custom function?

The tidyverse has <code>dplyr::count</code>, which is a shortcut for 'group_by' and 'summarize' to get counts. <pre class="prettyprint"><code>df <- tibble(x=c('a','b','b','c','c','d'), y=1:6) dplyr::count(df, x, sort = TRUE) # A tibble: 4 x 2 x n <chr> <int> 1 b 2 2 c 2 3 a 1 4 d 1 </code></pre>

Analog to Pandas Series.value_counts() in R? [duplicate]

Tags:

pandas

r

dplyr

In Python, one can get the counts of values in a list by using Series.value_counts():

import pandas as pd

df = pd.DataFrame()
df['x'] = ['a','b','b','c','c','d']
df['y'] = list(range(1,7))

df['x'].value_counts()

c    2
b    2
a    1
d    1
Name: x, dtype: int64

In R, I have to use three separate commands.

df <- tibble(x=c('a','b','b','c','c','d'), y=1:6)

df %>% group_by(x) %>% summarise(n=n()) %>% arrange(desc(n))

x   n
b   2
c   2
a   1
d   1

Is there a shorter / more idiomatic way of doing this in R? Or am I better off writing a custom function?

826

asked Feb 27 '20 20:02

max

1 Answers

The tidyverse has dplyr::count, which is a shortcut for 'group_by' and 'summarize' to get counts.

df <- tibble(x=c('a','b','b','c','c','d'), y=1:6)

dplyr::count(df, x, sort = TRUE)

# A tibble: 4 x 2
  x         n
  <chr> <int>
1 b         2
2 c         2
3 a         1
4 d         1

187

answered Sep 27 '22 16:09

rpolicastro

Related questions
                            
                                Shade background of a ggplot chart using geom_rect with categorical variables
                            
                                R- how to conditionally remove first row of group_by
                            
                                Add directlabels to geom_smooth rather than geom_line
                            
                                What is the difference between paste/paste0 and str_c?
                            
                                How to format a difftime object to a string with HH:MM:SS
                            
                                ggplot aes_string doesn't work with spaces
                            
                                rmarkdown & kable/kableextra: Printing % symbol in Table when using escape = F
                            
                                select non-missing variables in a purrr loop
                            
                                Add space above y-axis without expand()
                            
                                Aligning axes of R plots on one side of a grid together
                            
                                Using case_when() to assign two new columns, instead of one
                            
                                R shiny datatable pagination and show all rows as options
                            
                                How to use rlang operators in a package?
                            
                                What is the equivalent of "everything()" operator in "data.table"? [duplicate]
                            
                                How to change the colour of the printed output in base R?
                            
                                How to code elementary symmetric polynomials in R
                            
                                Making variables immutable in R
                            
                                Add a MS Word Comment via Rmarkdown
                            
                                ggplot2 change fill for color legend when fill also used in aesthetic
                            
                                How can I keep pivot_wider() from dropping factor levels in names?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With