Count number of occurences for each unique value

People also ask

How do you count the number of unique values?

You can use the combination of the SUM and COUNTIF functions to count unique values in Excel. The syntax for this combined formula is = SUM(IF(1/COUNTIF(data, data)=1,1,0)). Here the COUNTIF formula counts the number of times each value in the range appears. The resulting array looks like {1;2;1;1;1;1}.

How do you count unique values in pandas?

You can use the nunique() function to count the number of unique values in a pandas DataFrame.

How do I count unique text values in Excel?

Count Unique Text Values in ExcelEnter the formula =SUM(IF(ISTEXT(range)*COUNTIF(range,range)=1,1,0)) in the destination cell and press Ctrl+Shift+Enter. The range denotes the start and end cells that house the elements. From the general formula, we have added the ISTEXT element to find the unique text values.

Perhaps table is what you are after?

dummyData = rep(c(1,2, 2, 2), 25)

table(dummyData)
# dummyData
#  1  2 
# 25 75

## or another presentation of the same data
as.data.frame(table(dummyData))
#    dummyData Freq
#  1         1   25
#  2         2   75

If you have multiple factors (= a multi-dimensional data frame), you can use the dplyr package to count unique values in each combination of factors:

library("dplyr")
data %>% group_by(factor1, factor2) %>% summarize(count=n())

It uses the pipe operator %>% to chain method calls on the data frame data.

It is a one-line approach by using aggregate.

> aggregate(data.frame(count = v), list(value = v), length)

  value count
1     1    25
2     2    75

table() function is a good way to go, as Chase suggested. If you are analyzing a large dataset, an alternative way is to use .N function in datatable package.

Make sure you installed the data table package by

install.packages("data.table")

Code:

# Import the data.table package
library(data.table)

# Generate a data table object, which draws a number 10^7 times  
# from 1 to 10 with replacement
DT<-data.table(x=sample(1:10,1E7,TRUE))

# Count Frequency of each factor level
DT[,.N,by=x]

To get an un-dimensioned integer vector that contains the count of unique values, use c().

dummyData = rep(c(1, 2, 2, 2), 25) # Chase's reproducible data
c(table(dummyData)) # get un-dimensioned integer vector
 1  2 
25 75

str(c(table(dummyData)) ) # confirm structure
 Named int [1:2] 25 75
 - attr(*, "names")= chr [1:2] "1" "2"

This may be useful if you need to feed the counts of unique values into another function, and is shorter and more idiomatic than the t(as.data.frame(table(dummyData))[,2] posted in a comment to Chase's answer. Thanks to Ricardo Saporta who pointed this out to me here.

length(unique(df$col)) is the most simple way I can see.

Related questions
                            
                                Extract a substring according to a pattern
                            
                                Remove duplicated rows using dplyr
                            
                                Ignore outliers in ggplot2 boxplot
                            
                                How to delete a row by reference in data.table?
                            
                                R and version control for the solo data analyst [closed]
                            
                                How to organize large R programs?
                            
                                Add legend to ggplot2 line plot
                            
                                Painless way to install a new version of R?
                            
                                How to format a number as percentage in R?
                            
                                Why were pandas merges in python faster than data.table merges in R in 2012?
                            
                                Add a common Legend for combined ggplots
                            
                                ggplot2 plot without axes, legends, etc
                            
                                Better explanation of when to use Imports/Depends
                            
                                Fastest way to replace NAs in a large data.table
                            
                                Global variables in R
                            
                                Insert picture/table in R Markdown [closed]
                            
                                How to generate a number of most distinctive colors in R?
                            
                                Check for installed packages before running install.packages() [duplicate]
                            
                                Is there a way to make R beep/play a sound at the end of a script?
                            
                                promise already under evaluation: recursive default argument reference or earlier problems?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Count number of occurences for each unique value

Tags:

r

unique

count

People also ask

Recent Activity

Donate For Us