I want to get the number of unique values in each of the columns of a data frame. Let's say I have the following data frame: <pre class="prettyprint"><code>DF <- data.frame(v1 = c(1,2,3,2), v2 = c("a","a","b","b")) </code></pre> then it should return that there are 3 distinct values for v1, and 2 for v2. I tried unique(DF), but it does not work as each rows are different.

Or using <code>unique</code>: <pre class="prettyprint"><code>rapply(DF,function(x)length(unique(x))) v1 v2 3 2 </code></pre>

<pre class="prettyprint"><code>sapply(DF, function(x) length(unique(x))) </code></pre>

In <code>dplyr</code>: <pre class="prettyprint"><code>DF %>% summarise_all(funs(n_distinct(.))) </code></pre>

Unique values in each of the columns of a data frame

Tags:

dataframe

r

I want to get the number of unique values in each of the columns of a data frame. Let's say I have the following data frame:

DF <- data.frame(v1 = c(1,2,3,2), v2 = c("a","a","b","b"))

then it should return that there are 3 distinct values for v1, and 2 for v2.

I tried unique(DF), but it does not work as each rows are different.

304

asked Nov 04 '13 05:11

Benoit_Plante

3 Answers

Or using unique:

rapply(DF,function(x)length(unique(x)))
v1 v2 
 3  2

183

answered Oct 10 '22 06:10

agstudy

sapply(DF, function(x) length(unique(x)))

answered Oct 10 '22 05:10

ben_says

In dplyr:

DF %>% summarise_all(funs(n_distinct(.)))

answered Oct 10 '22 07:10

leerssej

Related questions
                            
                                Mermaid diagram line break
                            
                                Understanding the differences between mclapply and parLapply in R
                            
                                Fetching UTF-8 text from MySQL in R returns "????"
                            
                                Fastest & most flexible way to chart over 2 million rows of flat file data?
                            
                                How to test if object is a vector
                            
                                How to set the tolerance of expect_equal in testthat framework
                            
                                Merge several data.frames into one data.frame with a loop
                            
                                Using png function not working when called within a function
                            
                                R convert zipcode or lat/long to county
                            
                                Which is the correct folder to store images used in vignettes for R packages ?
                            
                                Can I remove an element in ... (dot-dot-dot) and pass it on?
                            
                                How can I expand a vector into the arguments of a function in r?
                            
                                R data.table change R names
                            
                                Append a data frame to a list
                            
                                How do I use `[` correctly with (l|s)apply to select a specific column from a list of matrices?
                            
                                Warning: closing unused connection n
                            
                                Why is expand.grid faster than data.table 's CJ?
                            
                                Listing all files matching a full-path pattern in R
                            
                                Good Ways to Visualize Longitudinal Categorical Data in R
                            
                                two-way density plot combined with one way density plot with selected regions in r

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With