R summarize unique values across columns based on values from one column

Tags:

I want to know the total number of unique values for each column based on the values of var_1.

For example:

Test <- data.frame(var_1 = c("a","a","a", "b", "b", "c", "c", "c", "c", "c"), var_2 = c("bl","bf","bl", "bl","bf","bl","bl","bf","bc", "bg" ), var_3 = c("cf","cf","eg", "cf","cf","eg","cf","dr","eg","fg"))

The results I am looking for would be based on the values in var_1 and should be:

var_1 var_2 var_3
a     2     2
b     2     1
c     3     4

However, after trying various methods (including apply and table) - aggregate has been the closest thing to what I am looking for, but this script results in a summary of the total number of entries for each value of var_1, but the total is not unique

agbyv1= aggregate(. ~ var_1, Test, length) 

var_1 var_2 var_3
a     3     3
b     2     2
c     5     5

I tried

unqbyv1= aggregate(. ~ var_1, Test, length(unique(x)))

but that didn't work.

Any help is greatly appreciated.

972

asked May 05 '15 18:05

Ina.Quest

1 Answers

Try

library(dplyr)
Test %>%
      group_by(var_1) %>% 
      summarise_each(funs(n_distinct(.)))

library(data.table)#v1.9.5+
setDT(Test)[, lapply(.SD, uniqueN), var_1]

If there are NAs

setDT(Test)[, lapply(.SD, function(x) uniqueN(na.omit(x))), var_1]

Or you can use aggregate. By default, the na.action=na.omit. So, we don't need any modifications.

aggregate(.~ var_1, Test, FUN=function(x) length(unique(x)) )

117

answered Oct 05 '22 21:10

akrun

Related questions
                            
                                ggplot2 : printing multiple plots in one page with a loop
                            
                                Rvest error: type 'externalptr'
                            
                                tbl_df and data.frame difference when using loops
                            
                                Weird lines appearing in the R graph
                            
                                Separate a column into multiple columns using tidyr::separate with sep=""
                            
                                How to drop columns in a nested data frame in R?
                            
                                Multiple series barplot
                            
                                Which selector to write in rvest package in R?
                            
                                R data.table replace NA with mean for numeric columns and most frequent value for nominal values
                            
                                Doing absolute descending sort of data.table through function?
                            
                                Efficient calling of F95 in R: use .Fortran or .Call?
                            
                                How to calculate dynamic panel models with lfe package
                            
                                Compiling RMarkdown with RStudio: why reading .RProfile?
                            
                                Count based on multiple conditions from other data.frame
                            
                                how to automatically update a slot of S4 class in R
                            
                                Subset n number of rows from a dataframe, based on a categorical variable, in R
                            
                                Icons as x-axis labels in R
                            
                                fit 2d surface using LOESS in R
                            
                                Building R packages with Packrat and AppVeyor
                            
                                Adding point and lines to 3D scatter plot in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

R summarize unique values across columns based on values from one column

Tags:

r

unique

aggregate

Ina.Quest

People also ask

1 Answers

akrun

Recent Activity

Donate For Us