Remove columns that have only a unique value

Question

I want to remove columns that have only a unique value.

First, I try it for a single column and it works:

data %/% 
  select_if(length(unique(data$policy_id)) > 1)

then I try it for multiple columns as below:

data %/% 
  select_if(length(unique(data[, c("policy_date", "policy_id"])) > 1)

but it does not work. I think it is a conceptual mistake due to my lack of experience.

thanks in advance

Allan Cameron · Accepted Answer

You can use select(where()).

Suppose I have a data frame like this:

df <- data.frame(A = LETTERS[1:5], B = 1:5, C = 2)

df
#>   A B C
#> 1 A 1 2
#> 2 B 2 2
#> 3 C 3 2
#> 4 D 4 2
#> 5 E 5 2

Then I can do:

df %>% select(where(~ n_distinct(.) > 1))

#>   A B
#> 1 A 1
#> 2 B 2
#> 3 C 3
#> 4 D 4
#> 5 E 5

ThomasIsCoding · Answer

Some base R options:

subset(df,select = lengths(sapply(df,unique))>1)

Filter(function(x) length(unique(x))>1,df)

Donate For Us