I would like to identify binary columns in a data.frame.
For example, this table
my.table <-read.table(text="a,b,c
0,2,0
0.25,1,1
1,0,0", header=TRUE, as.is=TRUE,sep = ",")
would give FALSE, FALSE, TRUE
apply(my.table,2,function(x) { all(x %in% 0:1) })
(or
apply(my.table,2,function(x) { all(na.omit(x) %in% 0:1) })
if you want to allow for NA
values)
If you want to accept binary columns with NA in them, the following should do the trick:
is.binary <- function(v) {
x <- unique(v)
length(x) - sum(is.na(x)) == 2L
}
my.table <- data.frame(a=11:15, b=c(T,F,T,NA,T), c=c('foo',NA,'bar','bar','foo'))
vapply(my.table, is.binary, logical(1))
# a b c
#FALSE TRUE TRUE
...or if you only accept 0,1,NA:
is.binary <- function(v) {
x <- unique(v)
length(x) - sum(is.na(x)) == 2L && all(x[1:2] == 0:1)
}
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With