I am trying to select only factor columns from my data frame. Example is below:
bank[,apply(bank[,names(bank)!="id"], is.factor)]
But the code behaves strangely. Step by step:
sapply(bank[,names(bank)!="id"], is.factor)
I get:
age sex region income married children car
FALSE TRUE TRUE FALSE TRUE FALSE TRUE
save_act current_act mortgage pep ageBin
TRUE TRUE TRUE TRUE TRUE
Looks OK. Now, I assume that I just pass this matrix of TRUE/FALSE to the next step and get only the columns I need:
bank[,sapply(bank[,names(bank)!="id"], is.factor)]
But as result I get all the same columns as in original bank dataframe. Nothing is filtered out. I tried it one way or another but can't find a solution. Any advice on what I am doing wrong?
#DATA
df = mtcars
colnames(df) = gsub("mpg","id",colnames(df))
df$am = as.factor(df$am)
df$gear = as.factor(df$gear)
df$id = as.factor(df$id)
#Filter out 'id' after selecting factors
df[,sapply(df, is.factor) & colnames(df) != "id"]
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With