Add a column with count of NAs and Mean

Tags:

I have a data frame and I need to add another column to it which shows the count of NAs in all the other columns for that row and also the mean of the non-NA values. I think it can be done in dplyr.

> df1 <- data.frame(a = 1:5, b = c(1,2,NA,4,NA), c = c(NA,2,3,NA,NA))
> df1
  a  b  c
1 1  1 NA
2 2  2  2
3 3 NA  3
4 4  4 NA
5 5 NA NA

I want to mutate another column which counts the number of NAs in that row and another column which shows the mean of all the NON-NA values in that row.

534

asked Feb 16 '16 21:02

sachinv

1 Answers

library(dplyr)

count_na <- function(x) sum(is.na(x))    

df1 %>%
  mutate(means = rowMeans(., na.rm = T),
         count_na = apply(., 1, count_na))

#### ANSWER FOR RADEK ####
elected_cols <- c('b', 'c')

df1 %>%
  mutate(means = rowMeans(.[elected_cols], na.rm = T),
         count_na = apply(.[elected_cols], 1, count_na))

answered Oct 11 '22 08:10

maloneypatr

Related questions
                            
                                Spreading a two column data frame with tidyr
                            
                                Why does rendering a pdf from rmarkdown require closing rstudio between renders?
                            
                                Labelling logarithmic scale display in R
                            
                                Why is [- subsetting (i.e. deletion) of columns not possible with names?
                            
                                Error in eval(expr, envir, enclos) : object not found
                            
                                Convert std::vector to Rcpp matrix
                            
                                R: Using equation with natural logarithm in nls
                            
                                Calculating the difference between consecutive rows by group using dplyr?
                            
                                edit strip size ggplot2
                            
                                How do I filter a range of numbers in R? [duplicate]
                            
                                How can I use dplyr to apply a function to all non-group_by columns?
                            
                                Shade region between two lines with ggplot
                            
                                How to compute ROC and AUC under ROC after training using caret in R?
                            
                                What is “object of type ‘closure’ is not subsettable” error in Shiny?
                            
                                if not conditions in R?
                            
                                What's the R equivalent of SQL's LIKE 'description%' statement?
                            
                                apply a function over groups of columns
                            
                                Subset rows according to a range of time
                            
                                R: sample() command subject to a constraint
                            
                                Select row with most recent date by group

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Add a column with count of NAs and Mean

Tags:

r

na

dplyr

sachinv

People also ask

1 Answers

maloneypatr

Recent Activity

Donate For Us