Simple method of counting non-NAs in column of data String [duplicate]

Tags:

r

na

I am trying to find a simple way of counting the non missing cases in a column of a data frame. I have used the function:

foo<- function(x) { sum(!is.na(x)) }

and then apply it to a data frame via sapply()

stats$count <- sapply(OldExaminee, foo2, simplify=T)

Although this is working fine, I am just in disbelieve that there isn't a simpler way of counting, i.e. something in the base set of function.

Any ideas?

680

asked Mar 29 '13 13:03

SprengMeister

1 Answers

For a data.frame you can get it using colSums and is.na:

set.seed(45)
df <- data.frame(matrix(sample(c(NA,1:5), 50, replace=TRUE), ncol=5))
#    X1 X2 X3 X4 X5
# 1   3  2 NA  2 NA
# 2   1  5  1  1  4
# 3   1  1  3  2  3
# 4   2  2  3  5  3
# 5   2  2  5  2  2
# 6   1  2 NA  3  3
# 7   1  5  5  5  2
# 8   3 NA  4  1  5
# 9   1  2  3 NA  1
# 10 NA  1  1  2  2

colSums(!is.na(df))
# X1 X2 X3 X4 X5 
#  9  9  8  9  9

167

answered Oct 28 '22 16:10

Arun

Related questions
                            
                                Container is running beyond virtual memory limits
                            
                                Split a character to letters and numbers
                            
                                in R, check if string appears in row of dataframe (in any column)
                            
                                how to comment out R code blocks in R markdown?
                            
                                How to find cumulative variance or standard deviation in R
                            
                                Using quotations inside mutate: an alternative to mutate_(.dots = ...)
                            
                                Python pandas equivalent to R's group_by, mutate, and ifelse
                            
                                Count by factor in ggplot2 chart
                            
                                R: how to merge two matrix according to their column and row names?
                            
                                How can I read multiple files from multiple directories into R for processing?
                            
                                Rounding numbers in R to specified number of digits
                            
                                rle-like function that catches "run" of adjacent integers
                            
                                faster than scan() with Rcpp?
                            
                                Creating regular 15-minute time-series from irregular time-series
                            
                                Is there any way to use the Identify command with ggplot 2?
                            
                                Install R Packages without internet [duplicate]
                            
                                Histogram with "negative" logarithmic scale in R
                            
                                In R, Merge two data frames, fill down the blanks
                            
                                split string with regex
                            
                                Why doesn't the plyr package use my parallel backend?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With