mean returns NaN besides na.rm= TRUE

Tags:

r

dplyr

Sample data

date        coins   
2013-10-01  NA      
2013-10-01  NA      
2013-10-01  NA      
2013-11-01  10      
2013-11-01  NA      
2013-11-01  20      
2013-11-01  30      
2013-11-01  40      
2013-12-30  NA      
2013-12-30  22      
2013-12-30  24
2013-12-30  25

What I want to do?

I want to calculate mean and median of the coins column, ignoring missing values.

What i have done so far?

Grouped the data on date variable by_date <- group_by(df, date)
Summarised data using:by_date %>% summarise_each_(funs(mean(., na.rm = TRUE), median(., na.rm=TRUE)), names(by_date)[2])

Question The results returned by summarise_each_ show NaN for date 2013-10-01. Does that mean the function is not ignoring missing values?

996

asked Feb 15 '16 15:02

Imran Ali

1 Answers

The problem here is that all the values for 2013-10-01 are NA, so there can't be a mean. The NaN is R trying to tell you this.

If you'd rather just not have 2013-10-01 show up in the summary, one option is to get rid of NA values upfront like this:

by_date<-group_by(df[!is.na(df$coins),],date)

108

answered Oct 02 '22 14:10

mrip

Related questions
                            
                                How to replace elements of a matrix in C++ with values from another matrix (using Rcpp)?
                            
                                How to get the exact value of factorial(100)
                            
                                How to find three consecutive rows with the same value
                            
                                Extract shapefile value to point with R
                            
                                Double Sapply nested function
                            
                                Extract the hierarchical structure of the nodes in a dendrogram or cluster
                            
                                How to force idle workers to take jobs in parallel R?
                            
                                Set seed with cv.glmnet paralleled gives different results in R
                            
                                assign colors to each level of factors in R figures
                            
                                Assigning groups using grepl with multiple inputs
                            
                                what is default color of smooth curve in ggplot2?
                            
                                Control layout when displaying a series of ggplot plots stored in a list
                            
                                Monte carlo integration not working?
                            
                                The xgboost package and the random forests regression
                            
                                How to background geom_vline and geom_hline in ggplot 2 in a bubble chart
                            
                                dplyr Update a cell in a data.frame
                            
                                Why do rbind() and do.call(rbind, ) return different results?
                            
                                Ways to improve for loop for matrix manipulations depending on another matrix
                            
                                Cannot Change the Version of R in RStudio
                            
                                How to reduce the resolution (Regrid) of netCDF using bi-linear interpolation in R?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With