R max function ignore NA

Tags:

3 Answers

You can use hablar::max_ which returns NA if all values are NA

apply(df, 1, function(x) hablar::max_(x[x!=9]))
#[1]  5 NA  7

data

df <- structure(list(age = c(5, NA, 9), marks = c(-5, NA, 7), story = c(2, 
9, NA)), row.names = c(NA, -3L), class = "data.frame")

df
#  age marks story
#1   5    -5     2
#2  NA    NA     9
#3   9     7    NA

answered Sep 20 '22 01:09

It seems that the problem has been pointed out in the comments already. Since some vectors contain only NAs, -Inf is reported, which I take from the comments you don't like. In this answer I would like to point out one possible way to tackle the issue, namely to built in a control statement (instead of overwritting -Inf after the fact, which is equally valid). For instance,

 my.max <- function(x) ifelse( !all(is.na(x)), max(x, na.rm=T), NA)

does this trick. If every (all) element in x is NA, then NA is returned, and the max otherwise. If you want any other value returned, just exchange NA for that value. You can also built this easily into your apply-function. E.g.

 maindata$max_pc_age <- apply(maindata[,c(paste("Q2",1:18,sep="_"))], 1, my.max)

I am still sometimes confused by R's NA and empty set treatment. Statements like test <- NA; test==NA will give NA as a result (instead of TRUE, as returned by is.na(test)), which is sometimes rationalized by saying that since the value is missing, how could you know that these two missing values are identical? In this case, however, max returns -Inf since it is given an empty set, which I think is not at all obvious. My experience is though that if strange and unexpected results pop up, NAs or empty sets are often involved.

answered Oct 21 '22 23:10

coffeinjunky

In cases like below:

df[2,2] <- NA
df[1,2] <- -5

apply(df, 1, function(x) max(x[x != 9],na.rm=TRUE))
#[1]    5 -Inf    7
#Warning message:
#In max(x[x != 9], na.rm = TRUE) :
#  no non-missing arguments to max; returning -Inf

You could do:

df1 <- df  
minVal <- min(df1[!is.na(df1)])-1

df1[is.na(df1)|df1==9] <- minVal
val <- do.call(`pmax`, df1)
val[val==minVal] <- NA
val
#[1]  5 NA  7

answered Oct 22 '22 00:10

akrun

Related questions
                            
                                dplyr / tidyevaluation: How to pass an expression in mutate as a string?
                            
                                Combining Rolling Origin Forecast Resampling and Group V-Fold Cross-Validation in rsample
                            
                                Why does substitute change noquote text to a string in R?
                            
                                How to render a gganimate graph in html using rmarkdown::render(), without generating unwanted output
                            
                                How to create all combinations from a nested list while preserving the structure using R?
                            
                                iteratively constructed dataframe in R
                            
                                Using multiple ellipses arguments in R
                            
                                as.POSIXct gives an unexpected timezone
                            
                                Big Data Process and Analysis in R
                            
                                How can I move facet labels to top of my graph?
                            
                                Writing data isn't preserving encoding
                            
                                Simple function of quantmod not working anymore
                            
                                Reverse fill order for histogram bars in ggplot2
                            
                                How to create a ribbon plot?
                            
                                Set a variable using colnames(), update data.table using := operator, variable is silently updated? [duplicate]
                            
                                3D surface plot from 2D matrix
                            
                                Why is there no NA_logical_
                            
                                RODBC loses time values of datetime when result set is large
                            
                                Calling a Rcpp function from another Rcpp function while building an R package
                            
                                Print, cat, paste in R separated by newline character

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

R max function ignore NA

Tags:

r

max

user2543622

People also ask

3 Answers

Ronak Shah

coffeinjunky

akrun

Recent Activity

Donate For Us