R can't convert NaN to NA

Tags:

I have a data frame with several factor columns containing NaN's that I would like to convert to NA's (the NaN seems to be a problem for using linear regression objects to predict on new data).

> tester1 <- c("2", "2", "3", "4", "2", "3", NaN)
> tester1 
[1] "2"   "2"   "3"   "4"   "2"   "3"   "NaN"
> tester1[is.nan(tester1)] = NA
> tester1 
[1] "2"   "2"   "3"   "4"   "2"   "3"   "NaN"
> tester1[is.nan(tester1)] = "NA"
> tester1 
[1] "2"   "2"   "3"   "4"   "2"   "3"   "NaN"

871

asked Feb 27 '12 22:02

screechOwl

1 Answers

Here's the problem: Your vector is character in mode, so of course it's "not a number". That last element got interpreted as the string "NaN". Using is.nan will only make sense if the vector is numeric. If you want to make a value missing in a character vector (so that it gets handle properly by regression functions), then use (without any quotes), NA_character_.

> tester1 <- c("2", "2", "3", "4", "2", "3", NA_character_)
>  tester1
[1] "2" "2" "3" "4" "2" "3" NA 
>  is.na(tester1)
[1] FALSE FALSE FALSE FALSE FALSE FALSE  TRUE

Neither "NA" nor "NaN" are really missing in character vectors. If for some reason there were values in a factor variable that were "NaN" then you would have been able just use logical indexing:

tester1[tester1 == "NaN"] = "NA"  
# but that would not really be a missing value either 
# and it might screw up a factor variable anyway.

tester1[tester1=="NaN"] <- "NA"
Warning message:
In `[<-.factor`(`*tmp*`, tester1 == "NaN", value = "NA") :
invalid factor level, NAs generated
##########
tester1 <- factor(c("2", "2", "3", "4", "2", "3", NaN))

> tester1[tester1 =="NaN"] <- NA_character_
> tester1
[1] 2    2    3    4    2    3    <NA>
Levels: 2 3 4 NaN

That last result might be surprising. There is a remaining "NaN" level but none of elements is "NaN". Instead the element that was "NaN" is now a real missing value signified in print as .

127

answered Oct 15 '22 09:10

IRTFM

Related questions
                            
                                Is there an R function for the element-wise summation of the matrices stored as elements in single list object? [duplicate]
                            
                                convert string date to R Date FAST for all dates
                            
                                How logical negation operator "!" works
                            
                                Power regression in R similar to excel
                            
                                using column names when appending data in write.table
                            
                                Remove NA from list of lists
                            
                                How to index a vector sequence within a vector sequence
                            
                                I am unable to download the reshape2 package in R [closed]
                            
                                Filter by multiple patterns with filter() and str_detect()
                            
                                From [package] import [function] in R
                            
                                Is it good practice to update R packages often? [closed]
                            
                                How to replicate a Monthly Cycle Chart in R
                            
                                In R, how to use a "null" default value for an argument of a function?
                            
                                grep() to search column names of a dataframe
                            
                                Error installing 'topicmodels' package, non zero exit status; Ubuntu
                            
                                R - Error in UseMethod("groups") : no applicable method for 'groups' applied to an object of class "character"
                            
                                R: How can I use apply on rows of a data.frame and get out $column_name?
                            
                                Examining contents of .rdata file by attaching into a new environment - possible?
                            
                                match two columns with two other columns
                            
                                Remove rows in dataframe with factor ""

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With