Set NA to 0 in R

Tags:

r

After merging a dataframe with another im left with random NA's for the occasional row. I'd like to set these NA's to 0 so I can perform calculations with them.

Im trying to do this with:

Click to copy

    bothbeams.data = within(bothbeams.data, {       bothbeams.data$x.x = ifelse(is.na(bothbeams.data$x.x) == TRUE, 0, bothbeams.data$x.x)       bothbeams.data$x.y = ifelse(is.na(bothbeams.data$x.y) == TRUE, 0, bothbeams.data$x.y)     })

Where $x.x is one column and $x.y is the other of course, but this doesn't seem to work.

848

asked Apr 13 '12 10:04

MaikelS

2 Answers

You can just use the output of is.na to replace directly with subsetting:

Click to copy

bothbeams.data[is.na(bothbeams.data)] <- 0

Or with a reproducible example:

Click to copy

dfr <- data.frame(x=c(1:3,NA),y=c(NA,4:6)) dfr[is.na(dfr)] <- 0 dfr   x y 1 1 0 2 2 4 3 3 5 4 0 6

However, be careful using this method on a data frame containing factors that also have missing values:

Click to copy

> d <- data.frame(x = c(NA,2,3),y = c("a",NA,"c")) > d[is.na(d)] <- 0 Warning message: In `[<-.factor`(`*tmp*`, thisvar, value = 0) :   invalid factor level, NA generated

It "works":

Click to copy

> d   x    y 1 0    a 2 2 <NA> 3 3    c

...but you likely will want to specifically alter only the numeric columns in this case, rather than the whole data frame. See, eg, the answer below using dplyr::mutate_if.

143

answered Sep 18 '22 21:09

James

A solution using mutate_all from dplyr in case you want to add that to your dplyr pipeline:

Click to copy

library(dplyr) df %>%   mutate_all(funs(ifelse(is.na(.), 0, .)))

Result:

Click to copy

   A B C 1  0 0 0 2  1 0 0 3  2 0 2 4  3 0 5 5  0 0 2 6  0 0 1 7  1 0 1 8  2 0 5 9  3 0 2 10 0 0 4 11 0 0 3 12 1 0 5 13 2 0 5 14 3 0 0 15 0 0 1

If in any case you only want to replace the NA's in numeric columns, which I assume it might be the case in modeling, you can use mutate_if:

Click to copy

library(dplyr) df %>%   mutate_if(is.numeric, funs(ifelse(is.na(.), 0, .)))

or in base R:

Click to copy

replace(is.na(df), 0)

Result:

Click to copy

   A    B C 1  0    0 0 2  1 <NA> 0 3  2    0 2 4  3 <NA> 5 5  0    0 2 6  0 <NA> 1 7  1    0 1 8  2 <NA> 5 9  3    0 2 10 0 <NA> 4 11 0    0 3 12 1 <NA> 5 13 2    0 5 14 3 <NA> 0 15 0    0 1

Update

with dplyr 1.0.0, across is introduced:

Click to copy

library(dplyr) # Replace `NA` for all columns df %>%   mutate(across(everything(), ~ ifelse(is.na(.), 0, .)))  # Replace `NA` for numeric columns df %>%   mutate(across(where(is.numeric), ~ ifelse(is.na(.), 0, .)))

Data:

Click to copy

set.seed(123) df <- data.frame(A=rep(c(0:3, NA), 3),                   B=rep(c("0", NA), length.out = 15),                   C=sample(c(0:5, NA), 15, replace = TRUE))

answered Sep 17 '22 21:09

acylam

Related questions
                            
                                Grid line consistent with ticks on axis
                            
                                ggplot2 heatmap with colors for ranged values
                            
                                Calculate percentage change in an R data frame
                            
                                Arrange a grouped_df by group variable not working
                            
                                Internal links in rmarkdown don't work
                            
                                Place a legend for each facet_wrap grid in ggplot2
                            
                                Batch convert columns to numeric type
                            
                                Sum of two Columns of Data Frame with NA Values
                            
                                Spearman correlation and ties
                            
                                How to extract sheet names from Excel file in R
                            
                                readOGR() cannot open file
                            
                                Emulate split() with dplyr group_by: return a list of data frames
                            
                                How to save() with a particular variable name
                            
                                Error: '\R' is an unrecognized escape in character string starting "C:\R"
                            
                                sample rows of subgroups from dataframe with dplyr
                            
                                How to add code folding to output chunks in rmarkdown html documents
                            
                                Calling a function from a namespace
                            
                                How to increase the font size of ggtitle in ggplot2
                            
                                How to filter a data frame
                            
                                Add extra level to factors in dataframe

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Set NA to 0 in R

Tags:

r

MaikelS

People also ask

2 Answers

James

Update

acylam

Recent Activity

Donate For Us