data.table replacing a value by NA [duplicate]

Tags:

2 Answers

You can try set for multiple columns. It will be faster as the overhead of .[data.table is avoided

for(j in seq_along(dt1)){
         set(dt1, i=which(dt1[[j]]==0), j=j, value=NA)
}
dt1
#   V1 V2
#1: NA  2
#2:  1  1
#3:  2 NA

Or another option would be looping with lapply and then change the 0 values to NA with replace

dt1[, lapply(.SD, function(x) replace(x, which(x==0), NA))]

Or we can make use of some arthithmetic operations to convert the 0 value to NA.

 dt1[, lapply(.SD, function(x) (NA^!x) *x)]

The way (NA^!x)*x this works is by converting the !x i.e. a logical TRUE/FALSE vector for each column (where TRUE corresponds to 0 value) to NA and 1 by doing NA^!x. We multiply with the x value to replace the 1 with the x value corresponding to it while the NA will remain as such.

Or a syntax similar to base R would be

  is.na(dt1) <- dt1==0

But this method may not be that efficient for large data.table as dt1==0 would be a logical matrix and also as @Roland mentioned in the comments that the dataset would be copied. I would either use the lapply based or the more efficient set for larger datasets.

answered Nov 01 '22 23:11

akrun

dt1[dt1==0] <- NA worked for me.

dt1[dt1==0] <- NA
dt1
##   V1 V2
##1: NA  2
##2:  1  1
##3:  2 NA

As noted by Roland, this does make a copy of the data.table object, and will be slower.

answered Nov 02 '22 00:11

alexforrence

Related questions
                            
                                Install R Packages without internet [duplicate]
                            
                                Histogram with "negative" logarithmic scale in R
                            
                                In R, Merge two data frames, fill down the blanks
                            
                                split string with regex
                            
                                Why doesn't the plyr package use my parallel backend?
                            
                                Simple method of counting non-NAs in column of data String [duplicate]
                            
                                Subset variables in data frame based on column type
                            
                                R calculate the standard error using bootstrap
                            
                                Passing large matrices to RcppArmadillo function without creating copy (advanced constructors)
                            
                                Efficient method to subset drop rows with NA values in R
                            
                                Count the number of pattern matches in a string
                            
                                How can I extract factor loadings from lavaan?
                            
                                mean( ,na.rm=TRUE) still returns NA
                            
                                Replace text that appears at the end of a string
                            
                                Use string as filter in dplyr?
                            
                                How to build a crossword-like plot for a boolean matrix
                            
                                R: find vector in list of vectors
                            
                                Search for and remove outliers from a dataframe grouped by a variable
                            
                                R-ranking values of a column by grouping, conditional to another variable
                            
                                Basic - T-Test -> Grouping Factor Must have Exactly 2 Levels

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

data.table replacing a value by NA [duplicate]

Tags:

r

data.table

MYaseen208

People also ask

2 Answers

akrun

alexforrence

Recent Activity

Donate For Us