I've come across this post on how to replace occurrences of a number in all columns of a data frame (e.g. replace all 4 by 10 in all columns): <code>DF[DF == 4] <- 10</code>. With data tables the same results can be achieved in exactly the same way: <code>DT[DT == 4] <- 10</code>. However, how should I procede if I want to apply this modification but only to specific columns from the data table, whether these columns are specified by position (e.g. <code>2:4</code>) or by name (e.g. <code>c("V2", "V3", "V4")</code>)? I will favor an "elegant" solution rather than iterations over every column.

We can use <code>set</code> which would be more efficient <pre class="prettyprint"><code>for(j in 2:4) { set(DT, i = which(DT[[j]]==4), j=j, value = 10) } DT # V1 V2 V3 V4 #1: A 2 2 10 #2: B 1 10 10 #3: C 3 10 3 #4: D 3 2 10 #5: E 3 3 3 #6: F 10 3 3 </code></pre> The above can be done with column names as well <pre class="prettyprint"><code>for(j in names(DT)[2:4]){ set(DT, i = which(DT[[j]]==4), j=j, value = 10) } </code></pre> <hr> Or another option is to specify the <code>.SDcols</code> with the columns of interest (either the numeric index or the column names), loop through the Subset of Data.table (<code>.SD</code>), <code>replace</code> the values that are 4 to 10 and assign (<code>:=</code>) the output back to columns of interest <pre class="prettyprint"><code>DT[, (2:4) := lapply(.SD, function(x) replace(x, x==4, 10)), .SDcols = 2:4] </code></pre> Or with column names <pre class="prettyprint"><code>DT[, (names(DT)[2:4]) := lapply(.SD, function(x) replace(x, x==4, 10)), .SDcols = names(DT)[2:4]] </code></pre> <h3>data</h3> <pre class="prettyprint"><code>set.seed(24) DT <- data.table(V1 = LETTERS[1:6], V2 = sample(1:4, 6, replace = TRUE), V3 = sample(2:4, 6, replace = TRUE), V4 = sample(3:4, 6, replace= TRUE)) </code></pre>

Replacing occurrences of a number in specific columns from a data table

Q: How do I replace multiple strings in R?

Use str_replace_all() method of stringr package to replace multiple string values with another list of strings on a single column in R and update part of a string with another string.

Tags:

r

data.table

I've come across this post on how to replace occurrences of a number in all columns of a data frame (e.g. replace all 4 by 10 in all columns): DF[DF == 4] <- 10. With data tables the same results can be achieved in exactly the same way: DT[DT == 4] <- 10.

However, how should I procede if I want to apply this modification but only to specific columns from the data table, whether these columns are specified by position (e.g. 2:4) or by name (e.g. c("V2", "V3", "V4"))?

I will favor an "elegant" solution rather than iterations over every column.

333

asked Jun 04 '17 10:06

mat

1 Answers

We can use set which would be more efficient

for(j in 2:4) {
  set(DT, i = which(DT[[j]]==4), j=j, value = 10)
 }
DT
#   V1 V2 V3 V4
#1:  A  2  2 10
#2:  B  1 10 10
#3:  C  3 10  3
#4:  D  3  2 10
#5:  E  3  3  3
#6:  F 10  3  3

The above can be done with column names as well

for(j in names(DT)[2:4]){
   set(DT, i = which(DT[[j]]==4), j=j, value = 10)
 }

Or another option is to specify the .SDcols with the columns of interest (either the numeric index or the column names), loop through the Subset of Data.table (.SD), replace the values that are 4 to 10 and assign (:=) the output back to columns of interest

DT[, (2:4) := lapply(.SD, function(x) replace(x, x==4, 10)), .SDcols = 2:4]

Or with column names

DT[, (names(DT)[2:4]) := lapply(.SD, function(x) replace(x, x==4, 10)), 
      .SDcols = names(DT)[2:4]]

data

set.seed(24)
DT <- data.table(V1 = LETTERS[1:6], V2 = sample(1:4, 6, replace = TRUE), 
   V3 = sample(2:4, 6, replace = TRUE), V4 = sample(3:4, 6, replace= TRUE))

185

answered Oct 13 '22 22:10

akrun

Related questions
                            
                                Bar chart overlay in Plotly R
                            
                                grid_plot + tikzDevice + shared legend with latex mark up
                            
                                How to sort list in R?
                            
                                why does as.integer in R decrement the value?
                            
                                Using ggrepel with single plot point/adding line between label and point
                            
                                How to add legend to geom_vline in facet histograms?
                            
                                p-values in pvclust & results in hclust
                            
                                Remove all rows of a category if one row meets a condition [duplicate]
                            
                                Display dates on axes in R
                            
                                Add grid lines to minor breaks only (ggplot)
                            
                                How add more information in tooltip cloude in ggiraph packages in R?
                            
                                Displaying R ggplots inline in jupyter notebooks
                            
                                How to convert R Date into Excel numeric serial date?
                            
                                Seemingly inconsistent column reference syntax when chaining methods on pandas data frames
                            
                                R: how to write a raster to disk without auxiliary file?
                            
                                Grouping variables select first row (keep one column), last row (keep different column)
                            
                                Get file through API call (R & plumber)
                            
                                Skimr - cant seem to produce the histograms
                            
                                Installing an R package from local unzipped folder
                            
                                how to feed the result of a pipe chain (magrittr) to an object

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With