How to change type of target column when doing := by group in a data.table in R?

Tags: types, r, data.table

I'm trying to do := by group for an existing column of type 'integer' where the new values are of type 'double', which fails.

My scenario is converting a column representing time into a POSIXct based on values in other columns. I could change how the data.table is created as a workaround, but I'm still interested in how to actually change the type of a column, as the error message suggests.

Here's a simple toy example of my problem:

library(data.table)

db = data.table(id=rep(1:2, each=5), x=1:10, y=runif(10))
db
    id  x          y
 1:  1  1 0.47154470
 2:  1  2 0.03325867
 3:  1  3 0.56784494
 4:  1  4 0.47936031
 5:  1  5 0.96318208
 6:  2  6 0.83257416
 7:  2  7 0.10659533
 8:  2  8 0.23103810
 9:  2  9 0.02900567
10:  2 10 0.38346531

db[, x:=mean(y), by=id]   

Error in `[.data.table`(db, , `:=`(x, mean(y)), by = id) : 
Type of RHS ('double') must match LHS ('integer'). To check and coerce would impact performance too much for the fastest cases. Either change the type of the target column, or coerce the RHS of := yourself (e.g. by using 1L instead of 1)

asked Apr 15 '15 07:04

hallvig

1 Answer

Because 'x' is of class 'integer', we can convert it to 'numeric' before assigning 'mean(y)' to it. This is useful whenever 'x' is replaced with the mean of any numeric variable (including 'x' itself).

db[, x:= as.numeric(x)][, x:= mean(y), by=id][]
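To see why this works, it may help to check the column class at each step. The following is a minimal sketch of the same approach, split into separate statements; the fixed seed is an assumption added only to make the example reproducible.

```r
library(data.table)

set.seed(1)  # assumption: fixed seed, not part of the original example
db <- data.table(id = rep(1:2, each = 5), x = 1:10, y = runif(10))

class(db$x)                  # "integer"

# Ungrouped whole-column replacement: this changes the column type
db[, x := as.numeric(x)]
class(db$x)                  # "numeric"

# The grouped assignment now writes doubles into a double column
db[, x := mean(y), by = id]
db
```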

Or assign to a new column, drop the old one, and rename the new column afterwards:

setnames(db[, x1:= mean(y),by=id][,x:=NULL],'x1', 'x')
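One side effect worth knowing: dropping 'x' and adding its replacement puts the new column last. If the original column order matters, it can be restored by reference with setcolorder(). A small sketch, again with an assumed fixed seed:

```r
library(data.table)

set.seed(1)  # assumption: fixed seed, not part of the original example
db <- data.table(id = rep(1:2, each = 5), x = 1:10, y = runif(10))

# Create the replacement, drop the old column, then rename
setnames(db[, x1 := mean(y), by = id][, x := NULL], "x1", "x")
names(db)                             # "id" "y" "x"

# Restore the original column order by reference
setcolorder(db, c("id", "x", "y"))
names(db)                             # "id" "x" "y"
```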

Or we can remove 'x' by assigning NULL to it and then create 'x' again as the mean of 'y' (@David Arenburg's suggestion):

db[, x:=NULL][, x:= mean(y), by= id][]
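The same two-step pattern should carry over to the POSIXct scenario mentioned in the question: change the column type with an ungrouped assignment first, then assign by group. The sketch below uses hypothetical data (the table 'dt', the column 'time' holding integer seconds since 1970-01-01 UTC, and the choice of min() as the grouped value are all assumptions for illustration).

```r
library(data.table)

# Hypothetical data: 'time' is integer seconds since 1970-01-01 UTC
dt <- data.table(id   = rep(1:2, each = 3),
                 time = c(0L, 60L, 120L, 3600L, 3660L, 3720L))

# Step 1: ungrouped whole-column replacement changes the type.
# as.numeric() ensures the POSIXct values are stored as doubles.
dt[, time := as.POSIXct(as.numeric(time), origin = "1970-01-01", tz = "UTC")]

# Step 2: grouped assignment; the RHS is POSIXct (double), like the column
dt[, time := min(time), by = id]
dt
```

Here every row ends up with the earliest timestamp of its group, and 'time' stays a POSIXct column.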

answered Oct 02 '22 01:10

akrun
