Assign a matrix to a subset of a data.table

Tags: r, data.table

I would like to assign a matrix to a multi-column subset of a data.table but the matrix ends up getting treated as a column vector. For example,

library(data.table)
dt1 <- data.table(a1=rnorm(5), a2=rnorm(5), a3=rnorm(5))
m1 <- matrix(rnorm(10), ncol=2)
dt1[,c("a1","a2")] <- m1

Warning messages:
1: In `[<-.data.table`(`*tmp*`, , c("a1", "a2"), value = c(-0.308851784175091,  :
  2 column matrix RHS of := will be treated as one vector
2: In `[<-.data.table`(`*tmp*`, , c("a1", "a2"), value = c(-0.308851784175091,  :
  Supplied 10 items to be assigned to 5 items of column 'a1' (5 unused)
3: In `[<-.data.table`(`*tmp*`, , c("a1", "a2"), value = c(-0.308851784175091,  :
  2 column matrix RHS of := will be treated as one vector
4: In `[<-.data.table`(`*tmp*`, , c("a1", "a2"), value = c(-0.308851784175091,  :
  Supplied 10 items to be assigned to 5 items of column 'a2' (5 unused)

The problem can be solved by first converting m1 to another data.table object, but I'm curious what the reasoning is for this error. The above syntax would work if dt1 were a data.frame; what is the architectural rationale for not having it work with data.table?

asked Nov 12 '13 00:11

Abiel

2 Answers

dt1[,c("a1","a2")] <- as.data.table(m1)

gives a simple solution but does make a copy.

@Simon O'Hanlon provides a solution in the data.table way:

dt1[ , `:=`( a1 = m1[,1] , a2 = m1[,2] ) ]

and in my opinion an even better data.table solution is provided by @Frank:

dt1[,c("a1","a2") := as.data.table(m1)]
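As a quick check of the := form above, here is a minimal sketch, assuming the data.table package is installed. The seed, the a3_before helper, and the all.equal comparisons are additions for illustration and are not part of the original question.

```r
library(data.table)

set.seed(1)  # illustrative seed, not in the original question
dt1 <- data.table(a1 = rnorm(5), a2 = rnorm(5), a3 = rnorm(5))
m1 <- matrix(rnorm(10), ncol = 2)
a3_before <- dt1$a3  # hypothetical helper: remember the untouched column

# Convert the matrix to a list of columns, then assign by reference with :=
dt1[, c("a1", "a2") := as.data.table(m1)]

# Each target column should now hold the corresponding matrix column
isTRUE(all.equal(dt1$a1, m1[, 1]))
isTRUE(all.equal(dt1$a2, m1[, 2]))
```

The right-hand side is a list of two length-5 columns, so no matrix reaches := and none of the warnings from the question should appear.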

answered Oct 09 '22 04:10

caranbot

A data.frame is not a matrix, nor is a data.table. Both data.frame and data.table objects are lists. Lists and matrices are stored very differently; the indexing syntax can look similar, but it is handled differently under the hood.

Within [<-.data.frame, a matrix-valued value is split into a list with one element for each column.

(The line is value <- split(value, col(value)).)
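To see what that split does, here is a small base R sketch. The names m, cols and df are made up for illustration, and this only mirrors the idea; it is not the exact code path inside [<-.data.frame.

```r
# Base R only: a small integer matrix with two columns
m <- matrix(1:6, ncol = 2)

# col(m) labels every cell with its column index, so split() groups by column
cols <- split(m, col(m))
str(cols)

# A data.frame accepts the matrix directly on the right-hand side
df <- data.frame(a1 = numeric(3), a2 = numeric(3), a3 = numeric(3))
df[, c("a1", "a2")] <- m
df
```

The result of split() is a plain list with one vector per matrix column, which is the shape a list-based object such as a data.frame needs for column assignment.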

Note also that [<-.data.frame will copy the entire data.frame in the process of assigning something to a subset of columns.

data.table attempts to avoid this copying, so [<-.data.table should be avoided: all <- methods in R make copies. Use := instead, which assigns by reference.

Within [<-.data.table, [<-.data.frame will be called if i is a matrix, but not if only value is.

data.table usually likes you to be explicit in ensuring that the types of data match when assigning. This helps avoid any coercion and related copying.

You could perhaps file a feature request for this compatibility, but since your usage is far outside what is recommended, the package authors might simply ask you to use the data.table conventions and approaches.

answered Oct 09 '22 03:10

mnel
