While looking at an answer posted recently on SO, I noticed an unfamiliar assignment statement. Instead of the usual form of <code>myVar<- myValue</code>, it used the form <code>myVar[]<- myValue</code>, i.e. the object on lefthand side is indexed with empty square brackets. Personally, I had never seen such an assignment, but it had a highly useful effect-- it reshaped the assigned data 'myValue' to the shape of 'myVar'. I would like to use this in my code as this makes things lot easier. However the documentation for <code>"<-"</code> seems to be silent on it. Is this a well established feature and one can rely on it to work in all cases? Also, my guess is that it might be a side effect of a function call stack, i.e. calling <code><-</code> and <code>[</code> in sequence, but I could not figure out how. Can someone throw some light on that? Here's an example-- <pre class="prettyprint"><code># A dataframe df1 <- data.frame(a = 1:4, b = 11:14) # simple assignment assigns to class of RHS df1 <- c(21:24, 31:34) df1 #[1] 21 22 23 24 31 32 33 34 class(df1) #[1] "integer" #assignment with [] casts to class of LHS df1<- data.frame(a = 1:4, b = 11:14) df1[]<- c(21:24,31:34) df1 # a b # 1 21 31 # 2 22 32 # 3 23 33 # 4 24 34 # recycling to preserve shape df1[]<- c(101:102) df1 # a b # 1 101 101 # 2 102 102 # 3 101 101 # 4 102 102 class(df1) #data.frame # reshaping df1<- data.frame(a = 1:4, b = 11:14) df1[] <- matrix(1:8, 2,4) df1 #matrix reshaped class(df1) #[1] "data.frame" # flattening x<- 1:8 x[] <- matrix(1:8,4,2) x #[1] 1 2 3 4 5 6 7 8 </code></pre>

This is an intentional and documented feature. As joran mentioned, the documentation page "Extract" includes this in the "Atomic Vectors" section: <blockquote> An empty index selects all values: this is most often used to replace all the entries but keep the attributes. </blockquote> However, in the case of recursive objects (<code>data.frames</code> or <code>lists</code>, for example), the attributes are only kept for the subsetted object. Its parts don't get such protection. Here's an example: <pre class="prettyprint"><code>animals <- factor(c('cat', 'dog', 'fish')) df_factor <- data.frame(x = animals) rownames(df_factor) <- c('meow', 'bark', 'blub') str(df_factor) # 'data.frame': 3 obs. of 1 variable: # $ x: Factor w/ 3 levels "cat","dog","fish": 1 2 3 df_factor[] <- 'cat' str(df_factor) # 'data.frame': 3 obs. of 1 variable: # $ x: chr "cat" "cat" "cat" rownames(df_factor) # [1] "meow" "bark" "blub" </code></pre> <code>df_factor</code> kept its <code>rownames</code> attribute, but the <code>x</code> column is just the character vector used in the assignment instead of a factor. We can keep the class and levels of <code>x</code> by specifically replacing its values: <pre class="prettyprint"><code>df_factor <- data.frame(x = animals) df_factor$x[] <- 'cat' str(df_factor) # 'data.frame': 3 obs. of 1 variable: # $ x: Factor w/ 3 levels "cat","dog","fish": 1 1 1 </code></pre> So replacement with empty subsetting is very safe for vectors, matrices, and arrays, because their elements can't have their own attributes. But it requires some care when dealing with list-like objects.

Assignment to empty index (empty square brackets x[]<-) on LHS

Q: What does empty square brackets mean in MATLAB?

MATLAB has positional input arguments. Many MATLAB functions use an empty numeric array (i.e. []) to indicate that an input argument is undefined, which allows further input arguments to be specified.

Q: What do empty brackets mean in Python?

This is to indicate that you have an empty "list" as opposed to any variables. It also allows you to invoke specific methods like . append used in your code for subst_words.

Q: How do you use square brackets in Python?

Values in a Python dictionary can be accessed by placing the key within square brackets next to the dictionary. Values can be written by placing key within square brackets next to the dictionary and using the assignment operator ( = ). If the key already exists, the old value will be overwritten.

Tags:

syntax

casting

variable-assignment

r

reshape

While looking at an answer posted recently on SO, I noticed an unfamiliar assignment statement. Instead of the usual form of myVar<- myValue, it used the form myVar[]<- myValue, i.e. the object on lefthand side is indexed with empty square brackets. Personally, I had never seen such an assignment, but it had a highly useful effect-- it reshaped the assigned data 'myValue' to the shape of 'myVar'.

I would like to use this in my code as this makes things lot easier. However the documentation for "<-" seems to be silent on it.

Is this a well established feature and one can rely on it to work in all cases?

Also, my guess is that it might be a side effect of a function call stack, i.e. calling <- and [ in sequence, but I could not figure out how. Can someone throw some light on that?

Here's an example--

# A dataframe
df1 <- data.frame(a = 1:4, b = 11:14)

# simple assignment assigns to class of RHS
df1 <- c(21:24, 31:34)
df1 
#[1] 21 22 23 24 31 32 33 34
class(df1)
#[1] "integer"

#assignment with [] casts to class of LHS 
df1<- data.frame(a = 1:4, b = 11:14)
df1[]<- c(21:24,31:34)
df1

#    a  b
# 1 21 31
# 2 22 32
# 3 23 33
# 4 24 34


# recycling to preserve shape
df1[]<- c(101:102)
df1

#     a   b
# 1 101 101
# 2 102 102
# 3 101 101
# 4 102 102

class(df1)
#data.frame

# reshaping 

df1<- data.frame(a = 1:4, b = 11:14)
df1[] <- matrix(1:8, 2,4)
df1 #matrix reshaped 
class(df1)
#[1] "data.frame"

# flattening 
x<- 1:8
x[] <- matrix(1:8,4,2)
x
#[1] 1 2 3 4 5 6 7 8

977

asked Dec 16 '16 19:12

R.S.

1 Answers

This is an intentional and documented feature. As joran mentioned, the documentation page "Extract" includes this in the "Atomic Vectors" section:

An empty index selects all values: this is most often used to replace all the entries but keep the attributes.

However, in the case of recursive objects (data.frames or lists, for example), the attributes are only kept for the subsetted object. Its parts don't get such protection.

Here's an example:

animals <- factor(c('cat', 'dog', 'fish'))
df_factor <- data.frame(x = animals)
rownames(df_factor) <- c('meow', 'bark', 'blub')
str(df_factor)
# 'data.frame': 3 obs. of  1 variable:
#   $ x: Factor w/ 3 levels "cat","dog","fish": 1 2 3

df_factor[] <- 'cat'
str(df_factor)
# 'data.frame': 3 obs. of  1 variable:
#   $ x: chr  "cat" "cat" "cat"
rownames(df_factor)
# [1] "meow" "bark" "blub"

df_factor kept its rownames attribute, but the x column is just the character vector used in the assignment instead of a factor. We can keep the class and levels of x by specifically replacing its values:

df_factor <- data.frame(x = animals)
df_factor$x[] <- 'cat'
str(df_factor)
# 'data.frame': 3 obs. of  1 variable:
#   $ x: Factor w/ 3 levels "cat","dog","fish": 1 1 1

So replacement with empty subsetting is very safe for vectors, matrices, and arrays, because their elements can't have their own attributes. But it requires some care when dealing with list-like objects.

112

answered Sep 30 '22 16:09

Nathan Werth

Related questions
                            
                                == and %in% differ based on character encoding?
                            
                                Dynamically display a dashboardPage
                            
                                Why does 'out of bounds' indexing differ between a matrix and a data.frame?
                            
                                Showing equation of nls model with ggpmisc
                            
                                R Plotly animation - initial frame
                            
                                Permute a vector such that an element can't be in the same place
                            
                                Using Unicode inside R's expression() command
                            
                                R: Why does dbWriteTable fail when table exists despite 'append = TRUE'
                            
                                Shiny App unable to start on shiny server
                            
                                Create UML diagrams directly from R code
                            
                                Inserting control inputs and HTML widgets inside rhandsontable cells in shiny
                            
                                How to read a parquet file in R without using spark packages?
                            
                                R data.table weird value/reference semantics
                            
                                Install R Studio Server on Windows
                            
                                Using standard evaluation and do_ to run simulations on a grid of parameters without do.call
                            
                                Optimising Shiny + Leaflet performance for detailed maps with many 'layers'
                            
                                'make'-like dependency-tracking library?
                            
                                How to dodge pointrange ggplots on two levels?
                            
                                Can I use knitr to apply CSS styles to individual table cells?
                            
                                How to extract create statements from different tables of MySQL DBs?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With