Suppose we want to access data from a data frame by row. The examples are simplified but when ordering a data frame by row names, for example, (<code>df[order(row.names(df)]</code>) we use the same technique. If the data frame has one column, we get back an atomic vector: <pre class="prettyprint"><code>> df x1 a x b y c z > df[1, ] # returns atomic vector [1] x </code></pre> If the data frame has two columns, we get back a 1-row data frame including the row name: <pre class="prettyprint"><code>> df x1 x2 a x u b y v c z w > df[1, ] # returns data frame X1 X2 a x u </code></pre> I don't understand why the same operation on the data frame yields two types of results depending on how many columns the frame has.

It's because the default argument to <code>[</code> is <code>drop=TRUE</code>. From <code>?"["</code> <blockquote> drop For matrices and arrays. If TRUE the result is coerced to the lowest possible dimension (see the examples). This only works for extracting elements, not for the replacement. See drop for further details. </blockquote> <pre class="prettyprint"><code>> dat1 <- data.frame(x=letters[1:3]) > dat2 <- data.frame(x=letters[1:3], y=LETTERS[1:3]) </code></pre> The default behaviour: <pre class="prettyprint"><code>> dat[1, ] row sessionId scenarionName stepName duration [1,] 1 1001 A start 0 > dat[2, ] row sessionId scenarionName stepName duration [1,] 2 1001 A step1 2.2 </code></pre> Using <code>drop=FALSE</code>: <pre class="prettyprint"><code>> dat1[1, , drop=FALSE] x 1 a > dat2[1, , drop=FALSE] x y 1 a A </code></pre>

In R, why does selecting rows from a data frame return data as a vector if the data frame has only one column?

Tags:

dataframe

r

Suppose we want to access data from a data frame by row. The examples are simplified but when ordering a data frame by row names, for example, (df[order(row.names(df)]) we use the same technique.

If the data frame has one column, we get back an atomic vector:

> df
    x1
a   x
b   y
c   z

> df[1, ] # returns atomic vector
[1] x

If the data frame has two columns, we get back a 1-row data frame including the row name:

> df
    x1 x2
a   x  u
b   y  v
c   z  w 

> df[1, ] # returns data frame
   X1 X2
a  x  u

I don't understand why the same operation on the data frame yields two types of results depending on how many columns the frame has.

982

asked Oct 06 '11 09:10

malana

1 Answers

It's because the default argument to [ is drop=TRUE.

From ?"["

drop
For matrices and arrays. If TRUE the result is coerced to the lowest possible dimension (see the examples). This only works for extracting elements, not for the replacement. See drop for further details.

> dat1 <- data.frame(x=letters[1:3])
> dat2 <- data.frame(x=letters[1:3], y=LETTERS[1:3])

The default behaviour:

> dat[1, ]
     row sessionId scenarionName stepName duration
[1,]   1      1001             A    start        0

> dat[2, ]
     row sessionId scenarionName stepName duration
[1,]   2      1001             A    step1      2.2

Using drop=FALSE:

> dat1[1, , drop=FALSE]
  x
1 a

> dat2[1, , drop=FALSE]
  x y
1 a A

answered Nov 04 '22 20:11

Andrie

Related questions
                            
                                How can I include a variable name in a function call in R?
                            
                                Removing object from parent environment using rm()
                            
                                Panel data with binary dependent variable in R
                            
                                how to change strip.text labels in ggplot with facet and margin=TRUE
                            
                                Mapping the link network between blogs using R?
                            
                                Calculate the sum of matrices in a list or a 3D array
                            
                                Drop down list implementation in R
                            
                                machine learning libraries in s+ (or R)?
                            
                                R Newbie Confused about Install Packages
                            
                                Can you easily plot rugs/axes on the top/right in ggplot2?
                            
                                Averaging over continuous blocks
                            
                                continuous subgroups with ddply
                            
                                R- xy scatter plot in 3d using density
                            
                                Unable to install zoo package (R)
                            
                                Zip Code Demographics in R
                            
                                Convert data.frame to xts object and preserve types
                            
                                Convert Factor columns in data frame to numeric type columns [duplicate]
                            
                                How do I convert a `raw` into a vector of integers in R?
                            
                                Keep the last 9 digits of an alphanumeric string in R
                            
                                How to load comma separated data into R?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With