My data frame contains the output of a survey with a select multiple question type. Some cells have multiple values. <pre class="prettyprint"><code>df <- data.frame(a=1:3,b=I(list(1,1:2,1:3))) df a b 1 1 1 2 2 1, 2 3 3 1, 2, 3 </code></pre> I would like to flatten out the list to obtain the following output: <pre class="prettyprint"><code>df a b 1 1 1 2 2 1 3 2 2 4 3 1 5 3 2 6 3 3 </code></pre> should be easy but somehow I can't find the search terms. thanks.

You can just use <code>unnest</code> from "tidyr": <pre class="prettyprint"><code>library(tidyr) unnest(df, b) # a b # 1 1 1 # 2 2 1 # 3 2 2 # 4 3 1 # 5 3 2 # 6 3 3 </code></pre>

Using <code>base R</code>, one option is <code>stack</code> after naming the <code>list</code> elements of 'b' column with that of the elements of 'a'. We can use <code>setNames</code> to change the names. <pre class="prettyprint"><code>stack(setNames(df$b, df$a)) </code></pre> Or another option would be to use <code>unstack</code> to automatically name the list element of 'b' with 'a' elements and then do the <code>stack</code> to get a <code>data.frame</code> output. <pre class="prettyprint"><code>stack(unstack(df, b~a)) </code></pre> <hr> Or we can use a convenient function <code>listCol_l</code> from <code>splitstackshape</code> to convert the <code>list</code> to <code>data.frame</code>. <pre class="prettyprint"><code>library(splitstackshape) listCol_l(df, 'b') </code></pre>

Flatten list column in data frame with ID column

Tags:

dataframe

r

reshape

My data frame contains the output of a survey with a select multiple question type. Some cells have multiple values.

df <- data.frame(a=1:3,b=I(list(1,1:2,1:3)))
df
  a       b
1 1       1
2 2    1, 2
3 3 1, 2, 3

I would like to flatten out the list to obtain the following output:

df
  a       b
1 1       1
2 2       1
3 2       2
4 3       1
5 3       2
6 3       3

should be easy but somehow I can't find the search terms. thanks.

508

asked May 14 '15 17:05

mloudon

2 Answers

You can just use unnest from "tidyr":

library(tidyr)
unnest(df, b)
#   a b
# 1 1 1
# 2 2 1
# 3 2 2
# 4 3 1
# 5 3 2
# 6 3 3

answered Oct 02 '22 21:10

A5C1D2H2I1M1N2O1R2T1

Using base R, one option is stack after naming the list elements of 'b' column with that of the elements of 'a'. We can use setNames to change the names.

stack(setNames(df$b, df$a))

Or another option would be to use unstack to automatically name the list element of 'b' with 'a' elements and then do the stack to get a data.frame output.

stack(unstack(df, b~a))

Or we can use a convenient function listCol_l from splitstackshape to convert the list to data.frame.

library(splitstackshape)
listCol_l(df, 'b')

answered Oct 02 '22 21:10

akrun

Related questions
                            
                                SpatialPolygons - Creating a set of polygons in R from coordinates
                            
                                Faster way to subset on rows of a data frame in R?
                            
                                How to get "proportion of variance" vector from princomp in R
                            
                                create new variable based on a regular expression
                            
                                How to extract keywords from Google search results page URL?
                            
                                ggplot2 fails to install on R 3.0.2
                            
                                Quarterly Year over Year Growth Rate
                            
                                Plot legend below the graphs and legend title above the legend in ggplot2
                            
                                dplyr - using column names as function arguments
                            
                                POST request using RCurl
                            
                                R: Function to copy to clipboard on Mac/OSX? [duplicate]
                            
                                Correlation between two dataframes by row
                            
                                writeClipboard for matrices or data frames?
                            
                                Modifying timezone of a POSIXct object without changing the display
                            
                                Strip the date and keep the time
                            
                                Correctly color vertices in R igraph
                            
                                How do I reorder data.table columns? [duplicate]
                            
                                time series plot with x axis in "year"-"month" in R
                            
                                ggplot2: Bring one line to the front, but save the colors
                            
                                Is it possible with ggvis to interactively change the variables for the x and y axes?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With