simple data.frame reshape

Tags:

dataframe

r

reshape

I've just come back to R from a long hiatus writing and I'm having some real problems remembering how to reshape data. I know that what I want to do is easy, but for some reason I'm being dumb tonight and have confused myself with melt and reshape. If anyone could quickly point me in the right direction it would be hugely appreciated.

I have a data frame like this:

person    week    year   
personA   6       1
personA   22      1
personA   41      1
personA   42      1
personA   1       2
personA   23      2
personB   8       2
personB   9       2
....
personN   x       y

I want to end up with a count of events by year and by person, so that I can plot a quick line graph for each person over the years.

e.g.

person    year1    year2
personA   4        2
personB   0        2

Many thanks for reading.

820

asked May 06 '12 14:05

user1378122

1 Answer

I would probably use the reshape2 package and its dcast function, since it handles both the reshaping and the aggregation in one step:

> library(reshape2)
> dcast(dat, person ~ year, value.var = "year")
Aggregation function missing: defaulting to length
   person 1 2
1 personA 4 2
2 personB 0 2
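
For anyone who wants to reproduce this without installing a package, here is a minimal base R sketch of the same count. It assumes the data frame is called dat (as in the call above) and holds only the eight rows shown in the question; the names counts and wide, and the year1/year2 column labels, are illustrative choices rather than anything dcast produces itself.

```r
# Sample data copied from the question (the elided rows are left out).
dat <- data.frame(
  person = c(rep("personA", 6), rep("personB", 2)),
  week   = c(6, 22, 41, 42, 1, 23, 8, 9),
  year   = c(1, 1, 1, 1, 2, 2, 2, 2)
)

# Cross-tabulate person by year; combinations with no rows count as 0.
counts <- xtabs(~ person + year, data = dat)

# Turn the table into a wide data frame with one column per year.
wide <- as.data.frame.matrix(counts)
names(wide) <- paste0("year", names(wide))
wide <- cbind(person = rownames(wide), wide)
rownames(wide) <- NULL
print(wide)

# The reshape2 call with the aggregation function stated explicitly,
# run only if the package happens to be installed.
if (requireNamespace("reshape2", quietly = TRUE)) {
  print(reshape2::dcast(dat, person ~ year, value.var = "year",
                        fun.aggregate = length))
}
```

Passing fun.aggregate = length explicitly should give the same counts as the default while avoiding the "Aggregation function missing" message. For the quick line graph, something like matplot(t(wide[-1]), type = "l") ought to draw one line per person, though that part is untested against the full data.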

144

answered Sep 18 '22 11:09

Chase

