sum by year in a row in a dataframe in r

Tags:

r

I have a dataframe with two columns (year and precipitation). In a single column, the year is listed such that it starts from 1900 and ends at 2014 and again starts with 1900. In another column I have precipitation value of the respective year. Now i want to add all the precipitation of 1900 as 1 value and 1901 as 1 to up to 2014. My data looks like:

Year    Precipitation

1900    4.826
1901    37.592
2014    14.224
1900    45.974
1901    46.228
2014    79.502
1900    52.578
1901    22.30
2014    15.25

The results should look like:

Year   Precipitation

1900   103.378
1901   106.12
2014   108.976

So far I wrote a code but it does not work, if anybody can fix it?

data=read.table('precipitation.csv',header=T,sep=',')
frame=data.frame(data)
cumcum=tapply(frame$Precipitation, cumsum(frame$year==1), FUN=sum, na.rm=TRUE)

Thanks

380

asked Mar 31 '15 05:03

Juvin

2 Answers

Try data.table

library(data.table)
frame=fread('precipitation.csv',header=TRUE,sep=',')    
frame[, sum(Precipitation), by = Year]

138

answered Oct 06 '22 00:10

vrajs5

1 liner -- try:

aggregate(frame['Precipitation'], by=frame['Year'], sum)

Reference: Consolidate duplicate rows

answered Oct 05 '22 23:10

csiu

Related questions
                            
                                R: Converting nested list to dataframe and get names of list levels als factors
                            
                                Calculate surface area of a 3D mesh
                            
                                Embedding Shiny app in knitr document
                            
                                Repeating or looping an argument
                            
                                Calculate elapsed time since last event
                            
                                corrplot shows insignificant correlation coefficients even when insig = "blank" is set
                            
                                labeling row and col names when using dist() and as.matrix()
                            
                                R create adjacency matrix according to columns from data.frame
                            
                                Double for-loop operation in R (with an example)
                            
                                Overlap image plot on a Google Map background in R
                            
                                Is it okay to modify a mapped matrix in RcppEigen?
                            
                                R: Converting output from getSymbols() to data frame in one command without calling the object name explicitly
                            
                                How to vectorize this loop
                            
                                get bounding box from ggmap object
                            
                                How to convert foreach in Stata to R?
                            
                                R - Vertex attributes - 'Inappropriate value given in set.vertex.attribute.'
                            
                                R write dataframe column to csv having leading zeroes
                            
                                Why do R and statsmodels give slightly different ANOVA results?
                            
                                Formatted table output, printing into R console
                            
                                Count the occurrence of one vector's values in another vector

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With