I have a df like <pre class="prettyprint"><code>ProjectID Dist 1 x 1 y 2 z 2 x 2 h 3 k .... .... </code></pre> I want to add a third column such that we have an incrementing counter for each ProjectID: <pre class="prettyprint"><code>ProjectID Dist counter 1 x 1 1 y 2 2 z 1 2 x 2 2 h 3 1 k 3 .... .... </code></pre> I've had a look at <code>seq</code> <code>rank</code> and a couple of other bits particularly looking to see if I could use <code>ddply</code> to help: <pre class="prettyprint"><code>df$counter <- ddply(df,.(projectID), function(x).....? ) </code></pre> I think I could adapt this answer How to create a counter/numeration by group? but would prefer something using something like ddply (I can't find an equivalent of cumsum but I think that's the same principle here: Create ascending series of integers by group in Pandas ). That'd let me index occurrences in a list (and e.g. merge on this).

A <code>dplyr</code> solution is quite simple: <pre class="prettyprint"><code>library(dplyr) df %>% group_by(ProjectID) %>% mutate(counter = row_number(ProjectID)) # ProjectID Dist counter #1 1 x 1 #2 1 y 2 #3 2 z 1 #4 2 x 2 #5 2 h 3 #6 1 k 3 </code></pre>

Add an index (or counter) to a dataframe by group in R [duplicate]

Tags:

indexing

r

counter

seq

plyr

I have a df like

ProjectID Dist
  1        x
  1        y
  2        z
  2        x
  2        h
  3        k
  ....     ....

I want to add a third column such that we have an incrementing counter for each ProjectID:

ProjectID Dist counter
  1        x     1
  1        y     2
  2        z     1
  2        x     2
  2        h     3
  1        k     3
  ....     ....

I've had a look at seq rank and a couple of other bits particularly looking to see if I could use ddply to help:

df$counter <- ddply(df,.(projectID), function(x).....? )

I think I could adapt this answer How to create a counter/numeration by group? but would prefer something using something like ddply (I can't find an equivalent of cumsum but I think that's the same principle here: Create ascending series of integers by group in Pandas ). That'd let me index occurrences in a list (and e.g. merge on this).

574

asked Feb 21 '15 16:02

sjgknight

1 Answers

A dplyr solution is quite simple:

library(dplyr)

df %>% group_by(ProjectID) %>% mutate(counter = row_number(ProjectID))


#  ProjectID Dist counter
#1         1    x       1
#2         1    y       2
#3         2    z       1
#4         2    x       2
#5         2    h       3
#6         1    k       3

106

answered Sep 29 '22 16:09

jalapic

Related questions
                            
                                Convert a printed message into a character vector
                            
                                dplyr, do(), extracting parameters from model without losing grouping variable
                            
                                parRF on caret not working for more than one core
                            
                                How to use tryCatch in R
                            
                                Splitting knitr Chunk code and output into two different knitrouts
                            
                                Split column name and convert data from wide to long format in R
                            
                                Plotting large number of time series using ggplot. Is it possible to speed up?
                            
                                rPython using wrong python installation on Mac OSX
                            
                                Using Summary function inside Data.table
                            
                                Making igraph clearer to read
                            
                                Using the same argument names for a function defined inside another function
                            
                                How can dplyr generate data frame for each group after the group_by operation?
                            
                                R install package RevoScaleR
                            
                                ggplot annotate with greek symbol and (1) apostrophe or (2) in between text
                            
                                How to normalise subgroups from a grouped data frame in R
                            
                                controlling color of factor group in ggvis - r
                            
                                How to use stat_bin2d() to compute counts labels in ggplot2?
                            
                                Correct use of fun.data with stat_summary in ggplot2?
                            
                                C++11 with R and Rcpp: supported by CRAN policies?
                            
                                Fitting a polynomial with a known intercept

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With