
Observation number by group [duplicate]

Tags: r, sas

In R I have a data frame with observations described by several values, one of which is a factor. I have sorted the dataset by this factor and would like to add a column that numbers the observations within each level of the factor, e.g.

factor   obsnum
a        1
a        2
a        3
b        1
b        2
b        3
b        4
c        1
c        2
...

In SAS I do it with something like:

data logs.full;
    set logs.full;
    count + 1;
    by cookie;
    if first.cookie then count = 1;
run;

How can I achieve that in R?

Thanks,


asked Nov 21 '11 08:11

twowo

1 Answer

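Use rle (run-length encoding) and sequence. Because rle counts runs of consecutive equal values, this relies on the data already being sorted by the grouping variable: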

x <- c("a", "a", "a", "b", "b", "b", "b", "c", "c")

data.frame(
    x = x,
    # rle() gives the length of each run; sequence() expands each length n into 1..n
    obsnum = sequence(rle(x)$lengths)
)

  x obsnum
1 a      1
2 a      2
3 a      3
4 b      1
5 b      2
6 b      3
7 b      4
8 c      1
9 c      2
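
The example above uses a character vector, while the question mentions a factor column in a data frame. The following is a minimal sketch of how the same idea might be applied in that case. The data frame `df` and the result column names are made up for illustration, and the column name `cookie` is borrowed from the SAS snippet. Note that rle() rejects factors, so the column is converted with as.character() first. The second approach, based on ave(), is a swapped-in alternative rather than part of the original answer; it numbers rows within each group and does not depend on the data being sorted.

```r
# Hypothetical example data; `cookie` is the grouping factor.
df <- data.frame(
    cookie = factor(c("a", "a", "a", "b", "b", "b", "b", "c", "c"))
)

# rle() requires an atomic vector, so convert the factor to character first.
# This assumes df is already sorted by cookie.
df$obsnum <- sequence(rle(as.character(df$cookie))$lengths)

# Alternative: ave() applies seq_along() within each level of cookie,
# so it also works when the rows are not sorted by group.
df$obsnum_ave <- ave(seq_along(df$cookie), df$cookie, FUN = seq_along)

print(df)
```

On sorted data both columns should agree; on unsorted data only the ave() version numbers observations per group rather than per run.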


answered Oct 30 '22 17:10

Andrie
