I'm trying to identify if MATLAB or R has a function that resembles the following. Say I have an input vector <code>v</code>. <pre class="prettyprint"><code>v = [1, 3, 1, 2, 4, 2, 1, 3] </code></pre> I want to generate a vector, <code>w</code> of equivalent length to <code>v</code>. Each element <code>w[i]</code> should tell me the following: for the corresponding value <code>v[i]</code>, how many times has this value been encountered so far in <code>v</code>, i.e. in all elements of <code>v</code> up to, but not including, position <code>i</code>. In this example <pre class="prettyprint"><code>w = [0, 0, 1, 0, 0, 1, 2, 1] </code></pre> I'm really looking to see if any statistical or domain-specific languages have a function/instruction like this and what it might be called.

In <code>R</code>, you can try this: <pre class="prettyprint"><code> v <- c(1,3,1,2,4,2,1,3) ave(v, v, FUN=seq_along)-1 #[1] 0 0 1 0 0 1 2 1 </code></pre> <h3>Explanation</h3> <pre class="prettyprint"><code> ave(seq_along(v), v, FUN=seq_along) #It may be better to use `seq_along(v)` considering different classes i.e. `factor` also. #[1] 1 1 2 1 1 2 3 2 </code></pre> Here, we are grouping the sequence of elements by <code>v</code>. For elements that match the same group, the <code>seq_along</code> function will create <code>1,2,3 etc</code>. In the case of <code>v</code>, the elements of same group <code>1</code> are in positions <code>1,3,7</code>, so those corresponding positions will be <code>1,2,3</code>. By subtracting with <code>1</code>, we will be able to start from <code>0</code>. To understand it better, <pre class="prettyprint"><code> lst1 <- split(v,v) lst2 <- lapply(lst1, seq_along) unsplit(lst2, v) #[1] 1 1 2 1 1 2 3 2 </code></pre> Using <code>data.table</code> <pre class="prettyprint"><code> library(data.table) DT <- data.table(v, ind=seq_along(v)) DT[, n:=(1:.N)-1, by=v][,n[ind]] #[1] 0 0 1 0 0 1 2 1 </code></pre>

Function/instruction to count number of times a value has already been seen

Tags:

r

programming-languages

matlab

dsl

instructions

I'm trying to identify if MATLAB or R has a function that resembles the following.

Say I have an input vector v.

v = [1, 3, 1, 2, 4, 2, 1, 3]

I want to generate a vector, w of equivalent length to v. Each element w[i] should tell me the following: for the corresponding value v[i], how many times has this value been encountered so far in v, i.e. in all elements of v up to, but not including, position i. In this example

w = [0, 0, 1, 0, 0, 1, 2, 1]

I'm really looking to see if any statistical or domain-specific languages have a function/instruction like this and what it might be called.

421

asked Aug 20 '14 09:08

hayesti

1 Answers

In R, you can try this:

 v <- c(1,3,1,2,4,2,1,3)
 ave(v, v, FUN=seq_along)-1
 #[1] 0 0 1 0 0 1 2 1

Explanation

 ave(seq_along(v), v, FUN=seq_along)  #It may be better to use `seq_along(v)` considering different classes i.e. `factor` also.
 #[1] 1 1 2 1 1 2 3 2

Here, we are grouping the sequence of elements by v. For elements that match the same group, the seq_along function will create 1,2,3 etc. In the case of v, the elements of same group 1 are in positions 1,3,7, so those corresponding positions will be 1,2,3. By subtracting with 1, we will be able to start from 0.

To understand it better,

 lst1 <- split(v,v)
 lst2 <- lapply(lst1, seq_along)
 unsplit(lst2, v)
 #[1] 1 1 2 1 1 2 3 2

Using data.table

  library(data.table)
  DT <- data.table(v, ind=seq_along(v))
  DT[, n:=(1:.N)-1, by=v][,n[ind]]
  #[1] 0 0 1 0 0 1 2 1

195

answered Sep 21 '22 00:09

akrun

Related questions
                            
                                How do I use plyr to number rows?
                            
                                Margin totals in xtabs
                            
                                Create "missing objects" (aka: "empty symbols" , "empty objects") / needed for formals manipulation/
                            
                                Is there a way to automatically update the documentation in an R package?
                            
                                Makefile for dummies? Mac OS X
                            
                                Is there a way to install R packages using emacs?
                            
                                Removing columns with missing values
                            
                                Efficiently average the second column by intervals defined by the first column
                            
                                Which algorithm I can use to find common adjacent words/ pattern recognition?
                            
                                retrieve row and column name of particular cell in R
                            
                                for() loop step width
                            
                                Initialize a list of matrices in R
                            
                                How to install the fftw3 package of R in ubuntu 12.04?
                            
                                How can I print a table in R with ascii, html, or markdown formatting?
                            
                                "package ‘mgcv’ could not be loaded" only in RStudio
                            
                                Dynamic arguments to expand.grid
                            
                                How to subset data.frames stored in a list?
                            
                                How to remove empty columns in R?
                            
                                Remove zeros in the start and end of a vector
                            
                                Specifying the scale for the density in ggplot2's stat_density2d

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With