Add a column of ranks

Tags:

r

ranking

I have some data:

test <- data.frame(A=c("aaabbb",
"aaaabb",
"aaaabb",
"aaaaab",
"bbbaaa")
)

and so on. All the elements are the same length, and are already sorted before I get them.

I need to make a new column of ranks, "First", "Second", "Third", anything after that can be left blank, and it needs to account for ties. So in the above case, I'd like to get the following output:

Click to copy

   A       B
 aaabbb  First
 aaaabb  Second
 aaaabb  Second
 aaaaab  Third
 bbbaaa
 bbbbaa

I looked at rank() and some other posts that used it, but I wasn't able to get it to do what I was looking for.

622

asked Jun 13 '13 22:06

pak

2 Answers

How about this:

Click to copy

test$B <- match(test$A , unique(test$A)[1:3] )
test
       A  B
1 aaabbb  1
2 aaaabb  2
3 aaaabb  2
4 aaaaab  3
5 bbbaaa NA
6 bbbbaa NA

One of many ways to do this. Possibly not the best, but one that readily springs to mind and is fairly intuitive. You can use unique because you receive the data pre-sorted.

As data is sorted another suitable function worth considering is rle, although it's slightly more obtuse in this example:

Click to copy

rnk <- rle(as.integer(df$A))$lengths
rnk
# [1] 1 2 1 1 1
test$B <- c( rep( 1:3 , times = rnk[1:3] ) , rep(NA, sum( rnk[-c(1:3)] ) ) )

rle computes the lengths (and values which we don't really care about here) of runs of equal values in a vector - so again this works because your data are already sorted.

And if you don't have to have blanks after the third ranked item it's even simpler (and more readable):

Click to copy

test$B <- rep(1:length(rnk),times=rnk)

165

answered Sep 27 '22 00:09

Simon O'Hanlon

This seems like a good application for factors:

Click to copy

test$B <- as.numeric(factor(test$A, levels = unique(test$A)))

cumsum also comes to mind, where we add 1 every time the value changes:

Click to copy

test$B <- cumsum(c(TRUE, tail(test$A, -1) != head(test$A, -1)))

(Like @Simon said, there are many ways to do this...)

answered Sep 23 '22 00:09

flodel

Related questions
                            
                                inset \footnote{} into header with xtable and tabular.environment
                            
                                R does ggplot2 have interactivity option?
                            
                                Determining derivatives from GAM smooth object
                            
                                How to put square brackets and a subscript next to each other in R expression?
                            
                                Reshaping wide dataset in interval format
                            
                                Find all possible ways to split a list of elements into a a given number of group of the same size
                            
                                ggplot2/colorbrewer qualitative pallette with 125 categories
                            
                                Producing RNG vectors in R that have pre-defined sum of pdf or sum of cdf
                            
                                combining column number having same values
                            
                                Wrapping plots in another html container within an Rmd file
                            
                                How to set up a shortcut for different versions of R in RStudio?
                            
                                How to create design matrix in r
                            
                                Simpler way to reconstitute a melted data frame back to the original
                            
                                Interpolating data in R
                            
                                Formal argument "type" matched by multiple actual arguments
                            
                                Smallest size of GGplot2 geom_text()
                            
                                tableGrob: set the height and width of a grid.table
                            
                                How can I ensure that a partition has representative observations from each level of a factor?
                            
                                R Basket Analysis using arules package with unique order number but duplicate order combinations
                            
                                Multi-level regression model on multiply imputed data set in R (Amelia, zelig, lme4)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Add a column of ranks

Tags:

r

ranking

pak

People also ask

2 Answers

Simon O'Hanlon

flodel

Recent Activity

Donate For Us