Finding most frequent combinations

Question

I have a data frame with 2 columns, ID number and brand:

I want to find the top 3 brand combinations that occur together most frequently with regard to id number:

A89, A87
A32, A27
A12, A14

I tried: library(dplyr)

 df %>% 
  group_by(X1,X2) %>%
  mutate(n = n()) %>%
  group_by(X1) %>%
  slice(which.max(n)) %>%
  select(-n)

But it doesn't work correctly. I would appreciate any thoughts or ideas!

d.b · Accepted Answer

Here's a way to do it in base R. We split X2 by X1 and then get combination of two values for each subgroup. Then we grab the three most common ones.

with(data.frame(table(unlist(lapply(split(df$X2, df$X1), function(x)
    combn(unique(x), min(2, length(x)), paste, collapse = "-"))))),
    as.character(Var1[head(order(Freq, decreasing = TRUE), 3)]))
#[1] "A12-A14" "A32-A27" "A89-A87"

DATA

df = structure(list(X1 = c(1234L, 1234L, 1234L, 1234L, 1234L, 1234L, 
1235L, 1235L, 1235L, 1236L, 1236L, 1236L, 1236L, 1236L, 1236L, 
1236L, 1236L, 1237L, 1237L), X2 = c("A89", "A87", "A87", "A32", 
"A27", "A27", "A12", "A14", "A14", "A32", "A32", "A27", "A12", 
"A12", "A14", "A89", "A87", "A99", "A98")), .Names = c("X1", 
"X2"), class = "data.frame", row.names = c(NA, -19L))

Finding most frequent combinations

Tags:

r

anrpet

1 Answers

d.b

Recent Activity

Donate For Us

Finding most frequent combinations

Tags:

r

anrpet

1 Answers

d.b

Related questions

Recent Activity

Donate For Us