I'm struggling to solve this problem in R. I have data like this: <pre class="prettyprint"><code>item id 1 500 2 500 2 600 2 700 3 500 3 600 data.frame(item = c(1, 2, 2, 2, 3, 3), id = c(500, 500, 600, 700, 500, 600)) </code></pre> And I want to count the number of times a pair of items is linked to the same id. So I want this output: <pre class="prettyprint"><code>item1 item2 count 1 2 1 2 3 2 1 3 2 </code></pre> I've tried approaching this with commands like: <pre class="prettyprint"><code>x_agg = aggregate(x, by=list(x$id), c) </code></pre> and then <pre class="prettyprint"><code>x_agg_id = lapply(x_agg$item, unique) </code></pre> thinking that I could then count the occurrence of each item. But the <code>by</code> function seems to create an object of lists, which I don't know how to manipulate. I am hoping there is a simpler way....

<pre class="prettyprint"><code># your data df<-read.table(text="item id 1 500 2 500 2 600 2 700 3 500 3 600",header=TRUE) library(tnet) item_item<-projecting_tm(df, method="sum") names(item_item)<-c("item1","item2","count") item_item #item1 item2 count #1 1 2 1 #2 1 3 1 #3 2 1 1 #4 2 3 2 #5 3 1 1 #6 3 2 2 </code></pre> EDIT how many ids and items do you have? you could always rename things. e.g. <pre class="prettyprint"><code>numberitems<-length(unique(df$id))+9000 items<-data.frame(item=unique(df$item),newitems=c(9000:(numberitems-1))) numberids<-length(unique(df$id))+1000 ids<-data.frame(id=unique(df$id),newids=c(1000:(numberids-1))) newdf<-merge(df,items,by="item") newdf<-merge(newdf,ids,by="id") DF<-data.frame(item=newdf$newitems,id=newdf$newids) library(tnet) item_item<-projecting_tm(DF, method="sum") names(item_item)<-c("item1","item2","count") </code></pre> then merge back the original names afterwards....

Count item pairs linked by column value

Tags:

r

aggregation

I'm struggling to solve this problem in R. I have data like this:

item   id
1      500
2      500
2      600
2      700
3      500
3      600

data.frame(item = c(1, 2, 2, 2, 3, 3),
           id = c(500, 500, 600, 700, 500, 600))

And I want to count the number of times a pair of items is linked to the same id. So I want this output:

item1    item2    count
    1        2        1
    2        3        2
    1        3        2

I've tried approaching this with commands like:

x_agg = aggregate(x, by=list(x$id), c)

and then

x_agg_id = lapply(x_agg$item, unique)

thinking that I could then count the occurrence of each item. But the by function seems to create an object of lists, which I don't know how to manipulate. I am hoping there is a simpler way....

992

asked Aug 22 '12 11:08

Harry Palmer

1 Answers

# your data
df<-read.table(text="item   id
1      500
2      500
2      600
2      700
3      500
3      600",header=TRUE)


library(tnet)
item_item<-projecting_tm(df, method="sum")
names(item_item)<-c("item1","item2","count")

item_item

  #item1 item2 count
#1     1     2     1
#2     1     3     1
#3     2     1     1
#4     2     3     2
#5     3     1     1
#6     3     2     2

EDIT

how many ids and items do you have? you could always rename things. e.g.

numberitems<-length(unique(df$id))+9000
items<-data.frame(item=unique(df$item),newitems=c(9000:(numberitems-1)))
numberids<-length(unique(df$id))+1000
ids<-data.frame(id=unique(df$id),newids=c(1000:(numberids-1)))
newdf<-merge(df,items,by="item")
newdf<-merge(newdf,ids,by="id")
DF<-data.frame(item=newdf$newitems,id=newdf$newids)

library(tnet)
item_item<-projecting_tm(DF, method="sum")
names(item_item)<-c("item1","item2","count")

then merge back the original names afterwards....

126

answered Sep 30 '22 11:09

user1317221_G

Related questions
                            
                                How to avoid <<- by using assign
                            
                                Simple conversion to edgelist with R?
                            
                                Working with lots of data and lots of rasters in R?
                            
                                Programmatically specifying colours in scale_fill_manual ggplot call
                            
                                Automatic E-mailing of pdf graphical output at specific times from R
                            
                                Is higher resolution coastline data readily available for R
                            
                                translation (recoding) error in r
                            
                                Showing separate legend for a geom_text layer?
                            
                                R: how to delete columns in a data.table?
                            
                                Cluster assignments differ sometimes in two DBSCAN implementations
                            
                                Differences between hash and lists in R
                            
                                R2PPT crashes R; are there alternatives to R2PPT?
                            
                                Omit x axis levels with no data in a facetted plot and change widths of the bars
                            
                                How to set the ranges of the values taken by ggplot2 stat_smooth() to fits lines?
                            
                                order while splitting (eg. TA should be split to two column "A" in first "T" second) in r
                            
                                How to create a stacked bar chart from summarized data in ggplot2
                            
                                Adding text labels to ggplot2 scatterplot
                            
                                do.call in combination with "::"
                            
                                OS-independent way to select directory interactively in R
                            
                                How to extract everything until first occurrence of pattern

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With