Aggregate (count) rows that match a condition, group by unique values

It seems like such a simple problem, yet I've been pulling my hair out trying to get this to work:

Given this data frame identifying the interactions id had with contact, who is grouped by contactGrp,

head(data)
   id               sesTs  contact    contactGrp   relpos   maxpos
1 6849 2012-06-25 15:58:34   peter        west    0.000000      3
2 6849 2012-06-25 18:24:49   sarah        south   0.500000      3
3 6849 2012-06-27 00:13:30   sarah        south   1.000000      3
4 1235 2012-06-29 17:49:35   peter        west    0.000000      2
5 1235 2012-06-29 23:56:35   peter        west    1.000000      2
6 5893 2012-06-30 22:21:33   carl         east    0.000000      1

how many contacts were there for each of unique(data$contactGrp) with relpos=1 and maxpos>1?

An expected result would be:

1 west   1
2 south  1
3 east   0

A small subset of the lines I have tried:

- aggregate(data, by=list('contactGrp'), FUN=count): yields an error, and does no filtering
- using data.table: seems to require a key, which is not unique in this data…
- ddply(data,"contactGrp",summarise,count=???): not sure which function to use to fill the count column
- ddply(subset(data,maxpos>1 & relpos==0), c('contactGrp'), function(df)count(df$relpos)): works, but gives me an extra column x, and it feels like I've overcomplicated it…

SQL would be easy: Select contactGrp, count(*) as cnt from data where … Group by contactGrp, but I'm trying to learn R.

asked Jul 20 '12 13:07

Lukas Grebe

1 Answer

And here is the data.table solution (sessions here is the data frame the question calls data):

> library(data.table)
> dt <- data.table(sessions)
> dt[, length(contact[relpos == 0 & maxpos > 1]), by = contactGrp]
     contactGrp V1
[1,]       west  2
[2,]      south  0
[3,]       east  0

> dt[, length(contact[relpos == 1 & maxpos > 1]), by = contactGrp]
     contactGrp V1
[1,]       west  1
[2,]      south  1
[3,]       east  0
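
For comparison, here is a minimal base R sketch of the same count that can be pasted into a fresh session. It rebuilds the six rows from the question by hand (the sesTs column is left out because it plays no part in the count), and the names hit, grp, cnt and result are made up for this illustration. It relies on the fact that summing a logical vector counts its TRUE values, which is one way to do this, not necessarily the best one for large data.

```r
# Rebuild the six rows shown by head(data) in the question
# (named `sessions` here, as in the answer above; sesTs omitted).
sessions <- data.frame(
  id         = c(6849, 6849, 6849, 1235, 1235, 5893),
  contact    = c("peter", "sarah", "sarah", "peter", "peter", "carl"),
  contactGrp = c("west", "south", "south", "west", "west", "east"),
  relpos     = c(0, 0.5, 1, 0, 1, 0),
  maxpos     = c(3, 3, 3, 2, 2, 1),
  stringsAsFactors = FALSE
)

# Logical vector: TRUE for rows that satisfy the filter.
hit <- sessions$relpos == 1 & sessions$maxpos > 1

# Fix the group order to the order of first appearance (west, south, east).
grp <- factor(sessions$contactGrp, levels = unique(sessions$contactGrp))

# Summing a logical vector counts its TRUE values; tapply does this per group,
# so groups with no matching rows still appear with a count of 0.
cnt <- tapply(hit, grp, sum)

result <- data.frame(
  contactGrp = names(cnt),
  cnt        = as.integer(cnt),
  stringsAsFactors = FALSE
)
print(result)
```

On these six rows this gives west 1, south 1, east 0, matching the expected result in the question. The same idea carries over to the data.table call above: dt[, sum(relpos == 1 & maxpos > 1), by = contactGrp] should return the same counts as the length(...) version.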

answered Sep 18 '22 15:09

Ryogi
