Build difference between groups with dplyr in r

Tags:

I am using dplyr and I am wondering whether it is possible to compute differences between groups in one line. As in the small example below, the task is to compute the difference between groups A and Bs standardized "cent" variables.

library(dplyr)
# creating a small data.frame
GROUP <- rep(c("A","B"),each=10)
NUMBE <- rnorm(20,50,10)
datf <- data.frame(GROUP,NUMBE)

datf2 <- datf %.% group_by(GROUP) %.% mutate(cent = (NUMBE - mean(NUMBE))/sd(NUMBE))

gA <- datf2 %.% ungroup() %.% filter(GROUP == "A") %.% select(cent)
gB <- datf2 %.% ungroup() %.% filter(GROUP == "B") %.% select(cent)

gA - gB

This is of course no problem by creating different objects - but is there a more "built in" way of performing this task? Something more like this not working fantasy code below?

datf2 %.% summarize(filter(GROUP == "A",select(cent)) - filter(GROUP == "B",select(cent)))

Thank you!

432

asked Mar 23 '14 10:03

Manuel

2 Answers

Given we have 10 of each group, add an index 1:10, 1:10 and summarize over that with difference:

> datf2$entry=c(1:10,1:10)
> datf2 %.% ungroup() %.% group_by(entry) %.% summarize(d=cent[1]-cent[2])
Source: local data frame [10 x 2]

   entry          d
1      1 -0.8272879
2      2 -0.9159827
3      3 -0.5064762
4      4  0.4211639
5      5  1.3681720
6      6  3.3430289
7      7  1.0086822
8      8 -0.6163907
9      9 -0.7325220
10    10 -2.5423875

compare:

> gA - gB
         cent
1  -0.8272879
2  -0.9159827
3  -0.5064762
4   0.4211639
5   1.3681720
6   3.3430289
7   1.0086822
8  -0.6163907
9  -0.7325220
10 -2.5423875

Is there a way to inject the entry field into the data or the dplyr call? I'm not sure, it seems to rely on the functions knowing too much about the data...

169

answered Sep 19 '22 21:09

Spacedman

Thank you for the inspiration. I further developed this solution to that:

mutate(datf2,diffence = filter(datf2, GROUP == "A")$cent - filter(datf2, GROUP == "B")$cent)

This adds the result as column in the the data.frame.

answered Sep 21 '22 21:09

Manuel

Related questions
                            
                                Matching IDs in two datasets
                            
                                Setting an image as the colour of a polygon
                            
                                TukeyHSD adjusted P value is 0.0000000
                            
                                Polar-transform image in R
                            
                                R script from command line
                            
                                In R how do you write a sparse matrix to a file?
                            
                                Compiling *.Rnw files with knitr --without Rstudio
                            
                                Merge Two Arrays in R
                            
                                subset with pattern
                            
                                dcast fun.aggregate parameters
                            
                                Flip the matrix
                            
                                R How to get confidence interval for multinominal logit?
                            
                                $ operator is invalid for atomic vectors
                            
                                R multiple urls into lapply
                            
                                Change title fontsize in heatmap.2 function?
                            
                                optim function argument missing
                            
                                How do I make my facets perfectly square?
                            
                                Parallelization in R: how to "source" on every node?
                            
                                How do I get a data.frame from R's aggregate function in the right format?
                            
                                how to scrape this squawka page?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Build difference between groups with dplyr in r

Tags:

r

dplyr

difference

statistics

Manuel

People also ask

2 Answers

Spacedman

Manuel

Recent Activity

Donate For Us