I have this dataframe: <pre class="prettyprint"><code>x <- data.frame( name = rep(letters[1:4], each = 2), condition = rep(c("A", "B"), times = 4), value = c(2,10,4,20,8,40,20,100) ) # name condition value # 1 a A 2 # 2 a B 10 # 3 b A 4 # 4 b B 20 # 5 c A 8 # 6 c B 40 # 7 d A 20 # 8 d B 100 </code></pre> I want to group by name and divide the value of rows with <code>condition == "B"</code> with those with <code>condition == "A"</code>, to get this: <pre class="prettyprint"><code>data.frame( name = letters[1:4], value = c(5,5,5,5) ) # name value # 1 a 5 # 2 b 5 # 3 c 5 # 4 d 5 </code></pre> I know something like this can get me pretty close: <pre class="prettyprint"><code>x$value[which(x$condition == "B")]/x$value[which(x$condition == "A")] </code></pre> but I was wondering if there was an easy way to do this with dplyr (My dataframe is a toy example and I got to it by chaining multiple <code>group_by</code> and <code>summarise</code> calls).

Using <code>data.table</code>, convert the 'data.frame' to 'data.table' (<code>setDT(x)</code>), grouped by 'name', we divide the 'value' corresponds to 'B' condition by the those that corresponds to 'A' 'condition'. <pre class="prettyprint"><code>library(data.table) setDT(x)[,.(value = value[condition=="B"]/value[condition=="A"]) , name] # name value #1: a 5 #2: b 5 #3: c 5 #4: d 5 </code></pre> Or reshape from 'long' to 'wide' and divide the 'B' column by 'A'. <pre class="prettyprint"><code>dcast(setDT(x), name~condition, value.var='value')[, .(name, value = B/A)] </code></pre>

Try: <pre class="prettyprint"><code>x %>% group_by(name) %>% summarise(value = value[condition == "B"] / value[condition == "A"]) </code></pre> Which gives: <pre class="prettyprint"><code>#Source: local data frame [4 x 2] # # name value # (fctr) (dbl) #1 a 5 #2 b 5 #3 c 5 #4 d 5 </code></pre>

How to divide between groups of rows using dplyr?

Tags:

dataframe

r

dplyr

I have this dataframe:

x <- data.frame(
    name = rep(letters[1:4], each = 2),
    condition = rep(c("A", "B"), times = 4),
    value = c(2,10,4,20,8,40,20,100)
) 
#   name condition value
# 1    a         A     2
# 2    a         B    10
# 3    b         A     4
# 4    b         B    20
# 5    c         A     8
# 6    c         B    40
# 7    d         A    20
# 8    d         B   100

I want to group by name and divide the value of rows with condition == "B" with those with condition == "A", to get this:

data.frame(
    name = letters[1:4],
    value = c(5,5,5,5)
)
#   name value
# 1    a     5
# 2    b     5
# 3    c     5
# 4    d     5

I know something like this can get me pretty close:

x$value[which(x$condition == "B")]/x$value[which(x$condition == "A")]

but I was wondering if there was an easy way to do this with dplyr (My dataframe is a toy example and I got to it by chaining multiple group_by and summarise calls).

779

asked May 25 '16 21:05

nachocab

2 Answers

Using data.table, convert the 'data.frame' to 'data.table' (setDT(x)), grouped by 'name', we divide the 'value' corresponds to 'B' condition by the those that corresponds to 'A' 'condition'.

library(data.table)
setDT(x)[,.(value = value[condition=="B"]/value[condition=="A"]) , name]
#    name value
#1:    a     5
#2:    b     5
#3:    c     5
#4:    d     5

Or reshape from 'long' to 'wide' and divide the 'B' column by 'A'.

dcast(setDT(x), name~condition, value.var='value')[, .(name, value = B/A)]

119

answered Sep 21 '22 06:09

akrun

Try:

x %>% 
  group_by(name) %>%
  summarise(value = value[condition == "B"] / value[condition == "A"])

Which gives:

#Source: local data frame [4 x 2]
#
#    name value
#  (fctr) (dbl)
#1      a     5
#2      b     5
#3      c     5
#4      d     5

answered Sep 19 '22 06:09

Steven Beaupré

Related questions
                            
                                Add variable to nested list
                            
                                How to run a package's testthat tests
                            
                                geom_density_ridges requires the following missing aesthetics: y
                            
                                Extract colnames from a nested list of data.frames
                            
                                Using R with Apache & PHP [closed]
                            
                                Split vector of strings and paste subset of resulting elements into a new vector
                            
                                Calculate within categories: Equivalent of R's ddply in Python?
                            
                                Convert Python to R
                            
                                How to overlay two geom_bar?
                            
                                R xts and data.table
                            
                                Constructing 3D array in Rcpp
                            
                                How do I concatenate String and an output evaluated from a function in R?
                            
                                Cartesian join in data.table
                            
                                Replace NA with previous and next rows mean in R
                            
                                Plot map with values for countries as color in R?
                            
                                How to have only every other border in a persp
                            
                                Combine multiple data frames and calculate average
                            
                                Why does dplyr's mutate() change the time format?
                            
                                Merge multiple data tables with duplicate column names
                            
                                How to use fread() as readLines() without auto column detection?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With