Difference between dplyr:mutate and transform when using pmin and pmax?

Tags:

r

dplyr

While trying to answer this question, I encountered a difference between mutate and transform in what I expected to be equivalent operations.

# data
x <- data.frame(a=c(rep(0,10),rep(1,10),3),b=c(1:10,0,11:19,0))

#transform
transform(x,a=pmin(a,b), b=pmax(a,b))
   a  b
1  0  1
2  0  2
3  0  3
4  0  4
5  0  5
6  0  6
7  0  7
8  0  8
9  0  9
10 0 10
11 0  1
12 1 11
13 1 12
14 1 13
15 1 14
16 1 15
17 1 16
18 1 17
19 1 18
20 1 19
21 0  3

#mutate
libarary(dplyr)
x %>% mutate(a=pmin(a,b), b=pmax(a,b))
   a  b
1  0  1
2  0  2
3  0  3
4  0  4
5  0  5
6  0  6
7  0  7
8  0  8
9  0  9
10 0 10
11 0  0
12 1 11
13 1 12
14 1 13
15 1 14
16 1 15
17 1 16
18 1 17
19 1 18
20 1 19
21 0  0

Note the differences in lines 11 and 21. I suspect that mutate is mutating the data as it goes and therefore, pmax is not seeing the original data. Is this correct? Is it a bug, or by design?

577

asked Jul 14 '14 18:07

James

1 Answers

It appears my suspicions are correct, and that it is by design to allow the use of computed variables immediately afterwards, eg:

data.frame(a=1:4,b=5:8) %>% mutate(sum=a+b, letter=letters[sum])
  a b sum letter
1 1 5   6      f
2 2 6   8      h
3 3 7  10      j
4 4 8  12      l

In order to replicate the expected behaviour from transform one needs to simply reference the variable directly:

x %>% mutate(a=pmin(x$a,x$b), b=pmax(x$a,x$b))
   a  b
1  0  1
2  0  2
3  0  3
4  0  4
5  0  5
6  0  6
7  0  7
8  0  8
9  0  9
10 0 10
11 0  1
12 1 11
13 1 12
14 1 13
15 1 14
16 1 15
17 1 16
18 1 17
19 1 18
20 1 19
21 0  3

answered Dec 26 '22 07:12

James

Related questions
                            
                                Complex R Shiny input binding issue with datatable
                            
                                scraping asp javascript paginated tables behind search with R
                            
                                Date column coerced to numeric when indexing dataframe with [[ and a vector
                            
                                "API rate limit exceeded" when trying to install local R package using devtools::install()
                            
                                Parallel optimization in R
                            
                                How to count how many elements satisfy a condition in an idiomatic way?
                            
                                Binning Dates in R
                            
                                How to read quoted text containing escaped quotes
                            
                                Convert plural nouns into singular nouns
                            
                                Method initialisation in R reference classes
                            
                                Merging data frames without duplicating rows
                            
                                Can't set limits with coord_trans
                            
                                Include dimension names in row and column headers for LaTeX-formatted contingency table
                            
                                convert difftime time to years, months and days
                            
                                Is there a standard way to document data frames?
                            
                                Processing example files with roxygen2: backslashes are duplicated (\dontrun becomes \\dontrun)
                            
                                Bailing out from error in large `sapply`
                            
                                Plot county level data with tooltips in R
                            
                                How to separate geom_vline() and geom_hline() legends from other legends in ggplot2
                            
                                capturing cat output periodically for R shiny output (renderPrint)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Difference between dplyr:mutate and transform when using pmin and pmax?

Tags:

r

dplyr

James

People also ask

1 Answers

James

Recent Activity

Donate For Us