ggplot scale transformation acts differently on points and functions

Tags:

I'm trying to plot a distribution CDF using R and ggplot2. However, I am finding difficulties in plotting the CDF function after I transform the Y axis to obtain a straight line. This kind of plot is frequently used in Gumbel paper plots, but here I'll use as example the normal distribution.

I generate the data, and plot the cumulative density function of the data along with the function. They fit well. However, when I apply an Y axis transformation, they don't fit anymore.

sim <- rnorm(100) #Simulate some data
sim <- sort(sim)  #Sort it

cdf <- seq(0,1,length.out=length(sim)) #Compute data CDF

df <- data.frame(x=sim, y=cdf) #Build data.frame

library(scales)
library(ggplot2)

#Now plot!
gg <- ggplot(df, aes(x=x, y=y)) +
        geom_point() +
        stat_function(fun = pnorm, colour="red")
gg

And the output should be something on the lines of: enter image description here Good!

Now I try to transform the Y axis according to the distribution used.

#Apply transformation
gg + scale_y_continuous(trans=probability_trans("norm"))

And the result is: enter image description here

The points are transformed correctly (they lie on a straight line), but the function is not!

However, everything seems to work fine if I do like this, calculating the CDF with ggplot:

ggplot(data.frame(x=sim), aes(x=x)) +
  stat_ecdf(geom = "point") +
  stat_function(fun="pnorm", colour="red") +
  scale_y_continuous(trans=probability_trans("norm"))

The result is OK: This wokrs OK

Why is this happening? Why doesn't calculating the CDF manually work with scale transformations?

848

asked May 15 '16 09:05

AF7

1 Answers

This works:

gg <- ggplot(df, aes(x=x, y=y)) +
  geom_point() +
  stat_function(fun ="pnorm", colour="red", inherit.aes = FALSE) +
  scale_y_continuous(trans=probability_trans("norm"))
gg

enter image description here

Possible explanation:

Documentation States: inherit.aes If FALSE, overrides the default aesthetics, rather than combining with them. This is most useful for helper functions that define both data and aesthetics and shouldn't inherit behaviour from the default plot specification, e.g. borders.

My guess: As scale_y_continuous changes the aesthetics of the main plot, we need to turn off the default inherit.aes=TRUE. It seems inherit.aes=TRUE in stat_function picks its aesthetics from the first layer of the plot, and so the scale transformation does not impact unless specifically chosen to.

156

answered Sep 17 '22 21:09

Divi

Related questions
                            
                                Return a list in dplyr mutate()
                            
                                RStudio knitr themes
                            
                                Importing and accessing large data files in Shiny
                            
                                R: multi-index on columns and/or rows
                            
                                How to use XGBoost algorithm for regression in R?
                            
                                The use of quotation marks when loading a package in R
                            
                                R: what is NA_character_?
                            
                                How do I use roxygen to document a R package that includes a function with the same name?
                            
                                using tidyverse; counting after and before change in value, within groups, generating new variables for each unique shift
                            
                                Synchronise Dygraph and DateRangeInput in Shiny
                            
                                How to write sf object as shapefile to ESRI file geodatabase with st_write?
                            
                                How to flatten the data of different data types by using Sparklyr package?
                            
                                How can I use the new bs4() theme in bookdown?
                            
                                R ggplot and facet grid: how to control x-axis breaks
                            
                                r - ggplot2 - create a shaded region between two geom_abline layers
                            
                                Code styling for black and white documents
                            
                                How to use useDynLib() correctly in an R package namespace file
                            
                                Designing multivariate density plot in R
                            
                                How is xgboost quality calculated?
                            
                                R shiny datatable filter box size to narrow to see text

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

ggplot scale transformation acts differently on points and functions

Tags:

r

ggplot2

transform

normal-distribution

cdf

AF7

People also ask

1 Answers

Divi

Recent Activity

Donate For Us