
Custom lm formula in geom_smooth

Tags: r, ggplot2

I'm working with faceted plots and adding fitted lines using the lm method in geom_smooth():

library(ggplot2)

d <- data.frame(n = c(100, 80, 60, 55, 50, 102, 78, 61, 42, 18),
                year = rep(2000:2004, 2),
                cat = rep(c("a", "b"), each = 5))

ggplot(d, aes(year, n, group = cat)) + geom_line() + geom_point() +
  facet_wrap(~cat, ncol = 1) +
  geom_smooth(method = "lm")

I would like to fit a second-order polynomial in the panels where it is the better model. I've written a function that picks the formula by comparing AIC:

lm.mod<-function(df){
  m1<-lm(n~year, data=df)
  m2<-lm(n~year+I(year^2), data=df)
  ifelse(AIC(m1)<AIC(m2), "y~x", "y~poly(x, 2)")
}

But I'm having trouble applying it. Any ideas, or better ways to approach this?


asked Nov 27 '13 10:11

Ed G

2 Answers

A single geom_smooth call applies the same method and formula to every panel, so it cannot fit a different model per facet. Here is a solution based on smoothing subsets of the data:

First, create the base plot without geom_smooth:

library(ggplot2)
p <- ggplot(d, aes(year, n, group = cat)) +
       geom_line() +
       geom_point() +
       facet_wrap( ~ cat, ncol = 1)

Second, use by() to create a separate geom_smooth layer for each level of cat (the faceting variable), with the formula chosen by lm.mod() for that subset. by() returns a list with one layer per level.

p_smooth <- by(d, d$cat, 
               function(x) geom_smooth(data=x, method = lm, formula = lm.mod(x)))

Now, you can add the list of geom_smooths to your base plot:

p + p_smooth

The plot includes a second-order polynomial for the upper panel and a linear smooth for the lower panel:

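[Figure: the faceted plot with a quadratic fit in panel a and a linear fit in panel b; image at https://i.stack.imgur.com/5fiuA.png]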
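A possible variation, in case your ggplot2 version refuses to add the by object directly (it carries the class "by" rather than being a plain list): build the layers with lapply() over split(), which gives an ordinary list. This is only a sketch under that assumption. The names pick_formula, smooth_layers and p_final are made up here, and pick_formula() returns formula objects instead of the strings that lm.mod() returns.

```r
library(ggplot2)

d <- data.frame(n = c(100, 80, 60, 55, 50, 102, 78, 61, 42, 18),
                year = rep(2000:2004, 2),
                cat = rep(c("a", "b"), each = 5))

# Hypothetical helper (not from the original post): same AIC comparison as
# lm.mod(), but it returns a formula object rather than a character string.
pick_formula <- function(df) {
  m1 <- lm(n ~ year, data = df)
  m2 <- lm(n ~ year + I(year^2), data = df)
  if (AIC(m1) < AIC(m2)) y ~ x else y ~ poly(x, 2)
}

# One geom_smooth layer per level of cat, collected in a plain list.
smooth_layers <- lapply(split(d, d$cat), function(x) {
  geom_smooth(data = x, method = "lm", formula = pick_formula(x))
})

p <- ggplot(d, aes(year, n, group = cat)) +
  geom_line() +
  geom_point() +
  facet_wrap(~cat, ncol = 1)

# Adding a plain list of layers adds each element in turn.
p_final <- p + smooth_layers
```

Printing p_final draws the plot. Each layer only carries the rows of its own subset, so it shows up only in the matching panel.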

answered Sep 18 '22 17:09

Sven Hohenstein

lm.mod <- function(df) {
  m1 <- lm(n ~ year, data = df)
  m2 <- lm(n ~ year + I(year^2), data = df)
  p <- ifelse(AIC(m1) < AIC(m2), "y~x", "y~poly(x, 2)")
  return(p)  # the explicit return is only a personal preference
}

library(ggplot2)
ggplot(d, aes(year, n, group = cat)) + geom_line() + geom_point() +
  facet_wrap(~cat, ncol = 1) +
  stat_smooth(method = lm, formula = lm.mod(d))
# Use stat_smooth() and pass the result of your function to formula =.
# Note: lm.mod(d) is evaluated once on the whole data frame,
# so both panels are fitted with the same formula.

# To test, reverse the condition; you should then get a polynomial.
# lm.mod <- function(df) {
#   m1 <- lm(n ~ year, data = df)
#   m2 <- lm(n ~ year + I(year^2), data = df)
#   p <- ifelse(AIC(m1) > AIC(m2), "y~x", "y~poly(x, 2)")
#   return(p)
# }
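To see which formula each panel would get, lm.mod() can be run on the whole data frame and on each level of cat separately. A small base-R sketch using the data and function from the question; pooled and per_panel are names made up for this illustration.

```r
d <- data.frame(n = c(100, 80, 60, 55, 50, 102, 78, 61, 42, 18),
                year = rep(2000:2004, 2),
                cat = rep(c("a", "b"), each = 5))

# lm.mod() as posted in the question
lm.mod <- function(df) {
  m1 <- lm(n ~ year, data = df)
  m2 <- lm(n ~ year + I(year^2), data = df)
  ifelse(AIC(m1) < AIC(m2), "y~x", "y~poly(x, 2)")
}

# Formula chosen from the pooled data (what formula = lm.mod(d) passes on)
pooled <- lm.mod(d)

# Formula chosen separately for each level of cat
per_panel <- sapply(split(d, d$cat), lm.mod)

print(pooled)
print(per_panel)
```

With the example data, the per-level call should pick the polynomial for a and the linear formula for b, which is what the plot in the other answer shows.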

answered Sep 18 '22 17:09

user1317221_G
