Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Overlaying two graphs using ggplot2 in R

Tags:

graph

r

ggplot2

I have two graphs and I am trying to overlay one on top of the other:

An example of the data frame "ge" looks like this. In actuality there are 10 Genes with 200 samples each, so there are 2000 rows and 3 columns:

Exp    Gene    Sample 903.0   1       1 1060.0  1       2 786.0   1       3 736.0   1       4 649.0   2       1 657.0   2       2 733.5   2       3 774.0   2       4 

An example of the data frame "avg" looks like this. This is an average of the data points for each gene across all samples. In actuality this graph has 10 genes, so the matrix is 4col X 10 rows:

mean       Gene   sd         se 684.2034    1   102.7142    7.191435 723.2892    2   100.6102    7.044122 

The first graph graphs a line of the average expression for each gene along with the standard deviation for each data point.

avggraph <- ggplot(avg, aes(x=Gene, y=mean)) + geom_point() +geom_line() + geom_errorbar(aes(ymin=mean-sd, ymax=mean+sd), width=.1) 

The second graph graphs the gene expression in the form a line for each sample across all the genes.

linegraphs <- ggplot(ge, aes(x=Gene, y=Expression, group=Samples, colour="#000099")) + geom_line() + scale_x_discrete(limits=flevels.tge) 

I would like to superimpose avggraph on top of linegraphs. Is there a way to do this? I've tried avggraph + linegraphs but I'm getting an error. I think this is because the graphs are generated by two different data frames.

I should also point out that the axes of both graphs are the same. Both graphs have the genes on the X-axis and the gene expression on the Y-axis.

Any help would be greatly appreciated!

like image 947
Sheila Avatar asked Nov 21 '12 06:11

Sheila


People also ask

How do I overlay two plots in R?

Overlaying is a technique that is used to draw multiple plots on a single frame. To draw multiple plots in the R Language, we draw a basic plot and add an overlay line plot or scatter plot by using the lines() and the points() function.

How do you overlay a plot?

Creating an Overlay Plot Before using an overlay plot, you must first analyze the data sets (or simulate the diagrams) you wish to include in the plot. Then to add an overlay plot to the project, choose Home > Insert > Overlay Plot.


2 Answers

One way is to add the geom_line command for the second plot to the first plot. You need to tell ggplot that this geom is based on a different data set:

ggplot(avg, aes(x=Gene, y=mean)) +    geom_point() +    geom_line() +    geom_errorbar(aes(ymin=mean-sd, ymax=mean+sd), width=.1) +   geom_line(data = ge, aes(x=Gene, y=Exp, group=Sample, colour="#000099"),             show_guide = FALSE) 

The last geom_line command is for creating the lines based on the raw data. enter image description here

like image 61
Sven Hohenstein Avatar answered Sep 21 '22 22:09

Sven Hohenstein


The workaround which I found was that instead of merging the two plots, I merged the data. I added an additional column at the end of the two dataframes and then performed the rbind operation on them.The using either the fill or color aesthetics to separate the two plots. Of course in my case the scale used for the axis were to be the same.

like image 35
ants.in.my.eyes Avatar answered Sep 22 '22 22:09

ants.in.my.eyes