Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

multiple histograms on top of eachother without bins

Tags:

plot

r

histogram

Let's say I've got this dataframe with 2 levels. LC and HC. Now i want to get 2 plots like below on top of eachother.

data <- data.frame(
    welltype=c("LC","LC","LC","LC","LC","HC","HC","HC","HC","HC"),
    value=c(1,2,1,2,1,5,4,5,4,5))

The code to get following plot =

x <- rnorm(1000)
y <- hist(x)
plot(y$breaks,
   c(y$counts,0),
   type="s",col="blue")

(with thanks to Joris Meys)

So, how do I even start on this. Since I'm used to java I was thinking of a for loop, but I've been told not to do it this way.

enter image description here

like image 753
Sir Ksilem Avatar asked May 17 '11 13:05

Sir Ksilem


2 Answers

Next to the method provided by Aaron, there's a ggplot solution as well (see below), but I would strongly advise you to use the densities, as they will give nicer plots and are a whole lot easier to construct :

# make data
wells <- c("LC","HC","BC")
Data <- data.frame(
    welltype=rep(wells,each=100),
    value=c(rnorm(100),rnorm(100,2),rnorm(100,3))
)

ggplot(Data,aes(value,fill=welltype)) + geom_density(alpha=0.2)

gives :enter image description here

For the plot you requested :

# make hists dataframe
hists <- tapply(Data$value,Data$welltype,
            function(i){
              tmp <- hist(i)
              data.frame(br=tmp$breaks,co=c(tmp$counts,0))
            })
ll <- sapply(hists,nrow)
hists <- do.call(rbind,hists)
hists$fac <- rep(wells,ll)

# make plot
require(ggplot2)
qplot(br,co,data=hists,geom="step",colour=fac)

enter image description here

like image 73
Joris Meys Avatar answered Sep 20 '22 13:09

Joris Meys


You can use the same code except with points instead of plot for adding additional lines to the plot.

Making up some data

set.seed(5)
d <- data.frame(x=c(rnorm(1000)+3, rnorm(1000)),
                g=rep(1:2, each=1000) )

And doing it in a fairly straightforward way:

x1 <- d$x[d$g==1]
x2 <- d$x[d$g==2]
y1 <- hist(x1, plot=FALSE)
y2 <- hist(x2, plot=FALSE)
plot(y1$breaks, c(y1$counts,0), type="s",col="blue",
     xlim=range(c(y1$breaks, y2$breaks)), ylim=range(c(0,y1$counts, y2$counts)))
points(y2$breaks, c(y2$counts,0), type="s", col="red")

Or in a more R-ish way:

col <- c("blue", "red")
ds <- split(d$x, d$g)
hs <- lapply(ds, hist, plot=FALSE)
plot(0,0,type="n",
     ylim=range(c(0,unlist(lapply(hs, function(x) x$counts)))),
     xlim=range(unlist(lapply(hs, function(x) x$breaks))) )
for(i in seq_along(hs)) {
  points(hs[[i]]$breaks, c(hs[[i]]$counts,0), type="s", col=col[i])
}

EDIT: Inspired by Joris's answer, I'll note that lattice can also easily do overlapping density plots.

library(lattice)
densityplot(~x, group=g, data=d)
like image 39
Aaron left Stack Overflow Avatar answered Sep 21 '22 13:09

Aaron left Stack Overflow