I have data which contain binary indicators for two groups, and to more groups that are nested within one of the first two groups. For example: <pre class="prettyprint"><code>set.seed(1) df <- data.frame(a=rep(0,10),b=rep(0,10),b.1=rep(0,10),b.2=rep(0,10)) df$a[sample(10,5,replace=F)] <- 1 df$b[sample(10,5,replace=F)] <- 1 df$b.1[sample(which(df$b==1),3,replace=F)] <- 1 df$b.2[sample(which(df$b==1),3,replace=F)] <- 1 df <- df[which(rowSums(df)==0),] </code></pre> <code>a</code> and <code>b</code> are the two groups and <code>b.1</code> and <code>b.2</code> are nested within group <code>b</code>. What I'd like to do is draw one venn diagram of all groups. This means that <code>b.1</code> and <code>b.2</code> will be circumscribed within <code>b</code>, which will intersect <code>a</code>. Is there any way to achieve this? Using a <code>ggplot</code> solution would be great. Trying <code>R's VennDiagram</code>' only for groups b, b.1, and b.2 doesn't even work for me: <pre class="prettyprint"><code>library(VennDiagram) draw.triple.venn(area1=sum(df$b),area2=sum(df$b.1),area3=sum(df$b.2), n12=sum(df$b*df$b.1),n23=sum(df$b.1*df$b.2),n13=sum(df$b*df$b.2),n123=sum(df$b*df$b.1*df$b.2), category=c("b","b1","b2")) </code></pre> <img src="https://i.stack.imgur.com/9lldG.png" alt="enter image description here"> With the <code>Vennerable</code> package I get close only drawing the "b" groups: <pre class="prettyprint"><code>library(Vennerable) plot(Venn(Sets=list(b=which(df$b==1),b.1=which(df$b.1==1),b.2=which(df$b.2==1))),doEuler=T,doWeight=T) </code></pre> <img src="https://i.stack.imgur.com/mhDiK.png" alt="enter image description here"> But when I add the <code>a</code> group it gets messed up: <img src="https://i.stack.imgur.com/vEl8P.png" alt="enter image description here"> Because what I really need is one circle for group <code>a</code> with an intersecting area with group <code>b</code>, and within the circle of group <code>b</code> are the circles of groups <code>b.1</code> and <code>b.2</code>.

In your assumption, there are few patterns of circle locations. I think it would be better to make your <code>function()</code>. Here is my example (edited; change default vp): <pre class="prettyprint"><code>nest_venn <- function(data_list, fill = c(2, 4, 5, 6), alpha = 0.15, vp = viewport(height=unit(1 ,"snpc"), width=unit(1,"snpc"))) { counts <- get.venn.partitions(data_list)$..count.. # calculation of each area's value if(any(counts[c(3, 4, 7, 8, 11, 12)]==!0)) warning("data_list[[3]] and/or data_list[[4]] isn't nested") grobs <- grobTree( circleGrob(x = 0.33, y = 0.5, r = 0.3, gp = gpar(fill = alpha(fill[1], alpha), col=8, lwd = 2)), # a circle circleGrob(x = 0.67, y = 0.5, r = 0.3, gp = gpar(fill = alpha(fill[2], alpha), col=8, lwd = 2)), # b circle circleGrob(x = 0.67, y = 0.6, r = 0.16, gp = gpar(fill = alpha(fill[3], alpha), col=8, lwd = 2)), # b.1 circle circleGrob(x = 0.67, y = 0.4, r = 0.16, gp = gpar(fill = alpha(fill[4], alpha), col=8, lwd = 2)), # b.2 circle textGrob(names(data_list)[1], x = 0.33, y = 0.82, gp = gpar(cex = 1, fontface = 4)), # a label textGrob(names(data_list)[2], x = 0.67, y = 0.82, gp = gpar(cex = 1, fontface = 4)), # b label textGrob(names(data_list)[3], x = 0.83, y = 0.7, gp = gpar(cex = 1, fontface = 4)), # b.1 label textGrob(names(data_list)[4], x = 0.83, y = 0.3, gp = gpar(cex = 1, fontface = 4)), # b.2 label textGrob(counts[15], x = 0.28, y = 0.5, gp = gpar(cex = 1.2)), # a textGrob(counts[14], x = 0.9, y = 0.5, gp = gpar(cex = 1.2)), # b textGrob(counts[13], x = 0.47, y = 0.5, gp = gpar(cex = 1.2)), # a & b textGrob(counts[10], x = 0.68, y = 0.65, gp = gpar(cex = 1.2)), # b & b.1 textGrob(counts[6], x = 0.68, y = 0.35, gp = gpar(cex = 1.2)), # b & b.2 textGrob(counts[9], x = 0.57, y = 0.6, gp = gpar(cex = 1.2)), # a & b & b.1 textGrob(counts[5], x = 0.57, y = 0.4, gp = gpar(cex = 1.2)), # a & b & b.2 textGrob(counts[2], x = 0.69, y = 0.5, gp = gpar(cex = 1.2)), # b & b.1 & b.2 textGrob(counts[1], x = 0.6, y = 0.5, gp = gpar(cex = 1.2)), # a & b & b.1 & b.2 vp = vp) return(grobs) } </code></pre> preparation of data list: <pre class="prettyprint"><code>set.seed(1) df <- data.frame(a=rep(0,10),b=rep(0,10),b.1=rep(0,10),b.2=rep(0,10)) df$a[sample(10,5,replace=F)] <- 1 df$b[sample(10,5,replace=F)] <- 1 df$b.1[sample(which(df$b==1),3,replace=F)] <- 1 df$b.2[sample(which(df$b==1),3,replace=F)] <- 1 df <- df[-which(rowSums(df)==0),] # the same as OP's example data data_list <- list() for(i in colnames(df)) data_list[[i]] <- which(df[,i]==1) # > data_list[1] # $a # [1] 2 3 4 5 7 </code></pre> use above function and draw the output: <pre class="prettyprint"><code>library(VennDiagram); library(grid); library(ggplot2) nestvenn.obj <- nest_venn(data_list) grid.newpage() grid.draw(nestvenn.obj) # [ edited ] # If you want a fixed size etc, please give an argument, vp. vp1 <- viewport(height=unit(150 ,"mm"), width=unit(150, "mm")) # example nestvenn.obj <- nest_venn(data_list, vp = vp1) grid.newpage() </code></pre> <img src="https://i.stack.imgur.com/zUC3Z.png" alt="enter image description here"> <pre class="prettyprint"><code># an example with ggplot library(gtable); library(dplyr) grid.newpage() ggplot(data.frame(x=1, y=1), aes(x, y)) %>% ggplotGrob() %>% gtable_filter("panel") %>% gList(nestvenn.obj) %>% grid.draw() </code></pre>

Drawing nested venn diagrams

Tags:

r

nested

ggplot2

venn-diagram

I have data which contain binary indicators for two groups, and to more groups that are nested within one of the first two groups.

For example:

set.seed(1)
df <- data.frame(a=rep(0,10),b=rep(0,10),b.1=rep(0,10),b.2=rep(0,10))
df$a[sample(10,5,replace=F)] <- 1
df$b[sample(10,5,replace=F)] <- 1
df$b.1[sample(which(df$b==1),3,replace=F)] <- 1
df$b.2[sample(which(df$b==1),3,replace=F)] <- 1
df <- df[which(rowSums(df)==0),]

a and b are the two groups and b.1 and b.2 are nested within group b.

What I'd like to do is draw one venn diagram of all groups. This means that b.1 and b.2 will be circumscribed within b, which will intersect a.

Is there any way to achieve this? Using a ggplot solution would be great.

Trying R's VennDiagram' only for groups b, b.1, and b.2 doesn't even work for me:

library(VennDiagram)
draw.triple.venn(area1=sum(df$b),area2=sum(df$b.1),area3=sum(df$b.2),
                   n12=sum(df$b*df$b.1),n23=sum(df$b.1*df$b.2),n13=sum(df$b*df$b.2),n123=sum(df$b*df$b.1*df$b.2),
                   category=c("b","b1","b2"))

enter image description here

With the Vennerable package I get close only drawing the "b" groups:

library(Vennerable)
plot(Venn(Sets=list(b=which(df$b==1),b.1=which(df$b.1==1),b.2=which(df$b.2==1))),doEuler=T,doWeight=T)

enter image description here

But when I add the a group it gets messed up: enter image description here

Because what I really need is one circle for group a with an intersecting area with group b, and within the circle of group b are the circles of groups b.1 and b.2.

879

asked Jul 31 '16 00:07

user1701545

2 Answers

The main idea is to draw a triple Venn with a, b1, and b2, and then manually overlay an ellipse for b.

library(VennDiagram)
library(gridExtra)
polygons <- draw.triple.venn(
    area1=sum(df$a),
    area2=sum(df$b.1),
    area3=sum(df$b.2),
    n12=sum(df$a*df$b.1),
    n23=sum(df$b.1*df$b.2),
    n13=sum(df$a*df$b.2),
    n123=sum(df$a*df$b.1*df$b.2),
    category=c("a","b1","b2"),
    margin=.1)

Now we draw the ellipse and add the label. This requires a fair bit of trial and error to get the location, angle, and size right. As it is, it's not perfect, but it's almost there.

b <- ellipseGrob(
    x=unit(0.562,"npc"),
    y=unit(0.515,"npc"),
    angle=(1.996*pi)/3,
    size=65.5, ar=2, gp=gpar(lwd=2.2))
grid.draw(b)
grid.text("b", x=unit(.9,"npc"), y=unit(.9,"npc"), gp=gpar(fontfamily="serif"))

enter image description here

answered Oct 17 '22 04:10

Weihuang Wong

In your assumption, there are few patterns of circle locations. I think it would be better to make your function().

Here is my example (edited; change default vp):

nest_venn <- function(data_list, fill = c(2, 4, 5, 6), alpha = 0.15, 
                      vp = viewport(height=unit(1 ,"snpc"), width=unit(1,"snpc"))) {
  counts <- get.venn.partitions(data_list)$..count..      # calculation of each area's value
  if(any(counts[c(3, 4, 7, 8, 11, 12)]==!0)) warning("data_list[[3]] and/or data_list[[4]] isn't nested")
  grobs <- grobTree(
    circleGrob(x = 0.33, y = 0.5, r = 0.3, gp = gpar(fill = alpha(fill[1], alpha), col=8, lwd = 2)),  # a circle
    circleGrob(x = 0.67, y = 0.5, r = 0.3, gp = gpar(fill = alpha(fill[2], alpha), col=8, lwd = 2)),  # b circle
    circleGrob(x = 0.67, y = 0.6, r = 0.16, gp = gpar(fill = alpha(fill[3], alpha), col=8, lwd = 2)), # b.1 circle
    circleGrob(x = 0.67, y = 0.4, r = 0.16, gp = gpar(fill = alpha(fill[4], alpha), col=8, lwd = 2)), # b.2 circle
    textGrob(names(data_list)[1], x = 0.33, y = 0.82, gp = gpar(cex = 1, fontface = 4)), # a label
    textGrob(names(data_list)[2], x = 0.67, y = 0.82, gp = gpar(cex = 1, fontface = 4)), # b label
    textGrob(names(data_list)[3], x = 0.83, y = 0.7, gp = gpar(cex = 1, fontface = 4)),  # b.1 label
    textGrob(names(data_list)[4], x = 0.83, y = 0.3, gp = gpar(cex = 1, fontface = 4)),  # b.2 label
    textGrob(counts[15], x = 0.28, y = 0.5, gp = gpar(cex = 1.2)),  # a
    textGrob(counts[14], x = 0.9, y = 0.5, gp = gpar(cex = 1.2)),   #     b
    textGrob(counts[13], x = 0.47, y = 0.5, gp = gpar(cex = 1.2)),  # a & b
    textGrob(counts[10], x = 0.68, y = 0.65, gp = gpar(cex = 1.2)), #     b & b.1
    textGrob(counts[6], x = 0.68, y = 0.35, gp = gpar(cex = 1.2)),  #     b       & b.2
    textGrob(counts[9], x = 0.57, y = 0.6, gp = gpar(cex = 1.2)),   # a & b & b.1
    textGrob(counts[5], x = 0.57, y = 0.4, gp = gpar(cex = 1.2)),   # a & b       & b.2
    textGrob(counts[2], x = 0.69, y = 0.5, gp = gpar(cex = 1.2)),   #     b & b.1 & b.2
    textGrob(counts[1], x = 0.6, y = 0.5, gp = gpar(cex = 1.2)),    # a & b & b.1 & b.2
    vp = vp)
  return(grobs)
}

preparation of data list:

set.seed(1)
df <- data.frame(a=rep(0,10),b=rep(0,10),b.1=rep(0,10),b.2=rep(0,10))
df$a[sample(10,5,replace=F)] <- 1
df$b[sample(10,5,replace=F)] <- 1
df$b.1[sample(which(df$b==1),3,replace=F)] <- 1
df$b.2[sample(which(df$b==1),3,replace=F)] <- 1
df <- df[-which(rowSums(df)==0),]            # the same as OP's example data

data_list <- list()
for(i in colnames(df)) data_list[[i]] <- which(df[,i]==1)
  # > data_list[1]
  # $a
  # [1] 2 3 4 5 7

use above function and draw the output:

library(VennDiagram); library(grid); library(ggplot2)

nestvenn.obj <- nest_venn(data_list)
grid.newpage()
grid.draw(nestvenn.obj)

# [ edited ]
# If you want a fixed size etc, please give an argument, vp.
vp1 <- viewport(height=unit(150 ,"mm"), width=unit(150, "mm")) # example
nestvenn.obj <- nest_venn(data_list, vp = vp1)
grid.newpage()

enter image description here

# an example with ggplot
library(gtable); library(dplyr)

grid.newpage()
ggplot(data.frame(x=1, y=1), aes(x, y)) %>% ggplotGrob() %>% 
  gtable_filter("panel") %>% gList(nestvenn.obj) %>% grid.draw()

answered Oct 17 '22 06:10

cuttlefish44

Related questions
                            
                                R alignment of axis labels with expressions
                            
                                Extracting a number following specific text in R
                            
                                knitr: Add figure notes
                            
                                ggplot GLM fitted curve without interaction
                            
                                r Shiny: renderImage from www
                            
                                How to specify different random effects in nlme vs. lme4?
                            
                                R Syntax Highlighting for Confluence
                            
                                ggplot2: boxplot with colors and text labels mapped to combination of two categorical variables
                            
                                igraph does not apply edge.width for negative correlation coefficients
                            
                                Reproduce a 'The Economist' chart with dual axis
                            
                                Multiply previous row value by constant R
                            
                                Date roll-up in R
                            
                                R ggplot geom_jitter duplicates outlier
                            
                                Time series plot gets offset by 2 hours if scale_x_datetime is used
                            
                                Referencing a range of columns in dplyr
                            
                                doParallel (package) foreach does not work for big iterations in R
                            
                                How to make the size of points on a plot proportional to p-value?
                            
                                The equivalent of 'this' or 'self' in R
                            
                                How to decrease padding between lines and points in R "both" type plots
                            
                                Store multiple objects in sysdata.rda: R-package development

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Drawing nested venn diagrams

Tags:

r

nested

ggplot2

venn-diagram

user1701545

People also ask

2 Answers

Weihuang Wong

cuttlefish44

Recent Activity

Donate For Us