How can I label the points of a quantile-quantile plot composed with ggplot2?

Tags: r, ggplot2

I am building a quantile-quantile plot from a variable called x in a data frame called df, using the working example provided below. I would like to label the points with the name variable of df.

Is it possible to do this in ggplot2 without resorting to the painful solution (coding the theoretical distribution by hand and then plotting it against the empirical one)?

Edit: it turns out that yes, it is possible, thanks to a user who posted and then deleted his answer. See the comments after Arun's answer below. Thanks to Didzis for his otherwise clever solution with ggplot_build().

# MWE
df <- structure(list(name = structure(c(1L, 2L, 3L, 4L, 5L, 7L, 9L, 
10L, 6L, 12L, 13L, 14L, 15L, 16L, 17L, 19L, 18L, 20L, 21L, 22L, 
8L, 23L, 11L, 24L), .Label = c("AUS", "AUT", "BEL", "CAN", "CYP", 
"DEU", "DNK", "ESP", "FIN", "FRA", "GBR", "GRC", "IRL", "ITA", 
"JPN", "MLT", "NLD", "NOR", "NZL", "PRT", "SVK", "SVN", "SWE", 
"USA"), class = "factor"), x = c(-0.739390016757746, 0.358177826874146, 
1.10474523846099, -0.250589535389937, -0.423112615445571, -0.862144579740376, 
0.823039669834058, 0.079521521937704, 1.08173649722493, -2.03962942823921, 
1.05571087029737, 0.187147291278723, -0.144770773941437, 0.957990771847331, 
-0.0546549555439176, -2.70142550075757, -0.391588386498849, -0.23855544527369, 
-0.242781575907386, -0.176765072121165, 0.105155860923456, 2.69031085872414, 
-0.158320176671995, -0.564560815972446)), .Names = c("name", 
"x"), row.names = c(NA, -24L), class = "data.frame")

library(ggplot2)
qplot(sample = x, data = df) + geom_abline(linetype = "dotted") + theme_bw()

# ... using names instead of points would make it possible to spot the outliers

I am working on an adaptation of this gist, and will consider sending other questions to CrossValidated if I have questions about the regression diagnostics, which might be of interest to CV users.

asked Feb 19 '13 13:02

Fr.

2 Answers

You can save your original QQ plot as an object (using ggplot() and stat_qq() instead of qplot()):

g<-ggplot(df, aes(sample = x)) + stat_qq()

Then, with ggplot_build(), you can extract the data used for plotting. They are stored in the element data[[1]]. Save those data as a new data frame.

df.new<-ggplot_build(g)$data[[1]]
head(df.new)
           x          y     sample theoretical PANEL group
1 -2.0368341 -2.7014255 -2.7014255  -2.0368341     1     1
2 -1.5341205 -2.0396294 -2.0396294  -1.5341205     1     1
3 -1.2581616 -0.8621446 -0.8621446  -1.2581616     1     1
4 -1.0544725 -0.7393900 -0.7393900  -1.0544725     1     1
5 -0.8871466 -0.5645608 -0.5645608  -0.8871466     1     1
6 -0.7415940 -0.4231126 -0.4231126  -0.7415940     1     1

Now you can add the names of the observations to the new data frame. It is important to use order(), because the data in the new data frame are sorted by the sample values.

df.new$name<-df$name[order(df$x)]
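To see why order() matters here, a small base-R sketch may help. The data frame toy is hypothetical (it is not from the question); the point is only that the names have to be permuted with the same index that sorts the sample.

```r
# Hypothetical toy data, used only to illustrate the alignment step
toy <- data.frame(name = c("A", "B", "C", "D"),
                  x    = c(0.3, -1.2, 2.1, -0.4),
                  stringsAsFactors = FALSE)

# order() returns the row indices that sort x in increasing order
idx <- order(toy$x)

# Sample values in sorted order, as a QQ plot uses them
sorted_x <- toy$x[idx]

# Names permuted with the same index, so each name stays with its own value
sorted_name <- toy$name[idx]

data.frame(name = sorted_name, x = sorted_x)
```

Without the order() step, the first name in df would be attached to the smallest sample value regardless of which observation it actually belongs to.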

Now plot the new data frame as usual, using geom_text() instead of geom_point().

ggplot(df.new,aes(theoretical,sample,label=name))+geom_text()+ 
  geom_abline(linetype = "dotted") + theme_bw()
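If you would rather not go through ggplot_build(), here is a minimal sketch of the same plot with the theoretical quantiles computed directly. It assumes a normal reference distribution and plotting positions from ppoints(); the first theoretical value printed above looks consistent with qnorm(ppoints(24))[1], but treat that as my assumption about what stat_qq() does by default. The data frame toy and its columns are hypothetical stand-ins for df.

```r
library(ggplot2)

# Hypothetical stand-in data; replace with the df from the question
set.seed(1)
toy <- data.frame(name = LETTERS[1:10], x = rnorm(10),
                  stringsAsFactors = FALSE)

# Sort by the sample so that row i holds the i-th smallest observation
qq <- toy[order(toy$x), ]

# Theoretical normal quantiles at the plotting positions given by ppoints()
qq$theoretical <- qnorm(ppoints(nrow(qq)))

# Draw the names in place of points, with the same dotted reference line
p <- ggplot(qq, aes(x = theoretical, y = x, label = name)) +
  geom_text() +
  geom_abline(linetype = "dotted") +
  theme_bw()
p
```

Because the names travel with the rows when the data frame is sorted, no separate realignment of the labels is needed in this variant.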

answered Oct 10 '22 05:10

Didzis Elferts

The points are too close together for labels to be readable. I would do something like this:

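# Sort by x so that row i holds the i-th smallest observation
df <- df[order(df$x), ]
# Approximate theoretical quantiles from a simulated normal sample (varies between runs)
df$t <- quantile(rnorm(1000), seq(0, 100, length.out = nrow(df))/100)

# Refer to columns by bare name inside aes(), not as df$name
p <- ggplot(data = df, aes(x = t, y = x)) + geom_point(aes(colour = name))
p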

This gives a QQ plot in which each point is coloured by name, with a legend identifying the countries.

If you insist on having labels inside the plot, you could try something like:

df <- df[order(df$x), ]
df$t <- quantile(rnorm(1000), seq(0, 100, length.out = nrow(df))/100)

p <- ggplot(data = df, aes(x = t, y = x)) + geom_point(aes(colour = name))
# Offset the labels slightly from the points; size is a constant, so it goes
# outside aes() (3 is an arbitrary choice)
p <- p + geom_text(aes(x = t - 0.05, y = x - 0.15, label = name, colour = name), size = 3)

p

You can play around with the x and y offsets of the labels, and you can always remove the colour aesthetic if you prefer.
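As a sketch of another way to shift the labels, geom_text() also accepts nudge_x and nudge_y, which keeps the offsets out of aes(). The data frame toy is hypothetical, the offsets and size are arbitrary, and qnorm(ppoints()) is swapped in for the simulated quantiles only to make the example reproducible.

```r
library(ggplot2)

# Hypothetical stand-in data, already small enough to read at a glance
toy <- data.frame(name = c("A", "B", "C", "D", "E"),
                  x    = c(-1.3, -0.2, 0.1, 0.4, 1.8),
                  stringsAsFactors = FALSE)

# Sort by x and attach deterministic theoretical normal quantiles
toy <- toy[order(toy$x), ]
toy$t <- qnorm(ppoints(nrow(toy)))

p <- ggplot(toy, aes(x = t, y = x)) +
  geom_point() +
  # nudge_x / nudge_y shift every label by a fixed amount in data units;
  # check_overlap = TRUE drops labels that would overlap an earlier one
  geom_text(aes(label = name), nudge_x = -0.05, nudge_y = -0.15,
            check_overlap = TRUE, size = 3)
p
```

Note that check_overlap hides overlapping labels rather than moving them, so it is better suited to a quick look than to a plot where every point has to be identified.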

answered Oct 10 '22 04:10

Arun
