
R - ordering in boxplot

Tags:

r

boxplot

I am trying to produce a series of box plots in R that are grouped by 2 factors. I've managed to make the plot, but I cannot get the boxes to appear in the correct order.

The data frame I am using looks like this:

Nitrogen    Species    Treatment
2           G          L
3           R          M
4           G          H
4           B          L
2           B          M
1           G          H

I tried:

boxplot(mydata$Nitrogen~mydata$Species*mydata$Treatment)

This ordered the boxes alphabetically (the first three were the "High" treatments, and within those three they were ordered alphabetically by species name).

(Image: https://i.stack.imgur.com/TTiGD.png)

I want the box plot ordered Low>Medium>High then within each of those groups G>R>B for the species.

So I tried using a factor in the formula:

f = ordered(interaction(mydata$Treatment, mydata$Species),
            levels = c("L.G","L.R","L.B","M.G","M.R","M.B","H.G","H.R","H.B"))

then:

boxplot(mydata$Nitrogen~f)

However, the boxes are still showing up in the same order. The labels are now different, but the boxes have not moved.

I have pulled out each set of data and plotted them all together individually:

lg = mydata[mydata$Treatment == "L" & mydata$Species == "G", "Nitrogen"]
mg = mydata[mydata$Treatment == "M" & mydata$Species == "G", "Nitrogen"]
hg = mydata[mydata$Treatment == "H" & mydata$Species == "G", "Nitrogen"]
# etc. for the remaining combinations
boxplot(lg, lr, lb, mg, mr, mb, hg, hr, hb)

This gives what I want, but I would prefer to do this in a more elegant way, so I don't have to pull each one out individually for larger data sets.

Loadable data:

mydata <- structure(list(Nitrogen = c(2L, 3L, 4L, 4L, 2L, 1L),
    Species = structure(c(2L, 3L, 2L, 1L, 1L, 2L),
        .Label = c("B", "G", "R"), class = "factor"),
    Treatment = structure(c(2L, 3L, 1L, 2L, 3L, 1L),
        .Label = c("H", "L", "M"), class = "factor")),
    .Names = c("Nitrogen", "Species", "Treatment"),
    class = "data.frame", row.names = c(NA, -6L))

407

asked Nov 23 '10 20:11

Robert

1 Answer

The following commands will create the ordering you need by rebuilding the Treatment and Species factors, with explicit manual ordering of the levels:

mydata$Treatment = factor(mydata$Treatment,c("L","M","H"))
mydata$Species = factor(mydata$Species,c("G","R","B"))

(Image: https://i.stack.imgur.com/B1DBI.png)
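For reference, here is a minimal end-to-end sketch of the fix applied to the six-row example from the question. The data frame is rebuilt with `data.frame()` rather than the `structure()` call, and `plot = FALSE` is used only so the group order can be inspected without a graphics device; both are choices made for this illustration, not part of the original answer.

```r
# Recreate the six-row example from the question.
mydata <- data.frame(
  Nitrogen  = c(2, 3, 4, 4, 2, 1),
  Species   = factor(c("G", "R", "G", "B", "B", "G")),
  Treatment = factor(c("L", "M", "H", "L", "M", "H"))
)

# Rebuild both factors with the level order the plot should follow.
mydata$Treatment <- factor(mydata$Treatment, levels = c("L", "M", "H"))
mydata$Species   <- factor(mydata$Species,   levels = c("G", "R", "B"))

# plot = FALSE returns the box statistics without drawing anything;
# remove it to draw the plot itself.
bp <- boxplot(Nitrogen ~ Species * Treatment, data = mydata, plot = FALSE)
print(bp$names)
```

With the levels set this way, the group names run through the species G, R, B within Low, then Medium, then High, which is the order asked for in the question.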

edit 1 : oops I had set it to HML instead of LMH. fixing.

edit 2 : what factor(X,Y) does:

If you run factor(X,Y) on an existing factor, it uses the ordering of the values in Y to enumerate the values present in the factor X. Here are some examples with your data.

> mydata$Treatment
[1] L M H L M H
Levels: H L M
> as.integer(mydata$Treatment)
[1] 2 3 1 2 3 1
> factor(mydata$Treatment,c("L","M","H"))
[1] L M H L M H                               <-- not changed
Levels: L M H                                 <-- changed
> as.integer(factor(mydata$Treatment,c("L","M","H")))
[1] 1 2 3 1 2 3                               <-- changed

It does NOT change what the factor looks like at first glance, but it does change how the data is stored.

What's important here is that many plot functions will plot the lowest enumeration leftmost, followed by the next, etc.

If you create factors simply using factor(X) then usually the enumeration is based upon the alphabetical order of the factor levels (e.g. "H","L","M"). If your labels have a conventional ordering different from alphabetical (e.g. "H","M","L"), this can make your graphs seem strange.

At first glance, it may seem like the problem is due to the ordering of data in the data frame, i.e. if only we could place all "H" at the top and "L" at the bottom, then it would work. It wouldn't: the order of the rows has no effect on the order of the levels. But if you want your labels to appear in the same order as the first occurrence in the data, you can use this form:

 mydata$Treatment = factor(mydata$Treatment, unique(mydata$Treatment))
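The two points above (row order does not matter, but `unique()` can supply a first-appearance order) can be checked with a small sketch. The vector names `treatment`, `reordered` and `by_appearance` are made up for this illustration; the values are the Treatment column from the question.

```r
# Treatment values from the question; default levels are alphabetical.
treatment <- factor(c("L", "M", "H", "L", "M", "H"))
print(levels(treatment))

# Reordering the values (as sorting the rows would) leaves the levels untouched.
reordered <- treatment[c(3, 6, 2, 5, 1, 4)]
print(levels(reordered))

# Passing unique() as the levels uses the order of first appearance instead.
by_appearance <- factor(treatment, levels = unique(treatment))
print(levels(by_appearance))
```

Note that the first-appearance form depends on how the rows happen to be arranged, so spelling the levels out explicitly is the safer choice when the order is known in advance.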

186

answered Sep 22 '22 11:09

Alex Brown
