The right way to plot multiple y values as separate lines with ggplot2

Tags:

I often run into an issue where I have a data frame that has a single x variable, one or more facet variables, and multiple different other variables. Sometimes I would like to simultaneously plot different y variables as separate lines. But it is always only a subset I want. I've tried using melt to get "variable" as a column and use that, and it works if I want every single column that was in the original dataset. Usually I don't.

Right now I've been doing things really roundabout it feels like. Suppose with mtcars I want to plot disp, hp, and wt against mpg:

ggplot(mtcars, aes(x=mpg)) + 
  geom_line(aes(y=disp, color="disp")) + 
  geom_line(aes(y=hp, color="hp")) + 
  geom_line(aes(y=wt, color="wt"))

This feels really redundant. If I first melt mtcars, then all variables will get melted, and then I will wind up plotting other variables that I don't want to.

Does anyone have a good way of doing this?

363

asked Sep 27 '11 13:09

Chris Neff

2 Answers

ggplot always prefers long format dataframe, so melt it:

library(reshape2)
mtcars.long <- melt(mtcars, id = "mpg", measure = c("disp", "hp", "wt"))
ggplot(mtcars.long, aes(mpg, value, colour = variable)) + geom_line()

There are many other options for doing this transformation. You can see the R-FAQ on converting data from wide to long for an overview.

answered Sep 22 '22 23:09

kohske

With reshape2 being deprecated, I updated @kohske answer using pivot_longer from tidyverse package.

Pivoting is explained here and involves specifying the data to reshape, second argument describes which columns need to be reshape (use - to exclude that column). Third is names_to gives the name of the variable that will be created from the data stored in the column names. Finally values_to gives the name of the variable that will be created from the data stored in the cell value, i.e. count. They also have more complex examples like numbers in column names e.g. wk1 wk2 etc.

# new suggestion
library(tidyverse)

# I subset to just the variables wanted so e.g. gear and cab are not included
mtcars.long <- mtcars %>% 
  select("mpg","disp", "hp", "wt") %>% 
  pivot_longer(-mpg, names_to = "variable", values_to = "value")

head(mtcars.long)
# # A tibble: 6 x 3
# mpg variable  value
# <dbl> <chr>     <dbl>
#   1    21 disp     160   
# 2    21 hp       110   
# 3    21 wt         2.62
# 4    21 disp     160   
# 5    21 hp       110   
# 6    21 wt         2.88


ggplot(mtcars.long, aes(mpg, value, colour = variable)) + geom_line()

Chart is:

mtcarstestchart

answered Sep 21 '22 23:09

micstr

Related questions
                            
                                Adding custom CSS tags to an RMarkdown html document
                            
                                Variables as default arguments of a function, using dplyr
                            
                                Reading Excel file: How to find the start cell in messy spreadsheets?
                            
                                Documenting R6 classes and methods within R package in RStudio
                            
                                Increase space between axis.title and axis.text in ggplot2 (version >= 0.9.0)
                            
                                Loading PNG files directly from URL
                            
                                read.xlsx reading dates wrong if non-date in column
                            
                                Convert Date to POSIXct
                            
                                Filling in the area under a line graph in ggplot2: geom_area()
                            
                                Sending Rstudio view() content to different pane
                            
                                R: how to center output in R markdown
                            
                                Seeking workaround for gtable_add_grob code broken by ggplot 2.2.0
                            
                                Number of significant digits in dplyr summarise
                            
                                Reticulate - Running python chunks in Rmarkdown
                            
                                ggplot2 : How to reduce the width AND the space between bars with geom_bar
                            
                                How to make scrollable slides in an ioslides presentation with rmarkdown
                            
                                How can I programmatically tell how many facets a ggplot has?
                            
                                Is it possible to run R from a tablet using Honeycomb (Android 3.0)?
                            
                                R: xtable caption (or comment)
                            
                                "..1" in the body of "[[.data.frame"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With