I have a survey file in which row are observation and column question. Here are some fake data they look like: <pre class="prettyprint"><code>People,Food,Music,People P1,Very Bad,Bad,Good P2,Good,Good,Very Bad P3,Good,Bad,Good P4,Good,Very Bad,Very Good P5,Bad,Good,Very Good P6,Bad,Good,Very Good </code></pre> My aim is to create this kind of plot with <code>ggplot2</code>. <ul> <li>I absolutely don't care of the colors, design, etc. </li> <li>The plot doesn't correspond to the fake data</li> </ul> <img src="https://i.stack.imgur.com/yCWRc.png" alt="enter image description here"> Here are my fake data: <pre class="prettyprint"><code>raw <- read.csv("http://pastebin.com/raw.php?i=L8cEKcxS",sep=",") raw[,2]<-factor(raw[,2],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) raw[,3]<-factor(raw[,3],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) raw[,4]<-factor(raw[,4],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) </code></pre> But if I choose Y as count then I'm facing an issue about choosing the X and the Group values... I don't know if I can succeed without using <code>reshape2</code>... I've also tired to use reshape with melt function. But I don't understand how to use it...

EDIT: Eight years later... This needs a tidyverse solution, so here is one, with all non-base packages explicitly stated so that you know where each function comes from (except for <code>read.csv</code> which is from <code>utils</code> which comes with base R): <pre class="prettyprint"><code>library(magrittr) # needed for %>% if dplyr is not attached "http://pastebin.com/raw.php?i=L8cEKcxS" %>% read.csv(sep = ",") %>% tidyr::pivot_longer(cols = c(Food, Music, People.1), names_to = "variable", values_to = "value") %>% dplyr::group_by(variable, value) %>% dplyr::summarise(n = dplyr::n()) %>% dplyr::mutate(value = factor( value, levels = c("Very Bad", "Bad", "Good", "Very Good")) ) %>% ggplot2::ggplot(ggplot2::aes(variable, n)) + ggplot2::geom_bar(ggplot2::aes(fill = value), position = "dodge", stat = "identity") </code></pre> <hr> The original answer: First you need to get the counts for each category, i.e. how many Bads and Goods and so on are there for each group (Food, Music, People). This would be done like so: <pre class="prettyprint"><code>raw <- read.csv("http://pastebin.com/raw.php?i=L8cEKcxS",sep=",") raw[,2]<-factor(raw[,2],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) raw[,3]<-factor(raw[,3],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) raw[,4]<-factor(raw[,4],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) raw=raw[,c(2,3,4)] # getting rid of the "people" variable as I see no use for it freq=table(col(raw), as.matrix(raw)) # get the counts of each factor level </code></pre> Then you need to create a data frame out of it, melt it and plot it: <pre class="prettyprint"><code>Names=c("Food","Music","People") # create list of names data=data.frame(cbind(freq),Names) # combine them into a data frame data=data[,c(5,3,1,2,4)] # sort columns # melt the data frame for plotting data.m <- melt(data, id.vars='Names') # plot everything ggplot(data.m, aes(Names, value)) + geom_bar(aes(fill = variable), position = "dodge", stat="identity") </code></pre> Is this what you're after? <img src="https://i.stack.imgur.com/zlAi2.png" alt="enter image description here"> To clarify a little bit, in ggplot multiple grouping bar you had a data frame that looked like this: <pre class="prettyprint"><code>> head(df) ID Type Annee X1PCE X2PCE X3PCE X4PCE X5PCE X6PCE 1 1 A 1980 450 338 154 36 13 9 2 2 A 2000 288 407 212 54 16 23 3 3 A 2020 196 434 246 68 19 36 4 4 B 1980 111 326 441 90 21 11 5 5 B 2000 63 298 443 133 42 21 6 6 B 2020 36 257 462 162 55 30 </code></pre> Since you have numerical values in columns 4-9, which would later be plotted on the y axis, this can be easily transformed with <code>reshape</code> and plotted. For our current data set, we needed something similar, so we used <code>freq=table(col(raw), as.matrix(raw))</code> to get this: <pre class="prettyprint"><code>> data Names Very.Bad Bad Good Very.Good 1 Food 7 6 5 2 2 Music 5 5 7 3 3 People 6 3 7 4 </code></pre> Just imagine you have <code>Very.Bad</code>, <code>Bad</code>, <code>Good</code> and so on instead of <code>X1PCE</code>, <code>X2PCE</code>, <code>X3PCE</code>. See the similarity? But we needed to create such structure first. Hence the <code>freq=table(col(raw), as.matrix(raw))</code>.

Grouped bar plot in ggplot

Tags:

r

ggplot2

reshape

bar-chart

reshape2

I have a survey file in which row are observation and column question.

Here are some fake data they look like:

Click to copy

People,Food,Music,People P1,Very Bad,Bad,Good P2,Good,Good,Very Bad P3,Good,Bad,Good P4,Good,Very Bad,Very Good P5,Bad,Good,Very Good P6,Bad,Good,Very Good

My aim is to create this kind of plot with ggplot2.

I absolutely don't care of the colors, design, etc.
The plot doesn't correspond to the fake data

enter image description here

Here are my fake data:

Click to copy

raw <- read.csv("http://pastebin.com/raw.php?i=L8cEKcxS",sep=",") raw[,2]<-factor(raw[,2],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) raw[,3]<-factor(raw[,3],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) raw[,4]<-factor(raw[,4],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE)

But if I choose Y as count then I'm facing an issue about choosing the X and the Group values... I don't know if I can succeed without using reshape2... I've also tired to use reshape with melt function. But I don't understand how to use it...

802

asked Aug 10 '13 03:08

S12000

1 Answers

EDIT: Eight years later...

This needs a tidyverse solution, so here is one, with all non-base packages explicitly stated so that you know where each function comes from (except for read.csv which is from utils which comes with base R):

Click to copy

library(magrittr) # needed for %>% if dplyr is not attached  "http://pastebin.com/raw.php?i=L8cEKcxS" %>%   read.csv(sep = ",") %>%   tidyr::pivot_longer(cols = c(Food, Music, People.1),                       names_to = "variable",                       values_to = "value") %>%   dplyr::group_by(variable, value) %>%   dplyr::summarise(n = dplyr::n()) %>%   dplyr::mutate(value = factor(     value,     levels = c("Very Bad", "Bad", "Good", "Very Good"))   ) %>%   ggplot2::ggplot(ggplot2::aes(variable, n)) +   ggplot2::geom_bar(ggplot2::aes(fill = value),                     position = "dodge",                     stat = "identity")

The original answer:

First you need to get the counts for each category, i.e. how many Bads and Goods and so on are there for each group (Food, Music, People). This would be done like so:

Click to copy

raw <- read.csv("http://pastebin.com/raw.php?i=L8cEKcxS",sep=",") raw[,2]<-factor(raw[,2],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) raw[,3]<-factor(raw[,3],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE) raw[,4]<-factor(raw[,4],levels=c("Very Bad","Bad","Good","Very Good"),ordered=FALSE)  raw=raw[,c(2,3,4)] # getting rid of the "people" variable as I see no use for it  freq=table(col(raw), as.matrix(raw)) # get the counts of each factor level

Then you need to create a data frame out of it, melt it and plot it:

Click to copy

Names=c("Food","Music","People")     # create list of names data=data.frame(cbind(freq),Names)   # combine them into a data frame data=data[,c(5,3,1,2,4)]             # sort columns  # melt the data frame for plotting data.m <- melt(data, id.vars='Names')  # plot everything ggplot(data.m, aes(Names, value)) +      geom_bar(aes(fill = variable), position = "dodge", stat="identity")

Is this what you're after?

enter image description here

To clarify a little bit, in ggplot multiple grouping bar you had a data frame that looked like this:

Click to copy

> head(df)   ID Type Annee X1PCE X2PCE X3PCE X4PCE X5PCE X6PCE 1  1    A  1980   450   338   154    36    13     9 2  2    A  2000   288   407   212    54    16    23 3  3    A  2020   196   434   246    68    19    36 4  4    B  1980   111   326   441    90    21    11 5  5    B  2000    63   298   443   133    42    21 6  6    B  2020    36   257   462   162    55    30

Since you have numerical values in columns 4-9, which would later be plotted on the y axis, this can be easily transformed with reshape and plotted.

For our current data set, we needed something similar, so we used freq=table(col(raw), as.matrix(raw)) to get this:

Click to copy

> data    Names Very.Bad Bad Good Very.Good 1   Food        7   6    5         2 2  Music        5   5    7         3 3 People        6   3    7         4

Just imagine you have Very.Bad, Bad, Good and so on instead of X1PCE, X2PCE, X3PCE. See the similarity? But we needed to create such structure first. Hence the freq=table(col(raw), as.matrix(raw)).

139

answered Oct 01 '22 22:10

jakub

Related questions
                            
                                Get a list of the data sets in a particular package
                            
                                reshape vs. reshape2 in R
                            
                                extracting standardized coefficients from lm in R
                            
                                How to get the name of the calling function inside the called routine?
                            
                                What are Replacement Functions in R?
                            
                                Sort matrix according to first column in R
                            
                                Set R plots x axis to show at y=0
                            
                                Reading data from PDF files into R
                            
                                Solution. How to install_github when there is a proxy
                            
                                Extract matrix column values by matrix column name
                            
                                How to slice data from a middle index until the end without using `length` in R (like you can in python)?
                            
                                Adjust Transparency (alpha) of stat_smooth lines, not just transparency of Confidence Interval
                            
                                lambda-like functions in R?
                            
                                dplyr: How to use group_by inside a function?
                            
                                Fast vectorized merge of list of data.frames by row
                            
                                Looping over a Date or POSIXct object results in a numeric iterator
                            
                                How do I open a script file in RStudio using an R command?
                            
                                How to annotate() ggplot with latex
                            
                                Subset rows in a data frame based on a vector of values
                            
                                Fill and border colour in geom_point (scale_colour_manual) in ggplot

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Grouped bar plot in ggplot

Tags:

r

ggplot2

reshape

bar-chart

reshape2

S12000

People also ask

1 Answers

jakub

Recent Activity

Donate For Us