I would like to plot each column of a data.frame using a histogram on one page. Here is an example using the sample "diamonds" data set which comes with R: <pre class="prettyprint"><code>p = list() for (i in 1:ncol(diamonds)) p[[i]] <- qplot(diamonds[,i], xlab=names(diamonds)[[i]]) do.call(grid.arrange, p) </code></pre> <img src="https://i.stack.imgur.com/VJLp7.png" alt="enter image description here"> This does plot all the columns, but the data looks the same in each one. So, something is clearly wrong. Is this the right approach for this task? I'm sure I have some silly syntax somewhere that is assigning the same column data set to each element in the list, but I'm not sure what it is. Thank you

Here you go: <pre class="prettyprint"><code>library(reshape2) library(ggplot2) d <- melt(diamonds[,-c(2:4)]) ggplot(d,aes(x = value)) + facet_wrap(~variable,scales = "free_x") + geom_histogram() </code></pre> <img src="https://i.stack.imgur.com/cyUV0.png" alt="enter image description here"> <code>melt</code>ing allows us to use the resulting grouping variables (called <code>variable</code>) to split the data into groups and plot a histogram for each one. Note the use of <code>scales = "free_x"</code> because each of the variables has a markedly different range and scale.

Plot every column in a data frame as a histogram on one page using ggplot

Tags:

r

ggplot2

I would like to plot each column of a data.frame using a histogram on one page. Here is an example using the sample "diamonds" data set which comes with R:

p = list()
for (i in 1:ncol(diamonds)) p[[i]] <- qplot(diamonds[,i], xlab=names(diamonds)[[i]])
do.call(grid.arrange, p)

enter image description here

This does plot all the columns, but the data looks the same in each one. So, something is clearly wrong.

Is this the right approach for this task? I'm sure I have some silly syntax somewhere that is assigning the same column data set to each element in the list, but I'm not sure what it is.

Thank you

886

asked Oct 23 '12 17:10

oneself

1 Answers

Here you go:

library(reshape2)
library(ggplot2)
d <- melt(diamonds[,-c(2:4)])
ggplot(d,aes(x = value)) + 
    facet_wrap(~variable,scales = "free_x") + 
    geom_histogram()

enter image description here

melting allows us to use the resulting grouping variables (called variable) to split the data into groups and plot a histogram for each one. Note the use of scales = "free_x" because each of the variables has a markedly different range and scale.

answered Oct 24 '22 11:10

joran

Related questions
                            
                                Lookup table based on multiple conditions in R
                            
                                How to get the arrow package for R with lz4 support?
                            
                                How to mutate a column based on values occurring in a particular sequence?
                            
                                Multiple Processes Instead of for loop in R
                            
                                readxl, selected worksheets in single .xlsx-workbook
                            
                                Define mlr3 task using data from a database (different tables)?
                            
                                Setting multiple and different attributes for columns of a data.table
                            
                                How to create matrix of distribution in R
                            
                                How to sort multiple tables in Shiny
                            
                                Remove line from polygon crossing the international dateline in R (e.g. Russia in rnaturalearth)
                            
                                Resetting R random number generator (rlecuyer) for inner loops using Snow/doSNOW
                            
                                Avoid legend duplication in plotly conversion from ggplot with facet_wrap
                            
                                Find two keywords if they are between 0 and 3 words apart
                            
                                Converting image array to RGB to HSL/HSV and back?
                            
                                Error message when using between() function with variable names
                            
                                Create lower triangle genetic distance matrix
                            
                                Extract p-value from gam.check in R
                            
                                R RJDBC java.lang.OutOfMemoryError
                            
                                Remove Plot Margins in ggplot2

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With