What is the most useful output format for graphs? [closed]

Tags:

r

Before any of you run at the closing vote let me say that I understand that this question may be subjective, and the expected answer may begin by "it depends". Nevertheless, it is an actually relevant problem I run into, as I am creating more and more graphs, and I don't necessarily know the exact way I am going to use them, or don't have the time to test for the final use case immediately.

So I am leveraging the experience of SO R users to get good reasons to choose one over the other, between jpg(), bmp(), png(), tiff(), pdf() and possibly with which options. I don't have the experience in R and the knowledge in the different formats to choose wisely.

Potential use cases:

quick look after or during run time of algorithms
presentations (.ppt mainly)
reports (word or latex)
publication (internet)
storage (without too much loss and to transform it later for a specific use)
anything relevant I forgot

Thanks! I'm happy to make the question clearer.

754

asked Sep 06 '13 01:09

Antoine Lizée

2 Answers

To expand a little on my comment, there is no real easy answer, but my suggestions:

My first totally flexible choice would be to simply store the final raw data used in the plot(s) and a bit of R code for generating the plot(s). That way you could easily enough send the output to whatever device that suits your particular purpose. It would not be that arduous a task to set yourself up a couple of basic templates based on png()/pdf() that you could call upon.
Use the svg() device. As noted by @gung, storing the output using pdf() , svg() , cairo_ps() or cairo_pdf() are your only real options for retaining scalable, vector images. I would tend to lean towards svg() rather than pdf() due to the greater editing options available using programs like Inkscape. It is also becoming a quite widely supported format for internet publication (see - http://caniuse.com/svg )
If on the other hand you're a latex user, most headaches seem to be solved by going straight to pdf() - you can usually import and convert pdf files using Inkscape or command line utilities like Imagemagick if you have to format shift.
For Word/Powerpoint interaction, if you are running R on Windows, you can also export directly using win.metafile() which will give you scalable/component emf images which you can import into Word or Powerpoint directly. I have heard of people running R through Wine or using intermediary steps on Linux to get emf files out for later use. For Mac, there are roundabout pathways as well.

So, to summarise, in order of preference.

Don't store images at all, store code to generate images
Use svg/pdf and convert formats as required.
Use a backup win.metafile export directly for those cases where you can't escape using Word/Powerpoint and are primarily going to be based on Windows systems.

154

answered Sep 20 '22 19:09

thelatemail

So far the answers for this question have all recommended outputting plots in vector based formats. This will give you the best output, allowing you to resize your image as you need for whatever medium your image will end up in (whether that be a webpage, document, or presentation), but this comes at a computational cost.

For my own work, I often find it is much more convenient to save my plots in a raster format of sufficient resolution. You probably want to do this whenever your data takes a non-trivial amount of time to plot.

Some examples of where I find a raster format is more convenient:

Manhattan plots: A plot showing p-value significance for hundreds of thousands-millions of DNA markers across a genome.
Large Heatmaps: Clustering the top 5000 differentially expressed genes between two groups of people, one with a disease, and one healthy.
Network Rendering: When drawing a large number of nodes connected to each other by edges, redrawing the edges (as vectors) can slow down your computer.

Ultimately it comes down to a trade-off in your own sanity. What annoys you more? your computer grinding to a halt trying to redraw an image? or figuring out the exact dimensions to render an image in raster format so it doesn't look awful for your final publishing medium?

answered Sep 24 '22 19:09

Scott Ritchie

Related questions
                            
                                Replacement for diff() for multiple columns
                            
                                Convert a mm-yy string "Jan-01" into date format [duplicate]
                            
                                R, passing variables to a system command
                            
                                Reading space separated numbers in R
                            
                                Pretty-printing of character strings (ensuring automatic line-breaks to stay within a given print margin)
                            
                                Function to add names to data frame
                            
                                Using R to parse and return text in parenthesis
                            
                                add header to file created by "write.csv"
                            
                                How to count the number of sentences in a text in R?
                            
                                Creating a cumulative step graph in R
                            
                                money representation in R
                            
                                Use for loop to plot multiple lines in single plot with ggplot2
                            
                                time display in clock with xy scatter plot in r
                            
                                Keyed lookup on data.table without 'with'
                            
                                Use object names within a list in lapply/ldply
                            
                                ggplot2: How to specify multiple fill colors for points that are connected by lines of different colors
                            
                                how to generate random numbers with sequence in R
                            
                                how to draw arrow in ggplot2 with annotation
                            
                                Change thickness of a marker in ggplot2
                            
                                How can I shorten x-axis label text in ggplot?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With