Importing and accessing large data files in Shiny

Tags:

I have an app where I want to pull out values from a lookup table based on user inputs. The reference table is a statistical test, based on a calculation that'd be too slow to do for all the different combinations of user inputs. Hence, a lookup table for all the possibilities.

But... right now the table is about 60 MB (as .Rdata) or 214 MB (as .csv), and it'll get much larger if I expand the possible user inputs. I've already reduced the number of significant figures in the data (to 3) and removed the row/column names.

Obviously, I can preload the lookup table outside the reactive server function, but it'll still take a decent chunk of time to load in that data. Does anyone have any tips on dealing with large amounts of data in Shiny? Thanks!

420

asked Sep 04 '14 02:09

Laura Hughes

Video Answer

1 Answers

flaneuse, we are still working with a smaller set that you but we have been experimenting with:

Use rds for our data

As @jazzurro mentioned rds above, and you seem to know how to do this, but the syntax for others is below.

Format .rds allows you to bring in a single R object so you can rename it if needs be.

In your prep data code, for example:
```
mystorefile <- file.path("/my/path","data.rds")
# ... do data stuff

# Save down (assuming mydata holds your data frame or table)
saveRDS(mydata, file = mystorefile)
```
In your shiny code:
```
#  Load in my data
x <- readRDS(mystorefile)
```
Remember to copy your data .rds file into your app directory when you deploy. We use a data directory /myapp/data and then file.path for store file is changed to "./data" in our shiny code.
global.R

We have placed our readRDS calls to load in our data in this global file (instead of in server.R before shinyServer() call), so that is run once, and is available for all sessions, with the added bonus it can be seen by ui.R.

See this scoping explanation for R Shiny.
Slice and dice upfront

The standard daily reports use the most recent data. So I make a small latest.dt in my global.R of a smaller subset of my data. So the landing page with the latest charts work with this smaller data set to get faster charts.

The custom data tab which uses the full.dt then is on a separate tab. It is slower but at that stage the user is more patient, and is thinking of what dates and other parameters to choose.

This subset idea may help you.

Would be interested in what others (with more demanding data sets have tried)!

195

answered Oct 29 '22 11:10

micstr

Related questions
                            
                                collect only if query returns less than n_max rows
                            
                                How to change the order of the panels in simple Lattice graphs
                            
                                Is there an implementation of Hadley's ddply for python?
                            
                                Difference between installing a package from source and from compiled binary [duplicate]
                            
                                R connecting to EC2 instance for parallel processing
                            
                                "Incorrect number of dimensions" error, help me understand why
                            
                                How to avoid implicit character conversion when using apply on dataframe
                            
                                Behavior of <- NULL on lists versus data.frames for removing data
                            
                                How can I suppress the creation of a plot while calling a function in R?
                            
                                Unable to launch SparkR in RStudio
                            
                                increasing the distance between igraph nodes
                            
                                Save leaflet map in Shiny
                            
                                how to rearrange an order of matches between two data frames
                            
                                Complete R Session Size
                            
                                Clear startup screen in R / RStudio
                            
                                Configuration error when installing R on Linux [closed]
                            
                                Increase polygonal resolution of ggplot polar plots
                            
                                .Rd links to suggested package [closed]
                            
                                Return a list in dplyr mutate()
                            
                                RStudio knitr themes

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With