Persisting data in Google Colaboratory

Tags:

google-colaboratory

Has anyone figured out a way to keep files persisted across sessions in Google's newly open sourced Colaboratory?

Using the sample notebooks, I'm successfully authenticating and transferring csv files from my Google Drive instance and have stashed them in /tmp, my ~, and ~/datalab. Pandas can read them just fine off of disk too. But once the session times out , it looks like the whole filesystem is wiped and a new VM is spun up, without downloaded files.

I guess this isn't surprising given Google's Colaboratory Faq:

Q: Where is my code executed? What happens to my execution state if I close the browser window?

A: Code is executed in a virtual machine dedicated to your account. Virtual machines are recycled when idle for a while, and have a maximum lifetime enforced by the system.

Given that, maybe this is a feature (ie "go use Google Cloud Storage, which works fine in Colaboratory")? When I first used the tool, I was hoping that any .csv files that were in the My File/Colab Notebooks Google Drive folder would be also loaded onto the VM instance that the notebook was running on :/

632

asked Nov 09 '17 04:11

user3424705

2 Answers

Put that before your code, so will always download your file before run your code.

!wget -q http://www.yoursite.com/file.csv

186

answered Sep 29 '22 09:09

Marcel Pinheiro

Your interpretation is correct. VMs are ephemeral and recycled after periods of inactivity. There's no mechanism for persistent data on the VM itself right now.

In order for data to persist, you'll need to store it somewhere outside of the VM, e.g., Drive, GCS, or any other cloud hosting provider.

Some recipes for loading and saving data from external sources is available in the I/O example notebook.

answered Sep 29 '22 09:09

Bob Smith

Related questions
                            
                                Matplotlib - Plot a plane and points in 3D simultaneously
                            
                                Keras flowFromDirectory get file names as they are being generated
                            
                                Python inheritance - how to call grandparent method?
                            
                                matplotlib Axes.plot() vs pyplot.plot()
                            
                                Python 2.7 not working anymore: cannot import name md5
                            
                                Why use pandas.assign rather than simply initialize new column?
                            
                                Making a python iterator go backwards?
                            
                                Pickle with custom classes
                            
                                What is a unicode string? [closed]
                            
                                Force child class to call parent method when overriding it
                            
                                Duplicated rows when merging dataframes in Python
                            
                                ElasticSearch updates are not immediate, how do you wait for ElasticSearch to finish updating it's index?
                            
                                Python Headless MatplotLib / Pyplot [duplicate]
                            
                                List as a member of a python class, why is its contents being shared across all instances of the class?
                            
                                How to determine if Python script was run via command line?
                            
                                How to convert `ctime` to `datetime` in Python?
                            
                                Pandas: create named columns in DataFrame from dict
                            
                                Django test coverage vs code coverage
                            
                                Are functions objects in Python?
                            
                                tkinter: how to use after method

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With