<p>Here's the situation : With Heroku & Postgres, you can have automatically generated backups dump file. <strong>But what can you do with it?</strong></p> <ol> <li>Dump it on your database, if you want to fully go back to the backup state</li> <li>Dump it locally to "have a look", or to use production data in development environment</li> <li>Set back specific rows of your database in a previous state (eg. restore accidentally deleted rows)</li> </ol> <p>I found myself so much <strong>struggling</strong> about latter point that I wanted to share how I have done it.</p> <p><strong>How to restore specific data from previous backup on Postgres Heroku?</strong></p>

<h3>Summary / TL;DR</h3> <p>In 3 steps you'll be able to execute very simply:</p> <pre class="prettyprint"><code>INSERT INTO production_db.table_name SELECT * FROM backup_db.table_name -- backup_db being remote </code></pre> <p>First install the backup locally, second get a SQL script, third open your localhost to the outside world with <strong>ngrok</strong>.</p> <h3>Let's go?</h3> <h3>1. Download your dump file on Heroku and dump it somewhere:</h3> <ul> <li>You can do that on a remote database if you have some servers available. But if like me you don't want to provision another production database on Heroku or somewhere else, locally will totally do.</li> <li>I like to use PGAdmin (available on Linux, Mac and Windows), but using command line and <code>psql</code> will also do (by reading this post by example)</li> <li>In PGAdmin, you'd do <code>Create a database</code>. Then right click on it and use the <code>restore</code> function. Select your dump file, click <code>Restore</code> and you're all set : <strong>your backup data is available locally!</strong> Good job!</li> </ul> <h3>2. Access it from your remote database</h3> <p>I wanted to do the following:</p> <pre class="prettyprint"><code>SELECT * FROM backup_db.table_name -- So I could then do INSERT INTO production_db.table_name SELECT * FROM backup_db.table_name </code></pre> <p>And I would be all set. Super easy, right? Pretty obvious? This must have been done hundreds of times already. Well, no!</p> <p>There is a utility called <code>db_link</code> in Postgres 9.1+, but it is pretty constraining as the following syntax applies:</p> <pre class="prettyprint"><code>SELECT fname, lname FROM db_link('host=localhost dbname=backup-28-08', 'SELECT fname, lname FROM users') AS remote (varchar255 fname varchar255 lname) </code></pre> <p>Every column name needs to be repeated twice including its type. Pretty heavy, we are far from the simple <code>SELECT * FROM backup_db.table_name</code></p> <p>So the idea here is to use the <code>information_schema</code> table content, which describes each table with its column names, its types etc. I found this question on SO: Specify dblink column definition list from a local existing type which helped me <strong>a lot</strong> (Thanks bentrm).</p> <p>But its solution was a two steps process, first generating a function, then querying it:</p> <pre class="prettyprint"><code>SELECT dblink_star_func('dbname=ben', 'public', 'test'); SELECT * FROM star_test() WHERE data = 'success'; </code></pre> <p>And I was still aiming at a 1 liner. After some little pain (not being a SQL Guru), here is the Gist : https://gist.github.com/augnustin/d30973ea8b5bf0067841</p> <p>I now can do:</p> <pre class="prettyprint"><code>SELECT * FROM remote_db(NULL::users) -- (Still not 100% about why I need the NULL::) -- And also INSERT INTO users SELECT * FROM remote_db(NULL::users) </code></pre> <p>Awesome, right?</p> <h3>3. Access localhost remotely</h3> <p>If your remote database is already available from the internet (=has an IP address, a domain name Eg. for Heroku it will look like: <code>ec2-54-217-229-169.eu-west-1.compute.amazonaws.com:5672/df68cfpbufjd9p</code>) <strong>you can skip this step</strong>. But if you use your local database, you need to make it available from the outside world (so that the Heroku database can access it).</p> <p>For this, I use the <strong>wonderful ngrok</strong>.</p> <p>Once installed I only need to enter the following command:</p> <pre class="prettyprint"><code>ngrok -proto=tcp 5432 #5432 being the default port for Postgresql. (Adapt if necessary) Tunnel Status online Version 1.7/1.6 Forwarding tcp://ngrok.com:51727 -> 127.0.0.1:5432 Web Interface 127.0.0.1:4040 # Conn 0 Avg Conn Time 0.00ms </code></pre> <p>And you'd only need to plug <code>db_link</code> (in the gist) to <code>host=ngrock.com port=51727</code> and you are <strong>good to go</strong>!</p> <h3>4. Going further</h3> <p>There are many possible improvements to this. Here are some I see already:</p> <ul> <li>Considering the script as a default feature to <code>db_link</code> function</li> <li>Being more error-proof if database structures are different in backup and production</li> <li>Making comparison tool between database results and backup results (to only return diffing lines)</li> <li>Handle simple joins</li> <li>And even further would be to have an application level adapter (Eg. ActiveRecord in Rails) that could allow manipulation of backend objects instead of raw SQL like now</li> </ul> <p>Hope I was clear! Please ask for more details otherwise</p>

How to restore specific data from previous backup on Postgres Heroku? (Eg. Accidentally deleted rows)

1 Answers

Summary / TL;DR

In 3 steps you'll be able to execute very simply:

INSERT INTO production_db.table_name
SELECT * FROM backup_db.table_name -- backup_db being remote

First install the backup locally, second get a SQL script, third open your localhost to the outside world with ngrok.

Let's go?

1. Download your dump file on Heroku and dump it somewhere:

You can do that on a remote database if you have some servers available. But if like me you don't want to provision another production database on Heroku or somewhere else, locally will totally do.
I like to use PGAdmin (available on Linux, Mac and Windows), but using command line and psql will also do (by reading this post by example)
In PGAdmin, you'd do Create a database. Then right click on it and use the restore function. Select your dump file, click Restore and you're all set : your backup data is available locally! Good job!

2. Access it from your remote database

I wanted to do the following:

SELECT * FROM backup_db.table_name
-- So I could then do
INSERT INTO production_db.table_name
SELECT * FROM backup_db.table_name

And I would be all set. Super easy, right? Pretty obvious? This must have been done hundreds of times already. Well, no!

There is a utility called db_link in Postgres 9.1+, but it is pretty constraining as the following syntax applies:

SELECT fname, lname FROM db_link('host=localhost dbname=backup-28-08', 'SELECT fname, lname FROM users') AS remote (varchar255 fname varchar255 lname)

Every column name needs to be repeated twice including its type. Pretty heavy, we are far from the simple SELECT * FROM backup_db.table_name

So the idea here is to use the information_schema table content, which describes each table with its column names, its types etc. I found this question on SO: Specify dblink column definition list from a local existing type which helped me a lot (Thanks bentrm).

But its solution was a two steps process, first generating a function, then querying it:

SELECT dblink_star_func('dbname=ben', 'public', 'test');
SELECT * FROM star_test() WHERE data = 'success';

And I was still aiming at a 1 liner. After some little pain (not being a SQL Guru), here is the Gist : https://gist.github.com/augnustin/d30973ea8b5bf0067841

I now can do:

SELECT * FROM remote_db(NULL::users) -- (Still not 100% about why I need the NULL::)
-- And also
INSERT INTO users
SELECT * FROM remote_db(NULL::users)

Awesome, right?

3. Access localhost remotely

If your remote database is already available from the internet (=has an IP address, a domain name Eg. for Heroku it will look like: ec2-54-217-229-169.eu-west-1.compute.amazonaws.com:5672/df68cfpbufjd9p) you can skip this step. But if you use your local database, you need to make it available from the outside world (so that the Heroku database can access it).

For this, I use the wonderful ngrok.

Once installed I only need to enter the following command:

ngrok -proto=tcp 5432 #5432 being the default port for Postgresql. (Adapt if necessary)
                                                                                                                                                                                                    
Tunnel Status                 online                                                                                                                                                                
Version                       1.7/1.6                                                                                                                                                               
Forwarding                    tcp://ngrok.com:51727 -> 127.0.0.1:5432                                                                                                                               
Web Interface                 127.0.0.1:4040                                                                                                                                                        
# Conn                        0                                                                                                                                                                     
Avg Conn Time                 0.00ms

And you'd only need to plug db_link (in the gist) to host=ngrock.com port=51727 and you are good to go!

4. Going further

There are many possible improvements to this. Here are some I see already:

Considering the script as a default feature to db_link function
Being more error-proof if database structures are different in backup and production
Making comparison tool between database results and backup results (to only return diffing lines)
Handle simple joins
And even further would be to have an application level adapter (Eg. ActiveRecord in Rails) that could allow manipulation of backend objects instead of raw SQL like now

Hope I was clear! Please ask for more details otherwise

179

answered Sep 28 '22 18:09

6 revs, 3 users 98%

Related questions
                            
                                pg_largeobject access on heroku
                            
                                APNS token collision, stored in Postgres
                            
                                Ruby on Rails / PostgreSQL - Library not loaded error when starting server
                            
                                Connecting to Heroku Postgres using SSH (Putty)
                            
                                ISO 8601 format date for PostgreSQL
                            
                                Comparison of JSON and User-Defined types in Postgres 9.3
                            
                                Rails: Faster way to perform updates on many records
                            
                                Insert multiple rows in one table based on number in another table
                            
                                How to programmatically check if row is deletable?
                            
                                What is a 'Schema' in PostgreSQL? [closed]
                            
                                Postgres geolocation points Distance
                            
                                Using npgsql 12 and ef 6 together - have anyone succeeded with it?
                            
                                First and last value aggregate functions in postgresql that work correctly with NULL values
                            
                                Postgres SELECT in Go returns all columns as string (using pq and database/sql)
                            
                                Check whether sqlalchemy table is empty
                            
                                Recursive function in postgres
                            
                                Convert executed SQL result to a list of Model object
                            
                                Error installing node-postgres on Amazon Linux. Missing pg_config.h file
                            
                                Find all intersections of all sets of ranges in PostgreSQL
                            
                                Docker - Tomcat and PostgreSQL containers in same host - No Route to host

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to restore specific data from previous backup on Postgres Heroku? (Eg. Accidentally deleted rows)

Tags:

postgresql

heroku

ngrok

Augustin Riedinger

People also ask