I've to duplicate values from one table to another (identical table schemes). What is better (performance): <ul> <li>Drop table1 and create as select * from table2</li> <li>Delete all rows from table1 and insert all rows from table2</li> </ul> Update: I've made a small test on table with almost 3k rows. Drop and create gives about 60ms vs Delete and insert - about 30ms.

I see four useful ways to replace the contents of the table. None of them is "obviously right", but it depends on your requirements. <ol> <li> (In a single transaction) <code>DELETE FROM foo; INSERT INTO foo SELECT ...</code> Pro: Best concurrency: doesn't lock out other transactions accessing the table, as it leverages Postgres's MVCC. Con: Probably the slowest if you measure the insert-speed alone. Causes autovacuum to clean up dead rows, thus creating a higher I/O load. </li> <li> <code>TRUNCATE foo; INSERT INTO foo SELECT ...</code> Pro: Fastest for smaller tables. Causes less write I/O than #1 Con: Excludes all other readers -- other transactions reading from the table will have to wait. </li> <li> <code>TRUNCATE foo</code>, DROP all indexes on table, <code>INSERT INTO foo SELECT ...</code>, re-create all indexes. Pro: Fastest for large tables, because creating indexes with <code>CREATE INDEX</code> is faster than updating them incrementally. Con: Same as #2 </li> <li> The switcheroo. Create two identical tables <code>foo</code> and <code>foo_tmp</code> <pre class="prettyprint"><code>TRUNCATE foo_tmp; INSERT INTO foo_tmp SELECT ...; ALTER TABLE foo RENAME TO foo_tmp1; ALTER TABLE foo_tmp RENAME TO foo; ALTER TABLE foo_tmp1 RENAME TO foo_tmp; </code></pre> Thanks to PostgreSQL's transactional DDL capabilities, if this is done in a transaction, the rename is performed without other transactions noticing. You can also combine this with #3 and drop/create indexes. Pro: Less I/O performed, like #2, and without locking out other readers (locks taken only during the rename part). Con: The most complicated. Also you cannot have foreign keys or views pointing to the table, as they would point to the wrong table after renaming it. </li> </ol>

DROP and CREATE vs DELETE and INSERT in PostgreSQL

1 Answers

I see four useful ways to replace the contents of the table. None of them is "obviously right", but it depends on your requirements.

(In a single transaction) DELETE FROM foo; INSERT INTO foo SELECT ...

Pro: Best concurrency: doesn't lock out other transactions accessing the table, as it leverages Postgres's MVCC.

Con: Probably the slowest if you measure the insert-speed alone. Causes autovacuum to clean up dead rows, thus creating a higher I/O load.
TRUNCATE foo; INSERT INTO foo SELECT ...

Pro: Fastest for smaller tables. Causes less write I/O than #1

Con: Excludes all other readers -- other transactions reading from the table will have to wait.
TRUNCATE foo, DROP all indexes on table, INSERT INTO foo SELECT ..., re-create all indexes.

Pro: Fastest for large tables, because creating indexes with CREATE INDEX is faster than updating them incrementally.

Con: Same as #2
The switcheroo. Create two identical tables foo and foo_tmp
```
TRUNCATE foo_tmp;
INSERT INTO foo_tmp SELECT ...;
ALTER TABLE foo RENAME TO foo_tmp1;
ALTER TABLE foo_tmp RENAME TO foo;
ALTER TABLE foo_tmp1 RENAME TO foo_tmp;
```
Thanks to PostgreSQL's transactional DDL capabilities, if this is done in a transaction, the rename is performed without other transactions noticing. You can also combine this with #3 and drop/create indexes.

Pro: Less I/O performed, like #2, and without locking out other readers (locks taken only during the rename part).

Con: The most complicated. Also you cannot have foreign keys or views pointing to the table, as they would point to the wrong table after renaming it.

answered Sep 22 '22 02:09

intgr

Related questions
                            
                                If-else-if versus map
                            
                                How are databases efficient?
                            
                                Fastest data structure for inserting/sorting
                            
                                Slow performing SQL query with triple self-join
                            
                                Reusing of a PreparedStatement between methods?
                            
                                speeding up data frame matching
                            
                                How can Google's Dart get better performance?
                            
                                Fastest math programming language?
                            
                                Enable Keep-Alive (Page Speed)
                            
                                Find out how many Milliseconds a C# program has taken to execute in your debugger
                            
                                Faster implementation of verbal arithmetic in Prolog
                            
                                Memory Cache .Net 4.0 performance test : astonishing result
                            
                                VB.NET How give best performance "Select case" or IF... ELSEIF ... ELSE... END IF
                            
                                What is better to use in Java? x <= 10 or x < 11?
                            
                                Fastest way to filter a data.frame list column contents in R / Rcpp
                            
                                Performance impact of changing to generic interfaces
                            
                                Why this performance difference? (Exception catching)
                            
                                Statistics on Query Time (PostgreSQL)
                            
                                Do all profilers significantly slow execution?
                            
                                dbms_output.put_line

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

DROP and CREATE vs DELETE and INSERT in PostgreSQL

Tags:

performance

postgresql

mmatloka

People also ask

1 Answers

intgr

Recent Activity

Donate For Us