Bulk updating existing rows in Redshift

Tags:

This seems like it should be easy, but isn't. I'm migrating a query from MySQL to Redshift of the form:

INSERT INTO table
(...)
VALUES
(...)
ON DUPLICATE KEY UPDATE
  value = MIN(value, VALUES(value))

For primary keys we're inserting that aren't already in the table, those are just inserted. For primary keys that are already in the table, we update the row's values based on a condition that depends on the existing and new values in the row.

http://docs.aws.amazon.com/redshift/latest/dg/merge-replacing-existing-rows.html does not work, because filter_expression in my case depends on the current entries in the table. I'm currently creating a staging table, inserting into it with a COPY statement and am trying to figure out the best way to merge the staging and real tables.

586

asked Mar 20 '14 19:03

moinudin

1 Answers

I'm having to do exactly this for a project right now. The method I'm using involves 3 steps:

Run an update that addresses changed fields (I'm updating whether or not the fields have changed, but you can certainly qualify that):

update table1 set col1=s.col1, col2=s.col2,...
from table1 t
 join stagetable s on s.primkey=t.primkey;

Run an insert that addresses new records:

insert into table1
select s.* 
from stagetable s 
 left outer join table1 t on s.primkey=t.primkey
where t.primkey is null;

Mark rows no longer in the source as inactive (our reporting tool uses views that filter inactive records):

update table1 
set is_active_flag='N', last_updated=sysdate
from table1 t
 left outer join stagetable s on s.primkey=t.primkey
where s.primkey is null;

117

answered Oct 03 '22 05:10

mike_pdb

Related questions
                            
                                how does codeigniter sanitize inputs?
                            
                                How do I edit BLOBs (containing JSON) in Oracle SQL Developer?
                            
                                Application users account registration and login, best way to handle?
                            
                                Does SQL Server TOP stop processing once it finds enough rows?
                            
                                Select values from a table that are not in a list SQL
                            
                                Is there a performance difference in using a GROUP BY with MAX() as the aggregate vs ROW_NUMBER over partition by?
                            
                                Postgres Creating Schema in a specific database
                            
                                Is it possible to use subquery in join condition in Access?
                            
                                PostgreSQL - Installing JDBC driver
                            
                                Detecting SQL Server Utilization with a query
                            
                                Django ORM version of SQL COUNT(DISTINCT <column>)
                            
                                Show/Change Work Directory in SQLite
                            
                                Limit SQL by the sum of the row's value
                            
                                Range scan vs Unique Scan vs Skip Scan [closed]
                            
                                Oracle OCI, bind variables, and queries like ID IN (1, 2, 3)
                            
                                Can a readcommitted isolation level ever result in a deadlock (Sql Server)?
                            
                                How to prevent query injection on Google Big Query
                            
                                How to use CTE's with update/delete on SQLite?
                            
                                SSRS Reports - force table to expand to bottom of page
                            
                                EF5 db.Database.SqlQuery mapping returned objects

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Bulk updating existing rows in Redshift

Tags:

sql

postgresql

amazon-redshift

moinudin

People also ask

1 Answers

mike_pdb

Recent Activity

Donate For Us