I am trying to write the following MySQL query in PostgreSQL 8.0 (specifically, using Redshift): <pre class="prettyprint"><code>DELETE t1 FROM table t1 LEFT JOIN table t2 ON ( t1.field = t2.field AND t1.field2 = t2.field2 ) WHERE t1.field > 0 </code></pre> PostgreSQL 8.0 does not support <code>DELETE FROM table USING</code>. The examples in the docs say that you can reference columns in other tables in the where clause, but that doesn't work here as I'm joining on the same table I'm deleting from. The other example is a subselect query, but the primary key of the table I'm working with has four columns so I can't see a way to make that work either.

Amazon Redshift may be based on Postgres 8.0, but is a very much different thing. I don't use it, but the manual informs, that the <code>USING</code> clause is supported in <code>DELETE</code> statements: Just use the modern form: <pre class="prettyprint"><code>DELETE FROM tbl USING tbl t2 WHERE t2.field = tbl.field AND t2.field2 = tbl.field2 AND t2.pkey <> tbl.pkey -- exclude self-join AND tbl.field > 0; </code></pre> This is assuming <code>JOIN</code> instead of <code>LEFT JOIN</code> in your MySQL statement, which would not make any sense. I also added the condition <code>AND t2.pkey <> t1.pkey</code>, to make it a useful query. This excludes rows joining itself. <code>pkey</code> being the primary key column. What this query does: Delete all rows where at least one other row exists in the same table with the same not-null values in <code>field</code> and <code>field2</code>. All such duplicates are deleted without leaving a single row per set. To keep (for example) the row with the smallest <code>pkey</code> per set of duplicates, use <code>t2.pkey < t2.pkey</code>. An <code>EXISTS</code> semi-join (as @wilplasser already hinted) might be a better choice, especially if multiple rows could be joined (a row can only be deleted once anyway): <pre class="prettyprint"><code>DELETE FROM tbl WHERE field > 0 AND EXISTS ( SELECT 1 FROM tbl t2 WHERE t2.field = tbl.field AND t2.field2 = tbl.field2 AND t2.pkey <> tbl.pkey ); </code></pre>

Delete from table A joining on table A in Redshift

Tags:

sql

delete-row

amazon-redshift

I am trying to write the following MySQL query in PostgreSQL 8.0 (specifically, using Redshift):

DELETE t1 FROM table t1
LEFT JOIN table t2 ON (
    t1.field   = t2.field AND
    t1.field2  = t2.field2
)
WHERE t1.field > 0

PostgreSQL 8.0 does not support DELETE FROM table USING. The examples in the docs say that you can reference columns in other tables in the where clause, but that doesn't work here as I'm joining on the same table I'm deleting from. The other example is a subselect query, but the primary key of the table I'm working with has four columns so I can't see a way to make that work either.

405

asked Apr 30 '14 16:04

moinudin

2 Answers

Amazon Redshift may be based on Postgres 8.0, but is a very much different thing. I don't use it, but the manual informs, that the USING clause is supported in DELETE statements:

Just use the modern form:

DELETE FROM tbl
USING  tbl t2
WHERE  t2.field  = tbl.field
AND    t2.field2 = tbl.field2
AND    t2.pkey  <> tbl.pkey    -- exclude self-join
AND    tbl.field > 0;

This is assuming JOIN instead of LEFT JOIN in your MySQL statement, which would not make any sense. I also added the condition AND t2.pkey <> t1.pkey, to make it a useful query. This excludes rows joining itself. pkey being the primary key column.

What this query does:
Delete all rows where at least one other row exists in the same table with the same not-null values in field and field2. All such duplicates are deleted without leaving a single row per set.

To keep (for example) the row with the smallest pkey per set of duplicates, use t2.pkey < t2.pkey.

An EXISTS semi-join (as @wilplasser already hinted) might be a better choice, especially if multiple rows could be joined (a row can only be deleted once anyway):

DELETE FROM tbl
WHERE  field > 0
AND    EXISTS (
   SELECT 1
   FROM   tbl t2
   WHERE  t2.field  = tbl.field
   AND    t2.field2 = tbl.field2
   AND    t2.pkey  <> tbl.pkey 
   );

113

answered Sep 27 '22 19:09

Erwin Brandstetter

I don't understand the mysql syntax, but you probably want this:

DELETE FROM mytablet1
WHERE t1.field > 0
   -- don't need this self-join if {field,field2}
   -- are a candidate key for mytable
   -- (in that case, the exists-subquery would detect _exactly_ the
   -- same tuples as the ones to be deleted, which always succeeds)
-- AND EXISTS (
--     SELECT *
--     FROM mytable t2 
--     WHERE t1.field = t2.field
--     AND t1.field2  = t2.field2
--    )
    ;

Note: For testing purposes, you can replace the DELETE keyword by SELECT * or SELECT COUNT(*), and see which rows would be affected by the query.

answered Sep 27 '22 21:09

wildplasser

Related questions
                            
                                getting count from the same column in a mysql table?
                            
                                Sum of Multiple rows in MySql
                            
                                Crosstab Query with Dynamic Columns in SQL Server 2005 up
                            
                                Group by data intervals
                            
                                SELECT 5 most recent SQL Server
                            
                                How to convert java.util.Date into current time in timestamp..?
                            
                                SQL GROUP BY and a condition on COUNT
                            
                                SQL create table use %type at column
                            
                                Querying the inner join of two tables with the same column name, Column 'exName' in field list is ambiguous
                            
                                Why does EXEC retport an error of MUST DECLARE SCALAR VARIABLE
                            
                                Rails 3 Sum Product of two fields
                            
                                Can I improve this query?
                            
                                SELECT * - pros /cons
                            
                                How to compare two date values using SQL [duplicate]
                            
                                How can a blank MS Access database be created using VBA?
                            
                                Replace multiple characters from string without using any nested replace functions
                            
                                Difference between SQL Server codes?
                            
                                DBNull check for ExecuteScalar
                            
                                Change column value when matching condition
                            
                                Find rows with duplicate values in a column

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With