Removing duplicate rows from a table in DB2 in a single query

Tags: sql, sql-delete, db2

I have a table with four columns, as below:

one   |   two    |  three  |   name
------------------------------------
 A1       B1          C1        xyz
 A1       B1          C1        pqr      -> should be deleted
 A1       B1          C1        lmn      -> should be deleted
 A2       B2          C2        abc
 A2       B2          C2        def      -> should be deleted
 A3       B3          C3        ghi
------------------------------------

The table does not have a primary key column. I have no control over the table, so I cannot add one.

As shown, I want to delete the rows where the combination of the one, two and three columns is the same. So if A1 B1 C1 occurs three times (as in the example above), two of those rows should be deleted and only one should remain.

How can I achieve this with a single query in DB2?

I need a single query because I will be running it from a Java program.

907

asked Apr 10 '12 11:04

Vicky

1 Answer

(This assumes you're on DB2 for Linux/Unix/Windows; other platforms may vary slightly.)

DELETE FROM
    (SELECT ROWNUMBER() OVER (PARTITION BY ONE, TWO, THREE) AS RN
     FROM SESSION.TEST) AS A
WHERE RN > 1;

This should get you what you're looking for.

The query uses the OLAP function ROWNUMBER() to number the rows within each ONE, TWO, THREE combination. DB2 is then able to match the rows referenced by the fullselect (A) to the rows that the DELETE statement should remove from the table. Because the OVER clause has no ORDER BY, which row in each group is numbered 1 (and therefore kept) is arbitrary. To be usable as the target of a DELETE statement, a fullselect has to satisfy the rules for a deletable view (see "deletable view" under the notes section of the CREATE VIEW statement documentation).
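
If you want to see the numbering before deleting anything, a plain SELECT over the same fullselect shows which rows would be hit. This is only a sketch against the SESSION.TEST sample table used in this answer; the final ORDER BY is just there to make the output easier to read.

```sql
-- Preview only: nothing is deleted here.
-- Every row with RN > 1 is one that the DELETE would remove.
SELECT ONE, TWO, THREE, NAME,
       ROWNUMBER() OVER (PARTITION BY ONE, TWO, THREE) AS RN
FROM SESSION.TEST
ORDER BY ONE, TWO, THREE, RN;
```

Each distinct ONE, TWO, THREE combination gets its own sequence starting at 1, so the A1/B1/C1 group is numbered 1 to 3, the A2/B2/C2 group 1 to 2, and the A3/B3/C3 row gets 1.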

Below is some proof (tested on LUW 9.7):

DECLARE GLOBAL TEMPORARY TABLE SESSION.TEST (
    one CHAR(2),
    two CHAR(2),
    three CHAR(2),
    name CHAR(3)
) ON COMMIT PRESERVE ROWS;

INSERT INTO SESSION.TEST VALUES 
    ('A1', 'B1', 'C1', 'xyz'),
    ('A1', 'B1', 'C1', 'pqr'),
    ('A1', 'B1', 'C1', 'lmn'),
    ('A2', 'B2', 'C2', 'abc'),
    ('A2', 'B2', 'C2', 'def'),
    ('A3', 'B3', 'C3', 'ghi');

DELETE FROM
    (SELECT ROWNUMBER() OVER (PARTITION BY ONE, TWO, THREE) AS RN
     FROM SESSION.TEST) AS A
WHERE RN > 1;

SELECT * FROM SESSION.TEST;
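
If it matters which row of each group survives, one possible variation is to add an ORDER BY inside the OVER clause. Ordering by NAME is purely an assumption for illustration (the question does not say which duplicate to keep), and I have not tested this variant on 9.7, so treat it as a sketch:

```sql
-- Variation (assumption: the row with the lowest NAME should be kept).
-- ORDER BY NAME makes RN = 1 the lowest NAME in each ONE, TWO, THREE group;
-- every other row in the group has RN > 1 and is deleted.
DELETE FROM
    (SELECT ROWNUMBER() OVER (PARTITION BY ONE, TWO, THREE ORDER BY NAME) AS RN
     FROM SESSION.TEST) AS A
WHERE RN > 1;
```

With the sample data above, this would keep 'lmn', 'abc' and 'ghi' rather than an arbitrary row from each group.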

Edit 2 March 2017:

In response to the question from Ahmed Anwar: if you need to capture what was deleted, you can also combine the delete with a "data change statement". In this example you could do something like the following, which returns the RN, ONE, TWO, and THREE values of each deleted row:

SELECT * FROM OLD TABLE (
    DELETE FROM
        (SELECT 
             ROWNUMBER() OVER (PARTITION BY ONE, TWO, THREE) AS RN
            ,ONE
            ,TWO
            ,THREE
         FROM SESSION.TEST) AS A
    WHERE RN > 1
) OLD;

132

answered Nov 03 '22 00:11

bhamby
