I have the below table and now I need to delete the rows which are having duplicate "refIDs" but have atleast one row with that ref, i.e i need to remove row 4 and 5. please help me on this <pre class="prettyprint"><code>+----+-------+--------+--+ | ID | refID | data | | +----+-------+--------+--+ | 1 | 1023 | aaaaaa | | | 2 | 1024 | bbbbbb | | | 3 | 1025 | cccccc | | | 4 | 1023 | ffffff | | | 5 | 1023 | gggggg | | | 6 | 1022 | rrrrrr | | +----+-------+--------+--+ </code></pre>

In MySQL, you can do this with a <code>join</code> in <code>delete</code>: <pre class="prettyprint"><code>delete t from table t left join (select min(id) as id from table t group by refId ) tokeep on t.id = tokeep.id where tokeep.id is null; </code></pre> For each <code>RefId</code>, the subquery calculates the minimum of the <code>id</code> column (presumed to be unique over the whole table). It uses a <code>left join</code> for the match, so anything that doesn't match has a <code>NULL</code> value for <code>tokeep.id</code>. These are the ones that are deleted.

remove duplicate rows based on one column value

Tags:

sql

mysql

I have the below table and now I need to delete the rows which are having duplicate "refIDs" but have atleast one row with that ref, i.e i need to remove row 4 and 5. please help me on this

+----+-------+--------+--+
| ID | refID |  data  |  |
+----+-------+--------+--+
|  1 |  1023 | aaaaaa |  |
|  2 |  1024 | bbbbbb |  |
|  3 |  1025 | cccccc |  |
|  4 |  1023 | ffffff |  |
|  5 |  1023 | gggggg |  |
|  6 |  1022 | rrrrrr |  |
+----+-------+--------+--+

862

asked Feb 10 '15 12:02

Jeeppp

4 Answers

This is similar to Gordon Linoff's query, but without the subquery:

DELETE t1 FROM table t1
  JOIN table t2
  ON t2.refID = t1.refID
  AND t2.ID < t1.ID

This uses an inner join to only delete rows where there is another row with the same refID but lower ID.

The benefit of avoiding a subquery is being able to utilize an index for the search. This query should perform well with a multi-column index on refID + ID.

answered Oct 24 '22 05:10

Marcus Adams

I would do:

delete from t where 
ID not in (select min(ID) from table t group by refID having count(*) > 1)
and refID in (select refID from table t group by refID  having count(*) > 1)

criteria is refId is among the duplicates and ID is different from the min(id) from the duplicates. It would work better if refId is indexed

otherwise and provided you can issue multiple times the following query until it does not delete anything

delete from t 
where 
ID in (select max(ID) from table t group by refID  having count(*) > 1)

answered Oct 24 '22 06:10

NJ73

Some another variant, in some cases a bit faster than Marcus and NJ73 answers:

DELETE ourTable 
FROM ourTable JOIN 
 (SELECT ID,targetField 
  FROM ourTable 
  GROUP BY targetField HAVING COUNT(*) > 1) t2 
ON ourTable.targetField = t2.targetField AND ourTable.ID != t2.ID;

Hope that will help someone. On big tables Marcus answer stalls.

answered Oct 24 '22 04:10

user2501323

In MySQL, you can do this with a join in delete:

delete t
    from table t left join
         (select min(id) as id
          from table t
          group by refId
         ) tokeep
         on t.id = tokeep.id
    where tokeep.id is null;

For each RefId, the subquery calculates the minimum of the id column (presumed to be unique over the whole table). It uses a left join for the match, so anything that doesn't match has a NULL value for tokeep.id. These are the ones that are deleted.

answered Oct 24 '22 04:10

Gordon Linoff

Related questions
                            
                                What is the difference between an Index and a Foreign Key?
                            
                                mysql query to update field to max(field) + 1
                            
                                "Order by desc" in reverse order?
                            
                                What is a good database design (schema) for a attendance database? [closed]
                            
                                Selecting distinct dates from datetime column in a table
                            
                                PDO bindParam into one statement?
                            
                                Writing the data frame to MySql DB table
                            
                                User can't access a database
                            
                                Php/Mysql date saved as '0000-00-00'
                            
                                MySQL - Where Or?
                            
                                How willl I set MySQL enum datatype default value as 'No'?
                            
                                Why is "LIMIT 0" even allowed in MySQL SELECT statements?
                            
                                MySQL convert YEARWEEK to date
                            
                                WAMP server switch MySQL to MariaDB
                            
                                What's the best way to search a MySQL database with PHP?
                            
                                MySQL: How to determine foreign key relationships programmatically?
                            
                                wamp cannot load mysqli extension
                            
                                Trying to add 1 to current field value with MySQL, but can't figure out what's wrong with my syntax
                            
                                Mysql SUM with case statement
                            
                                Mysql left/inner join combination not working as expected

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With