Delete Duplicate Rows in SQL

I have a table in which every row has a unique Id, but some rows otherwise contain duplicate information.

I can find the duplicated combinations using this query:

SELECT
    PersonAliasId, StartDateTime, GroupId, COUNT(*) as Count
FROM
    Attendance
GROUP BY
    PersonAliasId, StartDateTime, GroupId
HAVING
    COUNT(*) > 1

I can manually delete the extra rows for a single combination, keeping the one I need, with this query:

DELETE
FROM Attendance
WHERE Id IN (
    SELECT
        Id
    FROM
        Attendance
    WHERE PersonAliasId = 15
        AND StartDateTime = '9/24/2017'
        AND GroupId = 1429
    ORDER BY ModifiedDateTime DESC
    OFFSET 1 ROWS
)

I am not versed enough in SQL to figure out how to use the rows from the first query to delete the duplicates while leaving the most recent row behind. The first query returns over 3481 records, far too many to handle one by one manually.

How can I find the duplicate rows like the first query and delete all but the most recent like the second?

asked Jan 23 '18 16:01

Kevin Rutledge

2 Answers

You can use a Common Table Expression to delete the duplicates:

WITH Cte AS(
    SELECT *,
        Rn = ROW_NUMBER() OVER(PARTITION BY PersonAliasId, StartDateTime, GroupId
                                ORDER BY ModifiedDateTime DESC)
    FROM Attendance
)
DELETE FROM Cte WHERE Rn > 1;

This keeps the most recent record (the one with the latest ModifiedDateTime) for each PersonAliasId, StartDateTime, GroupId combination and deletes the rest.
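Before running a DELETE like this, it can help to preview which rows would be removed. The following is a minimal sketch that reuses the same ROW_NUMBER() numbering but only selects the rows with Rn > 1. The extra Id DESC tie-breaker is an assumption added here so the numbering is deterministic when two rows share the same ModifiedDateTime; it is not part of the answer above.

```sql
-- Preview only: lists the rows a "DELETE ... WHERE Rn > 1" would remove.
WITH Cte AS (
    SELECT
        Id, PersonAliasId, StartDateTime, GroupId, ModifiedDateTime,
        ROW_NUMBER() OVER (
            PARTITION BY PersonAliasId, StartDateTime, GroupId
            -- Id DESC is an assumed tie-breaker for equal ModifiedDateTime values
            ORDER BY ModifiedDateTime DESC, Id DESC
        ) AS Rn
    FROM Attendance
)
SELECT Id, PersonAliasId, StartDateTime, GroupId, ModifiedDateTime
FROM Cte
WHERE Rn > 1
ORDER BY PersonAliasId, StartDateTime, GroupId, ModifiedDateTime DESC;
```

If the listed rows look right, the same CTE can be used with the DELETE; running it inside a transaction leaves room to roll back.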

answered Sep 22 '22 14:09

Felix Pamittan

Use the MAX aggregate function to identify the latest ModifiedDateTime for each PersonAliasId, StartDateTime, GroupId combination. Then delete the records in that combination that are older than that latest time.

DELETE a
FROM Attendance AS a
INNER JOIN (
    SELECT
        PersonAliasId, StartDateTime, GroupId,
        MAX(ModifiedDateTime) AS LatestTime
    FROM
        Attendance
    GROUP BY
        PersonAliasId, StartDateTime, GroupId
    HAVING
        COUNT(*) > 1
) AS b
    ON a.PersonAliasId = b.PersonAliasId
    AND a.StartDateTime = b.StartDateTime
    AND a.GroupId = b.GroupId
    AND a.ModifiedDateTime < b.LatestTime;
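A join against a MAX value cannot tell apart rows that share the same maximum, so tied rows would all survive a delete of this kind. Assuming "most recent" is judged by ModifiedDateTime within each PersonAliasId, StartDateTime, GroupId combination, as in the question, this sketch lists the combinations where the latest value is shared by more than one row so they can be handled separately.

```sql
-- Combinations whose latest ModifiedDateTime is shared by more than one row.
SELECT
    a.PersonAliasId, a.StartDateTime, a.GroupId,
    a.ModifiedDateTime, COUNT(*) AS TiedRows
FROM Attendance AS a
INNER JOIN (
    SELECT
        PersonAliasId, StartDateTime, GroupId,
        MAX(ModifiedDateTime) AS LatestTime
    FROM Attendance
    GROUP BY PersonAliasId, StartDateTime, GroupId
) AS b
    ON a.PersonAliasId = b.PersonAliasId
    AND a.StartDateTime = b.StartDateTime
    AND a.GroupId = b.GroupId
    AND a.ModifiedDateTime = b.LatestTime
GROUP BY a.PersonAliasId, a.StartDateTime, a.GroupId, a.ModifiedDateTime
HAVING COUNT(*) > 1;
```

An empty result suggests there are no ties on the latest value. Any combinations that do appear need a further tie-breaker, such as Id.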

answered Sep 20 '22 14:09

Greg Viers

Tags: sql, tsql, sql-server-2016
