MS Access has a button to generate SQL code for finding duplicated rows. I don't know if SQL Server 2005/2008 Management Studio has this.

1. If it does, please point out where.
2. If it does not, please tell me how I can have a T-SQL helper for creating code like this.


How get the T-SQL code to find duplicates?

4 Answers

Well, if entire rows are duplicated in your table, then at the very least the table has no primary key set up; otherwise the primary key value would differ between the rows.

However, here's how to build a SQL query that finds duplicates over a set of columns:

SELECT col1, col2, col3, col4
FROM table
GROUP BY col1, col2, col3, col4
HAVING COUNT(*) > 1

This will find rows which, for columns col1-col4, have the same combination of values more than once.

For instance, in the following table, rows 2 and 3 would be duplicates:

PK    col1    col2    col3    col4    col5
1     1       2       3       4       6
2     1       3       4       7       7
3     1       3       4       7       10
4     2       3       1       4       5

The two rows share the same values in columns col1-col4 and thus, by that SQL, are considered duplicates. Expand the list of columns to include all the columns you wish to compare.
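As a rough sketch, the sample table above can be loaded into a table variable and the query run against it. The name @sample is made up for illustration, and the COUNT(*) column in the select list is an addition to the original query that shows how often each combination occurs. INSERT ... SELECT ... UNION ALL is used instead of a multi-row VALUES list so that the script should also run on SQL Server 2005.

```sql
-- Hypothetical table variable holding the sample rows from the table above.
DECLARE @sample TABLE (
    PK   int PRIMARY KEY,
    col1 int, col2 int, col3 int, col4 int, col5 int
);

INSERT INTO @sample (PK, col1, col2, col3, col4, col5)
SELECT 1, 1, 2, 3, 4, 6  UNION ALL
SELECT 2, 1, 3, 4, 7, 7  UNION ALL
SELECT 3, 1, 3, 4, 7, 10 UNION ALL
SELECT 4, 2, 3, 1, 4, 5;

-- Same grouping query as above, plus a count of rows per combination.
SELECT col1, col2, col3, col4, COUNT(*) AS DuplicateCount
FROM @sample
GROUP BY col1, col2, col3, col4
HAVING COUNT(*) > 1;
```

For this data the query returns a single row, the combination (1, 3, 4, 7) with a count of 2, which corresponds to the rows with PK 2 and 3.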

answered Oct 18 '22 12:10

Lasse V. Karlsen

If you're using SQL Server 2005+, you can use the following code to see all the rows along with other columns:

SELECT *,
    ROW_NUMBER() OVER (PARTITION BY col1, col2, col3, col4 ORDER BY (SELECT 0)) AS DuplicateRowNumber
FROM table

You can also delete (or otherwise work with) duplicates using this technique:

WITH cte AS
    (SELECT *,
        ROW_NUMBER() OVER (PARTITION BY col1, col2, col3, col4 ORDER BY (SELECT 0)) AS DuplicateRowNumber
    FROM table)
DELETE FROM cte WHERE DuplicateRowNumber > 1

ROW_NUMBER is extremely powerful and there is much you can do with it. See the Books Online (BOL) article on it at http://msdn.microsoft.com/en-us/library/ms186734.aspx
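Before running the DELETE, it may be worth previewing which rows would be removed. The following is a minimal sketch, assuming a hypothetical temp table #t with a key column PK. Ordering by PK instead of (SELECT 0) is a variation on the code above, not part of it; it makes the surviving row predictable (the lowest PK in each group), whereas ORDER BY (SELECT 0) leaves the choice to the server.

```sql
-- Hypothetical temp table; all names are for illustration only.
CREATE TABLE #t (PK int PRIMARY KEY, col1 int, col2 int);

INSERT INTO #t (PK, col1, col2)
SELECT 1, 1, 2 UNION ALL
SELECT 2, 1, 3 UNION ALL
SELECT 3, 1, 3 UNION ALL
SELECT 4, 2, 3;

-- Preview: list only the rows that the DELETE below would remove.
WITH cte AS (
    SELECT *,
        ROW_NUMBER() OVER (PARTITION BY col1, col2 ORDER BY PK) AS DuplicateRowNumber
    FROM #t
)
SELECT * FROM cte WHERE DuplicateRowNumber > 1;

-- Delete inside a transaction so the result can be checked before committing.
BEGIN TRANSACTION;

WITH cte AS (
    SELECT *,
        ROW_NUMBER() OVER (PARTITION BY col1, col2 ORDER BY PK) AS DuplicateRowNumber
    FROM #t
)
DELETE FROM cte WHERE DuplicateRowNumber > 1;

SELECT * FROM #t;      -- inspect what is left
ROLLBACK TRANSACTION;  -- or COMMIT TRANSACTION once satisfied

DROP TABLE #t;
```

Note that a statement preceding WITH has to be terminated with a semicolon, which is why every statement in the sketch ends with one.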

answered Oct 18 '22 14:10

Mike DeFehr

I found this solution when I needed to dump entire rows with one or more duplicate fields but didn't want to type every field name in the table:

SELECT * FROM db WHERE col IN
    (SELECT col FROM db GROUP BY col HAVING COUNT(*) > 1)
    ORDER BY col
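The IN form above compares a single column. When the duplicate check spans several columns, one possible variation (a sketch, not part of the answer above) is to join the table to the grouped subquery instead. The table variable @db and its columns are hypothetical stand-ins for the table called db.

```sql
-- Hypothetical sample data; @db stands in for the table called db above.
DECLARE @db TABLE (id int, col1 int, col2 int, other varchar(10));

INSERT INTO @db (id, col1, col2, other)
SELECT 1, 1, 1, 'a' UNION ALL
SELECT 2, 1, 1, 'b' UNION ALL
SELECT 3, 1, 2, 'c';

-- Full rows whose (col1, col2) pair occurs more than once.
SELECT t.*
FROM @db AS t
INNER JOIN (
    SELECT col1, col2
    FROM @db
    GROUP BY col1, col2
    HAVING COUNT(*) > 1
) AS d
    ON d.col1 = t.col1 AND d.col2 = t.col2
ORDER BY t.col1, t.col2;
```

For this data the rows with id 1 and 2 come back. One caveat: GROUP BY puts NULLs into one group, but the equality comparisons in the join never match NULL, so duplicates that contain NULL in a compared column would be missed by this form.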

answered Oct 18 '22 12:10

Ferruccio

AFAIK, it doesn't. Just make a select statement grouping by all the fields of a table, and filtering using a having clause where the count is greater than 1.

If your rows are duplicated except for the key, then don't include the key in the select fields.
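Since Management Studio apparently has no such button, a small T-SQL helper can generate the statement from the catalog views. The following is only a sketch under stated assumptions: dbo.MyTable is a hypothetical table name, identity columns are treated as "the key" and left out (a primary key that is not an identity column would need a lookup in sys.indexes and sys.index_columns instead), and the column list is concatenated with FOR XML PATH because STRING_AGG does not exist in SQL Server 2005/2008.

```sql
DECLARE @table sysname, @cols nvarchar(max), @sql nvarchar(max);
SET @table = N'dbo.MyTable';  -- hypothetical table name, replace with your own

-- Build a comma-separated, quoted column list, skipping identity columns.
SELECT @cols = STUFF((
    SELECT N', ' + QUOTENAME(c.name)
    FROM sys.columns AS c
    WHERE c.object_id = OBJECT_ID(@table)
      AND c.is_identity = 0
    ORDER BY c.column_id
    FOR XML PATH(''), TYPE).value('.', 'nvarchar(max)'), 1, 2, N'');

-- Assemble the GROUP BY / HAVING statement described above.
SET @sql = N'SELECT ' + @cols + N', COUNT(*) AS DuplicateCount'
         + N' FROM ' + @table
         + N' GROUP BY ' + @cols
         + N' HAVING COUNT(*) > 1;';

PRINT @sql;  -- copy the output into a query window, or run it with EXEC sp_executesql @sql
```

If the table is not found, @cols is NULL and nothing is printed. The sketch does not filter out column types that cannot be grouped (text, ntext, image, xml), so tables containing those would need an extra condition on the column type.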

answered Oct 18 '22 12:10

eKek0

Tags: tsql, sql-server-2005, ssms

Jader Dias
