I am validating a table which has a transaction level data of an eCommerce site and find the exact errors. I want your help to find duplicate records in a 50 column table on SQL Server. Suppose my data is: <pre class="prettyprint"><code>OrderNo shoppername amountpayed city Item 1 Sam 10 A Iphone 1 Sam 10 A Iphone--->>Duplication to be detected 1 Sam 5 A Ipod 2 John 20 B Macbook 3 John 25 B Macbookair 4 Jack 5 A Ipod </code></pre> Suppose I use the below query: <pre class="prettyprint"><code>Select shoppername,count(*) as cnt from dbo.sales having count(*) > 1 group by shoppername </code></pre> will return me <pre class="prettyprint"><code>Sam 2 John 2 </code></pre> But I don't want to find duplicate just over 1 or 2 columns. I want to find the duplicate over all the columns together in my data. I want the result as: <pre class="prettyprint"><code>1 Sam 10 A Iphone </code></pre>

<pre class="prettyprint"><code>with x as (select *,rn = row_number() over(PARTITION BY OrderNo,item order by OrderNo) from #temp1) select * from x where rn > 1 </code></pre> you can remove duplicates by replacing select statement by <pre class="prettyprint"><code>delete x where rn > 1 </code></pre>

Find duplicate records in a table using SQL Server

Tags:

sql

sql-server

sql-server-2005

I am validating a table which has a transaction level data of an eCommerce site and find the exact errors.

I want your help to find duplicate records in a 50 column table on SQL Server.

Suppose my data is:

OrderNo shoppername amountpayed city Item        1       Sam         10          A    Iphone 1       Sam         10          A    Iphone--->>Duplication to be detected 1       Sam         5           A    Ipod 2       John        20          B    Macbook 3       John        25          B    Macbookair 4       Jack        5           A    Ipod

Suppose I use the below query:

Select shoppername,count(*) as cnt from dbo.sales having count(*) > 1 group by shoppername

will return me

Sam  2 John 2

But I don't want to find duplicate just over 1 or 2 columns. I want to find the duplicate over all the columns together in my data. I want the result as:

1       Sam         10          A    Iphone

493

asked Mar 24 '12 07:03

Sahil

2 Answers

with x as   (select  *,rn = row_number()             over(PARTITION BY OrderNo,item  order by OrderNo)             from    #temp1)  select * from x where rn > 1

you can remove duplicates by replacing select statement by

delete x where rn > 1

answered Sep 16 '22 22:09

Sathya Narayanan

SELECT OrderNo, shoppername, amountPayed, city, item, count(*) as cnt FROM dbo.sales GROUP BY OrderNo, shoppername, amountPayed, city, item HAVING COUNT(*) > 1

answered Sep 19 '22 22:09

Eugene

Related questions
                            
                                MySQL Group By and Sum total value of other column
                            
                                SQL: How To Select Earliest Row
                            
                                Get AVG ignoring Null or Zero values
                            
                                How to delete a range of records at once on MySQL?
                            
                                Get the list of stored procedures created and / or modified on a particular date?
                            
                                Hadoop/Hive : Loading data from .csv on a local machine
                            
                                Select Rows with id having even number
                            
                                Index for multiple columns in ActiveRecord
                            
                                Oracle - Why does the leading zero of a number disappear when converting it TO_CHAR
                            
                                Indexes and multi column primary keys
                            
                                Retrieve column names and types of a stored procedure? [duplicate]
                            
                                MySQL select records for duplicates using multiple columns
                            
                                How to add a Try/Catch to SQL Stored Procedure
                            
                                concatenate two database columns into one resultset column
                            
                                How to compare strings in sql ignoring case?
                            
                                SQL - Difference between COALESCE and ISNULL? [duplicate]
                            
                                Sorting by date & time in descending order?
                            
                                Update if different/changed
                            
                                How to use SQL LIKE condition with multiple values in PostgreSQL?
                            
                                SQL Server giving logins(users) db_owner access to database

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With