SQL query to find duplicates

Tags: sql, duplicates

I am trying to write a query in SQL Server to find hash values that appear in more than one row.
I need all file names whose hash value is duplicated.

The result should be (based on my example below):

003B4C68BC143B0290E04432A3A96092    File0003.jpg
003B4C68BC143B0290E04432A3A96092    File0004.jpg
003B4C68BC143B0290E04432A3A96092    File0005.jpg

Please let me know.

Here is the table structure:

File table
------------------------------------------------
hash                                FileName
------------------------------------------------
000341A486F5492877D588BED0806650    File0001.jpg
00363EF2ECEEA32F10176EB64A50283F    File0002.jpg
003B4C68BC143B0290E04432A3A96092    File0003.jpg
003B4C68BC143B0290E04432A3A96092    File0004.jpg
003B4C68BC143B0290E04432A3A96092    File0005.jpg

asked May 22 '13 09:05

John Doe

2 Answers

-- FILE is a reserved keyword in SQL Server, so the table name needs brackets
select *
from [File]
where hash in (select hash
               from [File]
               group by hash
               having count(*) > 1)
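
To try this against the sample data from the question, here is a minimal setup sketch. The column types (CHAR(32), VARCHAR(255)) are assumptions, since the question does not state them, and the ORDER BY is only there to make the output order predictable.

```sql
-- Assumed column types; the question only shows the column names.
CREATE TABLE [File] (
    hash     CHAR(32)     NOT NULL,
    FileName VARCHAR(255) NOT NULL
);

-- Sample rows copied from the question.
INSERT INTO [File] (hash, FileName) VALUES
    ('000341A486F5492877D588BED0806650', 'File0001.jpg'),
    ('00363EF2ECEEA32F10176EB64A50283F', 'File0002.jpg'),
    ('003B4C68BC143B0290E04432A3A96092', 'File0003.jpg'),
    ('003B4C68BC143B0290E04432A3A96092', 'File0004.jpg'),
    ('003B4C68BC143B0290E04432A3A96092', 'File0005.jpg');

-- Keep only rows whose hash occurs more than once in the table.
SELECT hash, FileName
FROM   [File]
WHERE  hash IN (SELECT hash
                FROM   [File]
                GROUP  BY hash
                HAVING COUNT(*) > 1)
ORDER  BY FileName;
```

With these five rows, the query should return File0003.jpg, File0004.jpg and File0005.jpg, all with hash 003B4C68BC143B0290E04432A3A96092.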

answered Sep 23 '22 15:09

Raphaël Althaus

You can use EXISTS to check for duplicates:

SELECT  a.*
FROM    [File] a
WHERE   EXISTS
        (
            SELECT  1
            FROM    [File] b
            WHERE   a.hash = b.hash
            GROUP   BY b.hash
            HAVING  COUNT(*) > 1
        )

or an INNER JOIN against the set of duplicated hashes:

SELECT  a.*
FROM    [File] a
        INNER JOIN
        (
            SELECT  hash
            FROM    [File]
            GROUP   BY hash
            HAVING  COUNT(*) > 1
        ) b ON  a.hash = b.hash

answered Sep 26 '22 15:09

John Woo
