Deleting duplicate rows from sqlite database

I have a huge table (36 million rows) in SQLite3. The table has two columns:

hash - text
d - real

Some of the rows are duplicates: both hash and d have the same values. If two hashes are identical, then so are their values of d. However, two identical values of d do not imply two identical hashes.

I want to delete the duplicate rows. I don't have a primary key column.

What's the fastest way to do this?

asked Nov 18 '11 23:11

Patches

1 Answer

You need a way to distinguish the rows. Based on your comment, you could use the special rowid column for that.

To delete the duplicates while keeping the row with the lowest rowid for each (hash, d) pair:

delete  from YourTable
where   rowid not in
        (
        select  min(rowid)
        from    YourTable
        group by
                hash
        ,       d
        )
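
To see what this statement does, here is a minimal sketch that runs it on a toy in-memory table through Python's standard sqlite3 module. The table name YourTable comes from the answer; the five sample rows are made up for illustration and are nothing like the 36 million rows in the question, so this shows the behavior, not the performance.

```python
import sqlite3

# Toy in-memory database; column names follow the question,
# the sample rows are invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE YourTable (hash TEXT, d REAL)")
conn.executemany(
    "INSERT INTO YourTable (hash, d) VALUES (?, ?)",
    [("a", 1.0), ("a", 1.0), ("b", 2.0), ("c", 2.0), ("b", 2.0)],
)

# Keep only the lowest rowid in each (hash, d) group.
conn.execute(
    """
    DELETE FROM YourTable
    WHERE rowid NOT IN (
        SELECT MIN(rowid) FROM YourTable GROUP BY hash, d
    )
    """
)
conn.commit()

remaining = conn.execute(
    "SELECT rowid, hash, d FROM YourTable ORDER BY rowid"
).fetchall()
print(remaining)  # [(1, 'a', 1.0), (3, 'b', 2.0), (4, 'c', 2.0)]
```

Rows 2 and 5 are removed as duplicates of rows 1 and 3. Row 4 survives even though it shares d with row 3, because its hash differs.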

answered Sep 17 '22 21:09

Andomar

Tags: sql, database, sqlite
