I've seen a number of variations on this but nothing quite matches what I'm trying to accomplish. I have a table, <code>TableA</code>, which contain the answers given by users to configurable questionnaires. The columns are <code>member_id, quiz_num, question_num, answer_num</code>. Somehow a few members got their answers submitted twice. So I need to remove the duplicated records, but make sure that one row is left behind. There is no primary column so there could be two or three rows all with the exact same data. Is there a query to remove all the duplicates?

Instead of <code>drop table TableA</code>, you could delete all registers (<code>delete from TableA;</code>) and then populate original table with registers coming from TableA_Verify (<code>insert into TAbleA select * from TAbleA_Verify</code>). In this way you won't lost all references to original table (indexes,... ) <pre class="prettyprint"><code>CREATE TABLE TableA_Verify AS SELECT DISTINCT * FROM TableA; DELETE FROM TableA; INSERT INTO TableA SELECT * FROM TAbleA_Verify; DROP TABLE TableA_Verify; </code></pre>

This doesn't use TEMP Tables, but real tables instead. If the problem is just about temp tables and not about table creation or dropping tables, this will work: <pre class="prettyprint"><code>SELECT DISTINCT * INTO TableA_Verify FROM TableA; DROP TABLE TableA; RENAME TABLE TableA_Verify TO TableA; </code></pre>

Thanks to jveirasv for the answer above. If you need to remove duplicates of a specific sets of column, you can use this (if you have a timestamp in the table that vary for example) <pre class="prettyprint"><code>CREATE TABLE TableA_Verify AS SELECT * FROM TableA WHERE 1 GROUP BY [COLUMN TO remove duplicates BY]; DELETE FROM TableA; INSERT INTO TableA SELECT * FROM TAbleA_Verify; DROP TABLE TableA_Verify; </code></pre>

An alternative way would be to create a new temporary table with same structure. <pre class="prettyprint"><code>CREATE TABLE temp_table AS SELECT * FROM original_table LIMIT 0 </code></pre> Then create the primary key in the table. <pre class="prettyprint"><code>ALTER TABLE temp_table ADD PRIMARY KEY (primary-key-field) </code></pre> Finally copy all records from the original table while ignoring the duplicate records. <pre class="prettyprint"><code>INSERT IGNORE INTO temp_table AS SELECT * FROM original_table </code></pre> Now you can delete the original table and rename the new table. <pre class="prettyprint"><code>DROP TABLE original_table RENAME TABLE temp_table TO original_table </code></pre>

Tested in mysql 5.Dont know about other versions. If you want to keep the row with the lowest id value: <pre class="prettyprint"><code>DELETE n1 FROM 'yourTableName' n1, 'yourTableName' n2 WHERE n1.id > n2.id AND n1.member_id = n2.member_id and n1.answer_num =n2.answer_num </code></pre> If you want to keep the row with the highest id value: <pre class="prettyprint"><code>DELETE n1 FROM 'yourTableName' n1, 'yourTableName' n2 WHERE n1.id < n2.id AND n1.member_id = n2.member_id and n1.answer_num =n2.answer_num </code></pre>

How do I delete all the duplicate records in a MySQL table without temp tables

Video Answer

9 Answers

Add Unique Index on your table:

ALTER IGNORE TABLE `TableA`   
ADD UNIQUE INDEX (`member_id`, `quiz_num`, `question_num`, `answer_num`);

Another way to do this would be:

Add primary key in your table then you can easily remove duplicates from your table using the following query:

DELETE FROM member  
WHERE id IN (SELECT * 
             FROM (SELECT id FROM member 
                   GROUP BY member_id, quiz_num, question_num, answer_num HAVING (COUNT(*) > 1)
                  ) AS A
            );

answered Oct 02 '22 01:10

Saharsh Shah

Instead of drop table TableA, you could delete all registers (delete from TableA;) and then populate original table with registers coming from TableA_Verify (insert into TAbleA select * from TAbleA_Verify). In this way you won't lost all references to original table (indexes,... )

CREATE TABLE TableA_Verify AS SELECT DISTINCT * FROM TableA;

DELETE FROM TableA;

INSERT INTO TableA SELECT * FROM TAbleA_Verify;

DROP TABLE TableA_Verify;

answered Oct 02 '22 00:10

jveirasv

This doesn't use TEMP Tables, but real tables instead. If the problem is just about temp tables and not about table creation or dropping tables, this will work:

SELECT DISTINCT * INTO TableA_Verify FROM TableA;

DROP TABLE TableA;

RENAME TABLE TableA_Verify TO TableA;

answered Oct 02 '22 02:10

christoph

Thanks to jveirasv for the answer above.

If you need to remove duplicates of a specific sets of column, you can use this (if you have a timestamp in the table that vary for example)

CREATE TABLE TableA_Verify AS SELECT * FROM TableA WHERE 1 GROUP BY [COLUMN TO remove duplicates BY];

DELETE FROM TableA;

INSERT INTO TableA SELECT * FROM TAbleA_Verify;

DROP TABLE TableA_Verify;

answered Oct 02 '22 01:10

nikolais

Add Unique Index on your table:

ALTER IGNORE TABLE TableA   
ADD UNIQUE INDEX (member_id, quiz_num, question_num, answer_num);

is work very well

answered Oct 02 '22 02:10

Dina Elwy

If you are not using any primary key, then execute following queries at one single stroke. By replacing values:

# table_name - Your Table Name
# column_name_of_duplicates - Name of column where duplicate entries are found

create table table_name_temp like table_name;
insert into table_name_temp select distinct(column_name_of_duplicates),value,type from table_name group by column_name_of_duplicates;
delete from table_name;
insert into table_name select * from table_name_temp;
drop table table_name_temp

create temporary table and store distinct(non duplicate) values
make empty original table
insert values to original table from temp table
delete temp table

It is always advisable to take backup of database before you play with it.

answered Oct 02 '22 02:10

Sandesh Mhatre

As noted in the comments, the query in Saharsh Shah's answer must be run multiple times if items are duplicated more than once.

Here's a solution that doesn't delete any data, and keeps the data in the original table the entire time, allowing for duplicates to be deleted while keeping the table 'live':

alter table tableA add column duplicate tinyint(1) not null default '0';

update tableA set
duplicate=if(@member_id=member_id
             and @quiz_num=quiz_num
             and @question_num=question_num
             and @answer_num=answer_num,1,0),
member_id=(@member_id:=member_id),
quiz_num=(@quiz_num:=quiz_num),
question_num=(@question_num:=question_num),
answer_num=(@answer_num:=answer_num)
order by member_id, quiz_num, question_num, answer_num;

delete from tableA where duplicate=1;

alter table tableA drop column duplicate;

This basically checks to see if the current row is the same as the last row, and if it is, marks it as duplicate (the order statement ensures that duplicates will show up next to each other). Then you delete the duplicate records. I remove the duplicate column at the end to bring it back to its original state.

It looks like alter table ignore also might go away soon: http://dev.mysql.com/worklog/task/?id=7395

answered Oct 02 '22 01:10

juacala

An alternative way would be to create a new temporary table with same structure.

CREATE TABLE temp_table AS SELECT * FROM original_table LIMIT 0

Then create the primary key in the table.

ALTER TABLE temp_table ADD PRIMARY KEY (primary-key-field)

Finally copy all records from the original table while ignoring the duplicate records.

INSERT IGNORE INTO temp_table AS SELECT * FROM original_table

Now you can delete the original table and rename the new table.

DROP TABLE original_table
RENAME TABLE temp_table TO original_table

answered Oct 02 '22 01:10

user1838915

Tested in mysql 5.Dont know about other versions. If you want to keep the row with the lowest id value:

DELETE n1 FROM 'yourTableName' n1, 'yourTableName' n2 WHERE n1.id > n2.id AND n1.member_id = n2.member_id and n1.answer_num =n2.answer_num

If you want to keep the row with the highest id value:

DELETE n1 FROM 'yourTableName' n1, 'yourTableName' n2 WHERE n1.id < n2.id AND n1.member_id = n2.member_id and n1.answer_num =n2.answer_num

answered Oct 02 '22 02:10

TanvirChowdhury

Related questions
                            
                                PHP Check for NULL
                            
                                What's better - many small tables or one big table?
                            
                                Database modeling for international and multilingual purposes
                            
                                mysql select sum group by date
                            
                                Getting MySQL working on OSX 10.7 Lion
                            
                                How to get the difference in years from two different dates?
                            
                                selecting rows with id from another table
                            
                                mysqli_fetch_array() expects parameter 1 to be mysqli_result, boolean given in [duplicate]
                            
                                What is Drupal's default password encryption method?
                            
                                SQLSTATE[HY000] [2002] php_network_getaddresses: getaddrinfo failed: Name or service not known
                            
                                How to find nearest location using latitude and longitude from SQL database?
                            
                                Update values incrementally in mysql
                            
                                MySQL count columns on specific value
                            
                                How to bulk update mysql data with one query?
                            
                                Is there any way to show progress on a `gunzip < database.sql.gz | mysql ...` process?
                            
                                How do I count columns of a table
                            
                                Only show hours in MYSQL DATEDIFF
                            
                                Using Docker I get the error: "SQLSTATE[HY000] [2002] No such file or directory"
                            
                                MySQL UPDATE with random number between 1-3
                            
                                MySQL SELECT increment counter

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do I delete all the duplicate records in a MySQL table without temp tables

Tags:

sql

mysql

duplicates

sql-delete

unique-index

MivaScott

People also ask

Video Answer

9 Answers

Saharsh Shah

jveirasv

christoph

nikolais

Dina Elwy

Sandesh Mhatre

juacala

user1838915

TanvirChowdhury

Recent Activity

Donate For Us