Is it possible to build a single mysql query (without variables) to remove all records from the table, except latest N (sorted by id desc)? Something like this, only it doesn't work :) <pre class="prettyprint"><code>delete from table order by id ASC limit ((select count(*) from table ) - N) </code></pre> Thanks.

You cannot delete the records that way, the main issue being that you cannot use a subquery to specify the value of a LIMIT clause. This works (tested in MySQL 5.0.67): <pre class="prettyprint"><code>DELETE FROM `table` WHERE id NOT IN ( SELECT id FROM ( SELECT id FROM `table` ORDER BY id DESC LIMIT 42 -- keep this many records ) foo ); </code></pre> The intermediate subquery is required. Without it we'd run into two errors: <ol> <li> SQL Error (1093): You can't specify target table 'table' for update in FROM clause - MySQL doesn't allow you to refer to the table you are deleting from within a direct subquery.</li> <li> SQL Error (1235): This version of MySQL doesn't yet support 'LIMIT & IN/ALL/ANY/SOME subquery' - You can't use the LIMIT clause within a direct subquery of a NOT IN operator.</li> </ol> Fortunately, using an intermediate subquery allows us to bypass both of these limitations. <hr> Nicole has pointed out this query can be optimised significantly for certain use cases (such as this one). I recommend reading that answer as well to see if it fits yours.

I know I'm resurrecting quite an old question, but I recently ran into this issue, but needed something that scales to large numbers well. There wasn't any existing performance data, and since this question has had quite a bit of attention, I thought I'd post what I found. The solutions that actually worked were the Alex Barrett's double sub-query/<code>NOT IN</code> method (similar to Bill Karwin's), and Quassnoi's <code>LEFT JOIN</code> method. Unfortunately both of the above methods create very large intermediate temporary tables and performance degrades quickly as the number of records not being deleted gets large. What I settled on utilizes Alex Barrett's double sub-query (thanks!) but uses <code><=</code> instead of <code>NOT IN</code>: <pre class="prettyprint"><code>DELETE FROM `test_sandbox` WHERE id <= ( SELECT id FROM ( SELECT id FROM `test_sandbox` ORDER BY id DESC LIMIT 1 OFFSET 42 -- keep this many records ) foo ); </code></pre> It uses <code>OFFSET</code> to get the id of the Nth record and deletes that record and all previous records. Since ordering is already an assumption of this problem (<code>ORDER BY id DESC</code>), <code><=</code> is a perfect fit. It is much faster, since the temporary table generated by the subquery contains just one record instead of N records. <h3>Test case</h3> I tested the three working methods and the new method above in two test cases. Both test cases use 10000 existing rows, while the first test keeps 9000 (deletes the oldest 1000) and the second test keeps 50 (deletes the oldest 9950). <pre class="prettyprint"><code>+-----------+------------------------+----------------------+ | | 10000 TOTAL, KEEP 9000 | 10000 TOTAL, KEEP 50 | +-----------+------------------------+----------------------+ | NOT IN | 3.2542 seconds | 0.1629 seconds | | NOT IN v2 | 4.5863 seconds | 0.1650 seconds | | <=,OFFSET | 0.0204 seconds | 0.1076 seconds | +-----------+------------------------+----------------------+ </code></pre> What's interesting is that the <code><=</code> method sees better performance across the board, but actually gets better the more you keep, instead of worse.

SQL query: Delete all records from the table except latest N?

Tags:

sql

mysql

sql-delete

Is it possible to build a single mysql query (without variables) to remove all records from the table, except latest N (sorted by id desc)?

Something like this, only it doesn't work :)

delete from table order by id ASC limit ((select count(*) from table ) - N)

Thanks.

692

asked Feb 23 '09 18:02

serg

2 Answers

You cannot delete the records that way, the main issue being that you cannot use a subquery to specify the value of a LIMIT clause.

This works (tested in MySQL 5.0.67):

DELETE FROM `table` WHERE id NOT IN (   SELECT id   FROM (     SELECT id     FROM `table`     ORDER BY id DESC     LIMIT 42 -- keep this many records   ) foo );

The intermediate subquery is required. Without it we'd run into two errors:

SQL Error (1093): You can't specify target table 'table' for update in FROM clause - MySQL doesn't allow you to refer to the table you are deleting from within a direct subquery.
SQL Error (1235): This version of MySQL doesn't yet support 'LIMIT & IN/ALL/ANY/SOME subquery' - You can't use the LIMIT clause within a direct subquery of a NOT IN operator.

Fortunately, using an intermediate subquery allows us to bypass both of these limitations.

Nicole has pointed out this query can be optimised significantly for certain use cases (such as this one). I recommend reading that answer as well to see if it fits yours.

answered Sep 20 '22 09:09

Alex Barrett

I know I'm resurrecting quite an old question, but I recently ran into this issue, but needed something that scales to large numbers well. There wasn't any existing performance data, and since this question has had quite a bit of attention, I thought I'd post what I found.

The solutions that actually worked were the Alex Barrett's double sub-query/NOT IN method (similar to Bill Karwin's), and Quassnoi's LEFT JOIN method.

Unfortunately both of the above methods create very large intermediate temporary tables and performance degrades quickly as the number of records not being deleted gets large.

What I settled on utilizes Alex Barrett's double sub-query (thanks!) but uses <= instead of NOT IN:

DELETE FROM `test_sandbox`   WHERE id <= (     SELECT id     FROM (       SELECT id       FROM `test_sandbox`       ORDER BY id DESC       LIMIT 1 OFFSET 42 -- keep this many records     ) foo   );

It uses OFFSET to get the id of the Nth record and deletes that record and all previous records.

Since ordering is already an assumption of this problem (ORDER BY id DESC), <= is a perfect fit.

It is much faster, since the temporary table generated by the subquery contains just one record instead of N records.

Test case

I tested the three working methods and the new method above in two test cases.

Both test cases use 10000 existing rows, while the first test keeps 9000 (deletes the oldest 1000) and the second test keeps 50 (deletes the oldest 9950).

+-----------+------------------------+----------------------+ |           | 10000 TOTAL, KEEP 9000 | 10000 TOTAL, KEEP 50 | +-----------+------------------------+----------------------+ | NOT IN    |         3.2542 seconds |       0.1629 seconds | | NOT IN v2 |         4.5863 seconds |       0.1650 seconds | | <=,OFFSET |         0.0204 seconds |       0.1076 seconds | +-----------+------------------------+----------------------+

What's interesting is that the <= method sees better performance across the board, but actually gets better the more you keep, instead of worse.

answered Sep 22 '22 09:09

Nicole

Related questions
                            
                                Invalid default value for 'dateAdded'
                            
                                How to part DATE and TIME from DATETIME in MySQL
                            
                                MySQL Error 1264: out of range value for column
                            
                                How do you force mysql LIKE to be case sensitive? [duplicate]
                            
                                Delete from two tables in one query
                            
                                SELECT INTO Variable in MySQL DECLARE causes syntax error?
                            
                                Project Links do not work on Wamp Server
                            
                                Error when trying to install app with mysql2 gem
                            
                                Does adding 'LIMIT 1' to MySQL queries make them faster when you know there will only be 1 result?
                            
                                When to use STRAIGHT_JOIN with MySQL
                            
                                How to connect to MySQL Database?
                            
                                SQL split values to multiple rows
                            
                                How to cast DATETIME as a DATE in mysql?
                            
                                Amazon EC2, mysql aborting start because InnoDB: mmap (x bytes) failed; errno 12
                            
                                Amazon RDS: Restore snapshot to existing instance
                            
                                MySQL - how many rows can I insert in one single INSERT statement?
                            
                                How to change the column position of MySQL table without losing column data?
                            
                                Does MySQL included with MAMP not include a config file?
                            
                                How to figure out size of Indexes in MySQL
                            
                                Create a temporary table in MySQL with an index from a select

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With