
LIMIT 1 is very slow, for specific records, using different keys

I am diagnosing an intermittent slow query, and have found a strange behaviour in MySQL I cannot explain. It's choosing a different, non-optimal key strategy for one specific case, only when doing a LIMIT 1.

Table (some unreferenced data columns removed for brevity)

CREATE TABLE `ch_log` (
    `cl_id` BIGINT(20) NOT NULL AUTO_INCREMENT,
    `cl_unit_id` INT(11) NOT NULL DEFAULT '0',
    `cl_date` DATETIME NOT NULL DEFAULT '0000-00-00 00:00:00',
    `cl_type` CHAR(1) NOT NULL DEFAULT '',
    `cl_data` TEXT NOT NULL,
    `cl_event` VARCHAR(255) NULL DEFAULT NULL,
    `cl_timestamp` TIMESTAMP NOT NULL DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP,
    `cl_record_status` CHAR(1) NOT NULL DEFAULT 'a',
    PRIMARY KEY (`cl_id`),
    INDEX `cl_type` (`cl_type`),
    INDEX `cl_date` (`cl_date`),
    INDEX `cl_event` (`cl_event`),
    INDEX `cl_unit_id` (`cl_unit_id`),
    INDEX `log_type_unit_id` (`cl_unit_id`, `cl_type`),
    INDEX `unique_user` (`cl_user_number`, `cl_unit_id`)  -- cl_user_number is one of the data columns removed above for brevity
)
ENGINE=InnoDB
AUTO_INCREMENT=419582094;

This is the query, which runs slowly only for one specific cl_unit_id:

EXPLAIN
SELECT *
FROM `ch_log`
WHERE cl_type = 'I' AND cl_event = 'G'
AND cl_unit_id=1234
ORDER BY cl_date DESC 
LIMIT 1;
id|select_type|table |type |possible_keys                               |key    |key_len|ref|rows|Extra
1 |SIMPLE     |ch_log|index|cl_type,cl_event,cl_unit_id,log_type_unit_id|cl_date|8      |\N |5295|Using where

For all other values of cl_unit_id it uses the log_type_unit_id key which is much faster.

id|select_type|table |type|possible_keys                                           |key             |key_len|ref        |rows|Extra
1 |SIMPLE     |ch_log|ref |cl_type,cl_event,cl_unit_id,log_type_unit_id|log_type_unit_id|5      |const,const|3804|Using where; Using filesort
  • Queries for all other units take about 0.01 seconds.
  • The query for the "slow unit" takes 10-15 minutes!

I can't see anything strange about the data for this 'unit':

  • Unit 1234 only has 6 records of type I and event G.
  • Other units have many more.
  • Unit 1234 only has 32,000 logs in total which is typical.
  • The data itself is normal: no bigger or older than other units'.
  • There are around 3,000 "units" in the database, representing devices that log data. cl_unit_id is their unique identifier (though no constraint enforces it).

General info

  • There are 30 million records in total, around 12 GB
  • MySQL 5.1.69-log
  • CentOS, 64-bit
  • The data is gradually changing (30 million rows ≈ 3 months of logs), but I don't know if this has happened before

Things I've tried, and can "solve" the problem with:

  1. Removing the LIMIT 1 - the query runs in milliseconds and returns the data.

  2. Changing to LIMIT 2 or other combinations e.g. 2,3 - runs in milliseconds.

  3. Adding an index hint solves it:

    FROM `ch_log` USE INDEX (log_type_unit_id)
    

    but... I don't want to hard-code this into the application.

  4. Adding a second order by on the primary key also "solves" it:

    ORDER BY cl_id, cl_date DESC 
    

    giving explain:

    id|select_type|table |type|possible_keys                                           |key             |key_len|ref        |rows|Extra
    1 |SIMPLE     |ch_log|ref |cl_type,cl_event,cl_unit_id,log_type_unit_id|log_type_unit_id|5      |const,const|6870|Using where
    

    which is slightly different from the index-hinted plan, examining more rows (6,870) but still running in tens of milliseconds.

Again I could do this, but I don't like using side-effects I don't understand.
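For reference, here are workarounds 3 and 4 written out as complete statements. These are reconstructions from the fragments above, using the cl_* column names from the schema (the original post mixes ch_log_* and cl_* prefixes):

```sql
-- Workaround 3: force the fast composite index explicitly
SELECT *
FROM `ch_log` USE INDEX (log_type_unit_id)
WHERE cl_type = 'I' AND cl_event = 'G'
  AND cl_unit_id = 1234
ORDER BY cl_date DESC
LIMIT 1;

-- Workaround 4: add a second ORDER BY column, which steers the
-- optimizer away from walking the cl_date index backwards
SELECT *
FROM `ch_log`
WHERE cl_type = 'I' AND cl_event = 'G'
  AND cl_unit_id = 1234
ORDER BY cl_id, cl_date DESC
LIMIT 1;
```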

So I think my main questions are:

a) why does it only happen for LIMIT 1?

b) how can the data itself affect the key strategy so much? And what aspect of the data, given that the quantity and spread in the indexes seem typical?

Asked by scipilot, Jul 15 '15


1 Answer

MySQL picks an execution plan and uses different indexes depending on what it thinks is statistically the best choice. For all of your first observations, this is the answer:

  1. Removing the LIMIT 1 - the query runs in milliseconds and returns the data. -> Yes: check it, and you'll see the execution plan is good.
  2. Changing to LIMIT 2 or other combinations e.g. 2,3 - runs in milliseconds. -> The same applies. The optimizer chooses a different index because the expected block reads suddenly became twice as large as with LIMIT 1 (that's just one possibility).
  3. Adding an index hint solves it -> Of course: you force a good execution plan.
  4. Adding a second ORDER BY on the primary key also "solves" it -> Yes, because by coincidence the result is a better execution plan.

Now, that only answers half of the questions.

a) why does it only happen for LIMIT 1?

It actually happens not only because of LIMIT 1, but because of:

  • The statistical distribution of your data (which drives the optimizer's decisions)
  • Your ORDER BY ... DESC clause. Try with ORDER BY ... ASC and you will probably see an improvement too.
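A quick sketch of that experiment, reusing the question's query. Note that this returns the oldest matching row rather than the newest, so it is a diagnostic for the plan choice, not a drop-in replacement:

```sql
-- Ascending order lets the optimizer scan the index forwards
SELECT *
FROM `ch_log`
WHERE cl_type = 'I' AND cl_event = 'G'
  AND cl_unit_id = 1234
ORDER BY cl_date ASC
LIMIT 1;
```

Comparing EXPLAIN output for the ASC and DESC variants shows whether the backward scan on cl_date is what triggers the bad plan.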

This phenomenon is well acknowledged. Please read on.

One of the accepted solutions (toward the bottom of the article) is to force the index the same way you did. Yes, sometimes it is justified; otherwise, index hints would have been removed from MySQL long ago. Robots cannot always be perfect :-)

b) how can the data itself affect the key strategy so much? And what aspect of the data, given that the quantity and spread in the indexes seem typical?

You said it: the spread is usually what trips the optimizer up. Not only can the optimizer make a wrong decision even with accurate statistics, it can also be completely off simply because the number of changed rows on the table is just below 1/16th of the total row count, the threshold at which this version of InnoDB refreshes its index statistics...
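If stale index statistics are indeed the culprit, one thing worth trying (an assumption on my part, not something the asker has confirmed) is forcing MySQL to resample them:

```sql
-- Resample the index statistics for the table
ANALYZE TABLE ch_log;

-- Inspect the Cardinality column afterwards to see what the
-- optimizer now believes about each index
SHOW INDEX FROM ch_log;
```

ANALYZE TABLE only samples a handful of index pages on InnoDB, so it is cheap even on a 30-million-row table, though it is no guarantee the optimizer will settle on the good plan.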

Answered by Sebas, Oct 19 '22