
Is tuple comparison in MySQL efficient?

Tags:

mysql

tuples

sqlperformance

I have a table of books:

CREATE TABLE `books` (
    `id` INT(11) NOT NULL AUTO_INCREMENT,
    `nameOfBook` VARCHAR(32),
    `releaseDate` DATETIME NULL DEFAULT NULL,
    PRIMARY KEY (`id`),
    INDEX `Index 2` (`releaseDate`, `id`)
)
COLLATE='latin1_swedish_ci'
ENGINE=InnoDB
AUTO_INCREMENT=33029692;

I compared two SQL queries that paginate with a sort on releaseDate. Both queries return the same result.

(simple one)

select SQL_NO_CACHE id, name, releaseDate
from books
where releaseDate <= '2016-11-07'
  AND (releaseDate < '2016-11-07' OR id < 3338191)
ORDER by releaseDate DESC, id DESC
limit 50;

and

(tuple comparison, or row comparison)

select SQL_NO_CACHE id, name, releaseDate
from books
where (releaseDate, id) < ('2016-11-07', 3338191)
ORDER by releaseDate DESC, id DESC
limit 50;

When I run EXPLAIN on each query, I get the following.

Simple one:

"id";"select_type";"table";"type";"possible_keys";"key";"key_len";"ref";"rows";"Extra"
"1";"SIMPLE";"books";"range";"PRIMARY,Index 2";"Index 2";"9";"";"1015876";"Using where; Using index"

We can see it reports 1015876 rows to examine.

The EXPLAIN for the tuple comparison:

"id";"select_type";"table";"type";"possible_keys";"key";"key_len";"ref";"rows";"Extra"
"1";"SIMPLE";"books";"index";"";"Index 2";"13";"";"50";"Using where; Using index"

We can see it reports only 50 rows to examine.

But when I check the execution time, the simple one gives:

/* Affected rows: 0  Rows found: 50  Warnings: 0  Duration for 1 query: 0.031 sec. */

and the tuple one:

/* Affected rows: 0  Rows found: 50  Warnings: 0  Duration for 1 query: 3.682 sec. */

I don't understand why, according to EXPLAIN, the tuple comparison looks better, yet its execution time is so much worse.


asked Nov 08 '16 14:11

dop

1 Answer

I've been irritated by this for years. WHERE (a,b) > (1,2) has never been optimized, in spite of it being easily transformed into the other formulation. Even the other format was poorly optimized until a few years ago.

Using EXPLAIN FORMAT=JSON SELECT ... might give you some better clues.
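As a sketch of what that could look like for the tuple query from the question: the statement below only asks for the plan and does not run the query. The column list is trimmed to the two indexed columns, which is an assumption made so that `Index 2` can cover the query; the shape of the JSON output varies by server version.

```sql
-- Ask the optimizer for its plan in JSON form (available since MySQL 5.6).
-- Column list reduced to the indexed columns: an assumption for this sketch.
EXPLAIN FORMAT=JSON
SELECT id, releaseDate
FROM books
WHERE (releaseDate, id) < ('2016-11-07', 3338191)
ORDER BY releaseDate DESC, id DESC
LIMIT 50;
```

Running the same statement with the "simple one" WHERE clause gives a second plan to compare against.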

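Meanwhile, for the simple query, EXPLAIN ignored the LIMIT and suggested 1015876. For the tuple query, a type of "index" with no possible_keys means MySQL walks the index from one end and filters each entry with the WHERE clause, so the 50 reflects the LIMIT, not the number of entries actually read. In many cases, EXPLAIN provides a "decent" row estimate, but not for either of these.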

Feel free to file a bug report: http://bugs.mysql.com (and post the link here).

Another formulation was recently optimized, in spite of OR being historically un-optimizable.

where  releaseDate <  '2016-11-07'  
   OR (releaseDate  = '2016-11-07' AND id < 3338191)
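
Dropped into the paging query from the question, that WHERE clause would look roughly like the sketch below. It reuses the question's cursor values and selects only the indexed columns (an assumption, so that `Index 2` can cover the query). Whether the optimizer turns it into a range scan depends on the server version.

```sql
-- Keyset pagination sketch: fetch the 50 rows just before the cursor
-- ('2016-11-07', 3338191) in (releaseDate, id) order.
SELECT id, releaseDate
FROM books
WHERE releaseDate < '2016-11-07'
   OR (releaseDate = '2016-11-07' AND id < 3338191)
ORDER BY releaseDate DESC, id DESC
LIMIT 50;
```

For the next page, the releaseDate and id of the last row returned would replace the two literals.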

For measuring query optimizations, I like to do:

FLUSH STATUS;
SELECT ...
SHOW SESSION STATUS LIKE 'Handler%';

Small values, such as '50' for your case, indicate good optimization; large values (such as 1M) indicate a scan. The Handler numbers are exact, unlike the estimates in EXPLAIN.
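
Filled in for the tuple query from the question, the measurement might look like the sketch below. The column list is reduced to the indexed columns, which is an assumption for this example, and the counters are per session, so all three statements need to run on the same connection.

```sql
-- Reset this session's status counters (typically needs the RELOAD privilege).
FLUSH STATUS;

-- Run the query being measured; here, the tuple form from the question.
SELECT id, releaseDate
FROM books
WHERE (releaseDate, id) < ('2016-11-07', 3338191)
ORDER BY releaseDate DESC, id DESC
LIMIT 50;

-- Show how many rows the storage engine actually read for that query.
SHOW SESSION STATUS LIKE 'Handler%';
```

Repeating the three statements with the other WHERE clause gives a like-for-like comparison of the Handler_read counters.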

Update: MySQL 5.7.3 improved the handling of tuples, aka "row constructors".

Update: MySQL Bug#104128 covers this.


answered Sep 24 '22 20:09

Rick James
