<pre class="prettyprint"><code>mysql> CREATE TABLE `t` ( `id` int(11) NOT NULL, `a` int(11) DEFAULT NULL, `b` int(11) DEFAULT NULL, PRIMARY KEY (`id`), KEY `a` (`a`), KEY `b` (`b`) ) ENGINE=InnoDB </code></pre> there is a table named t and it has two indexes named a and b. Insert into t 100000 rows data <pre class="prettyprint"><code>mysql> create procedure idata() begin declare i int; set i=1; while(i<=100000)do insert into t values(i, i, i); set i=i+1; end while; end; Query OK, 0 rows affected (0.01 sec) mysql> delimiter ; mysql> call idata(); </code></pre> I do some experiments, some are as follows <img src="https://i.stack.imgur.com/EZLxj.png" alt="there are some experiments"> Now, i want to know; (1)why <code>explain select * from t where a >= 90000;</code> extra is <code>Using index condition</code>? it has index key, but it doesn't have index filter and table filter, so why it is <code>Using index condition</code>? (2)why <code>explain select * from t where a = 90000;</code> extra is <code>NULL</code>? is needs to have an access to the table,if the first case is <code>Using index condition</code>, why the second can't be <code>Using index condition</code>? (3)why <code>explain select a from t where a >= 90000;</code> extra is <code>Using where; Using index</code>? i know it uses cover index, so extra has <code>Using index</code>;but why extra has <code>Using where</code>? it means server needs to filter the data? but storage engine has already return the correct, why server needs to filer?

First, terminology... "Using index" means that the (in this case) <code>INDEX(a)</code> contains all the columns needed. That is "the index is covering". "Using index condition" is quite different. Internally, it is called ICP (Index Condition Pushdown). This refers to whether the "handler" checks the expression or whether the "condition" (a >= 90000) is handed off to the Engine (InnoDB) to do the work. As for "Using where"; that is still a mystery to me, even after using MySQL for 20 years and looking thousands of Explains. I ignore it. In all 3 of your cases, <code>INDEX(a)</code> is used. This is indicated primarily by "key" ("a"--the name of the key, not the column), "key_len" ("5": 4-byte <code>INT</code> plus 1 for <code>NULLable</code>), and secondarily by "type" (which does not say "All"). Further <ul> <li>If you change the 90000 to 70000, you may find that it will switch to a table scan. Why bounce back and forth between the Index's BTree and the data's BTree (via the <code>PRIMARY KEY</code>). The Optimizer will assume that it will be faster to simply scan all the table, ignoring the rows that fail the <code>WHERE</code> clause.</li> <li><code>EXPLAIN FORMAT=JSON SELECT</code> -- Gives you a lot more information. (Perhaps not much more info for this simple query.) One useful surprise is that it will show how many sorts the single mention of "filesort" really refers to. (A possibly easy way to make this happen is <code>GROUP BY x ORDER BY y</code>; that is group and order by different columns.)</li> <li>Explain rarely has such clean numbers, like your "10001". Usually, the "rows" columns is an approximation, sometimes a terrible approx.</li> <li>The slowlog records "Rows examined"; it will probably say 10001 (or maybe only 10000) and 1 for your tests. For a table scan, it would be a full 100K.</li> <li>Another way to get "Rows examined" is via the "Handler" <code>STATUS</code> values. See http://mysql.rjweb.org/doc.php/index_cookbook_mysql#handler_counts</li> </ul>

Why the EXTRA is NULL in Mysql EXPLAIN? Why >= is Using index condition?

Q: Are NULL values indexed in MySQL?

For example, MySQL can use indexes and ranges to search for NULL with IS NULL .

Q: Why do we use index in MySQL?

Indexes are used to find rows with specific column values quickly. Without an index, MySQL must begin with the first row and then read through the entire table to find the relevant rows. The larger the table, the more this costs.

Q: What is meant by NULL value in MySQL which operator is used with NULL value?

A field with a NULL value is a field with no value. If a field in a table is optional, it is possible to insert a new record or update a record without adding a value to this field. Then, the field will be saved with a NULL value. Note: A NULL value is different from a zero value or a field that contains spaces.

Tags:

mysql

innodb

explain

mysql> CREATE TABLE `t` (
     `id` int(11) NOT NULL,
     `a` int(11) DEFAULT NULL,
     `b` int(11) DEFAULT NULL,
     PRIMARY KEY (`id`),
     KEY `a` (`a`),
     KEY `b` (`b`)
   ) ENGINE=InnoDB

there is a table named t and it has two indexes named a and b. Insert into t 100000 rows data

mysql> create procedure idata()
  begin
   declare i int;
     set i=1;
     while(i<=100000)do
       insert into t values(i, i, i);
       set i=i+1;
     end while;
   end;
Query OK, 0 rows affected (0.01 sec)

mysql> delimiter ;
mysql> call idata();

I do some experiments, some are as follows

there are some experiments

Now, i want to know;

(1)why explain select * from t where a >= 90000; extra is Using index condition? it has index key, but it doesn't have index filter and table filter, so why it is Using index condition?

(2)why explain select * from t where a = 90000; extra is NULL? is needs to have an access to the table,if the first case is Using index condition, why the second can't be Using index condition?

(3)why explain select a from t where a >= 90000; extra is Using where; Using index? i know it uses cover index, so extra has Using index;but why extra has Using where? it means server needs to filter the data? but storage engine has already return the correct, why server needs to filer?

761

asked Jun 18 '20 16:06

Journey

Video Answer

1 Answers

First, terminology...

"Using index" means that the (in this case) INDEX(a) contains all the columns needed. That is "the index is covering".

"Using index condition" is quite different. Internally, it is called ICP (Index Condition Pushdown). This refers to whether the "handler" checks the expression or whether the "condition" (a >= 90000) is handed off to the Engine (InnoDB) to do the work.

As for "Using where"; that is still a mystery to me, even after using MySQL for 20 years and looking thousands of Explains. I ignore it.

In all 3 of your cases, INDEX(a) is used. This is indicated primarily by "key" ("a"--the name of the key, not the column), "key_len" ("5": 4-byte INT plus 1 for NULLable), and secondarily by "type" (which does not say "All").

Further

If you change the 90000 to 70000, you may find that it will switch to a table scan. Why bounce back and forth between the Index's BTree and the data's BTree (via the PRIMARY KEY). The Optimizer will assume that it will be faster to simply scan all the table, ignoring the rows that fail the WHERE clause.
EXPLAIN FORMAT=JSON SELECT -- Gives you a lot more information. (Perhaps not much more info for this simple query.) One useful surprise is that it will show how many sorts the single mention of "filesort" really refers to. (A possibly easy way to make this happen is GROUP BY x ORDER BY y; that is group and order by different columns.)
Explain rarely has such clean numbers, like your "10001". Usually, the "rows" columns is an approximation, sometimes a terrible approx.
The slowlog records "Rows examined"; it will probably say 10001 (or maybe only 10000) and 1 for your tests. For a table scan, it would be a full 100K.
Another way to get "Rows examined" is via the "Handler" STATUS values. See http://mysql.rjweb.org/doc.php/index_cookbook_mysql#handler_counts

158

answered Oct 22 '22 15:10

Rick James

Related questions
                            
                                Understanding Heroku and ClearDB max_questions resource error
                            
                                Soft deleting in pivot with sync method laravel
                            
                                How to reclaim MySql disk space
                            
                                PDO failing with too many records, buffered queries
                            
                                How to perform SELECT FOR UPDATE with SKIP LOCKED on MySQL with Django
                            
                                Mysql not running after upgrading mysql version
                            
                                Auto sync data between couch and mysql
                            
                                Transactions are not getting rollbacked with HikariCP
                            
                                C# DGV - Show Live MYSQL DB table
                            
                                Grails reconnect to mysql when having communications link failure
                            
                                Getting the "Records" and "Duplicates" counts of INSERT ... SELECT ... ON DUPLICATE KEY UPDATE
                            
                                "PROCEDURE ANALYSE" analogues in MySQL 8 and PostgreSQL
                            
                                MySQL yearweek() vs week() returning different result
                            
                                How to access secrets in node.js with Hashicorp Vault
                            
                                AWS MySQL RDS fail over - replication lag handling?
                            
                                Qt MySQL connect to Azure via SSL
                            
                                How to properly use Knex / Bookshelf with MySQL on RDS
                            
                                Laravel scout search in relationships
                            
                                How to properly connect MySQL database with C# application?
                            
                                Access denied for user 'root'@'localhost' in dbeaver

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With