MySQL queries on two different indexes fast, but combined into one query slow. Why?

Tags:

mysql

I have a table with 2 million rows. I have two index (status, gender) and also (birthday).

I find strange that this query is taking 3.6 seconds or more QUERY N° 1

SELECT COUNT(*) FROM ts_user_core
WHERE birthday BETWEEN '1980-01-01' AND '1985-01-01'
    AND status='ok' AND gender='female';

same for this: QUERY N° 2

SELECT COUNT(*) FROM ts_user_core
WHERE status='ok' AND gender='female'
    AND birthday between '1980-01-01' AND '1985-01-01';

While this query is taking 0.140 seconds QUERY N° 3

select count(*) from ts_user_core where (birthday between '1990-01-01' and '2000-01-01');

Also this query takes 0.2 seconds QUERY N° 4

select count(*) from ts_user_core where status='ok' and gender='female'

I expect the first query to be way more faster, how can be possible this behavior? I can't handle so much time for this query.

Here the result of: enter image description here

I know that I can add a new index with 3 columns, but is there a way to have a faster query without adding an index for every where clause?

Thanks for your advice

488

asked Apr 28 '15 13:04

1 Answers

is there a way to optimize the query without adding an index for every possible where clause?

Yes, somewhat. But it takes an understanding of how INDEXes work.

Let's look at all the SELECTs you have presented so far.

To build the optimal index for a SELECT, start with all the = constant items in the WHERE clause. Put those columns into an index in any order. That gives us INDEX(status, gender, ...) or INDEX(gender, status, ...), but nothing deciding between them (yet).
add on one range or all the ORDER BY. In your first couple of SELECTs, that would be birthday. Now we have INDEX(status, gender, birthday) or INDEX(gender, status, birthday). Either of these is 'best' for the first two SELECTs.

Those indexes work quite well for #4: select count(*) from ts_user_core where status='ok' and gender='female', too. So no extra index needed for it.

Now, let's work on #3: select count(*) from ts_user_core where (birthday between '1990-01-01' and '2000-01-01');

It cannot use the indexes we have so far.
INDEX(birthday) is essentially the only choice.

Now, suppose we also had ... WHERE status='foo'; (without gender). That would force us to pick INDEX(status, gender, birthday) instead of the variant of it.

Result: 2 good indexes to handle all 5 selects:

INDEX(status, gender, birthday)
INDEX(birthday)

Suggestion: If you end up with more than 5 INDEXes or an index with more than 5 columns in it, it is probably wise to shorten some indexes. Here is where things get really fuzzy. If you would like to present me with a dozen 'realistic' indexes, I'll walk you through it.

Notes on other comments:

For timing, run each query twice and take the second time -- to avoid caching effects. (Your 3.6 vs 0.140 smells like caching of the index.)
For timing, turn off the Query cache or use SQL_NO_CACHE.
The optimizer rarely uses two indexes in a single query.
Show us the EXPLAIN plain; we can help you read it.
The extra time taken to pick among multiple INDEXes is usually worth it.
If you have INDEX(a,b,c), you don't need INDEX(a,b).

189

answered Oct 29 '22 16:10

Rick James

Related questions
                            
                                Limit a LEFT JOIN Subquery to 1 result
                            
                                MySQLdb and Python ImportError
                            
                                Use SQL keyword as alias name of a column
                            
                                Laravel many to many selecting with Eloquent
                            
                                Call a PHP function multiple times creates MySQL error
                            
                                Only run SQL query if condition met
                            
                                How to make Tree structure with codeigniter and mysql dynamically
                            
                                Laravel view is not working
                            
                                Migrating from MySQL to Crate
                            
                                How to use of AND and OR operator in same query in sequelize?
                            
                                What is the difference between FULLTEXT and FULLTEXT KEY/INDEX?
                            
                                Flask foreign_keys still shows AmbiguousForeignKeysError
                            
                                Codeigniter shows blank page with no error
                            
                                MySQL Group By not working in subquery
                            
                                Docker - Connect Apache Tomcat web server to MySQL server
                            
                                Optimal way to store BLOBs larger than max_allowed_packet in MySQL InnoDB
                            
                                Finding latest message from table, grouped by user in mysql
                            
                                SQL Update and replace substring [duplicate]
                            
                                How to obtain and process mysql records using Airflow?
                            
                                Mysql 8 remote access

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

MySQL queries on two different indexes fast, but combined into one query slow. Why?

Tags:

performance

mysql

Stefano Giacone

People also ask

1 Answers

Rick James

Recent Activity

Donate For Us