How to speed up this SQL index query?

Given the following SQL table:

Employee(ssn, name, dept, manager, salary)

You discover that the following query is significantly slower than expected. There is an index on salary, and you have verified that the query plan is using it.

SELECT * 
FROM Employee
WHERE salary = 48000

Please give a possible reason why this query is slower than expected, and provide a tuning solution that addresses that reason.

I have two ideas for why this query is slower than expected. One is that we use SELECT * instead of SELECT Employee.salary, which would slow down the query because it must read all columns instead of one. The other is that the index on salary is non-clustered and we should use a clustered index: the company could be very large, so it would make sense to organize the table by the salary field.

Would either of those two solutions speed up this query? That is, would it help to change SELECT * to SELECT Employee.salary, or to explicitly make the index on salary clustered?

asked Apr 22 '17 20:04

ABlueCrayon

1 Answer

What indexes do you have now?

Is it really "slow"? What evidence do you have?
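
Both questions can be checked directly. The sketch below assumes MySQL, since the question does not name the RDBMS:

```sql
-- Show the table definition, including every index on it.
SHOW CREATE TABLE Employee;

-- How selective is the predicate? If a large share of the rows
-- have salary = 48000, the index lookup still has to fetch that many rows.
SELECT COUNT(*) AS matching_rows
FROM Employee
WHERE salary = 48000;

SELECT COUNT(*) AS total_rows
FROM Employee;
```

If matching_rows is a large fraction of total_rows, every match costs a separate hop from the index into the table data, and that alone can make an indexed query slow.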

Comments on "SELECT * instead of SELECT Employee.salary":

- SELECT * is bad form because tomorrow you might add a column, thereby breaking any code that expects a certain number of columns in a certain order.
- The choice between * and salary does not come into play until after the row(s) are located.
- Locating the row(s) is the costly part.
- On the other hand, if you have INDEX(salary) and the query reads only salary, then the index is "covering". That means the "data" (the other columns) does not need to be fetched, so the query is faster. But this is probably beyond what your teacher has covered so far.
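
A sketch of the covering case in MySQL syntax. The index names idx_salary and idx_salary_name are hypothetical, and the idea that name is the only other column you need is an assumption made for illustration:

```sql
-- Assumed to exist already (the name is hypothetical):
--   CREATE INDEX idx_salary ON Employee (salary);

-- Needs every column, so each matching index entry requires an
-- extra lookup into the table data.
SELECT * FROM Employee WHERE salary = 48000;

-- Reads only salary, so the index alone can answer it ("covering").
SELECT salary FROM Employee WHERE salary = 48000;

-- If the application also needs, say, name, a wider index
-- can cover that query too.
CREATE INDEX idx_salary_name ON Employee (salary, name);
SELECT salary, name FROM Employee WHERE salary = 48000;
```

In MySQL, EXPLAIN reports "Using index" in the Extra column when the chosen index is covering.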

Comments on "the index on salary is non-clustered, and we want to use a clustered index":

- In MySQL (not necessarily in other RDBMSs), an InnoDB table has exactly one PRIMARY KEY, and it is always UNIQUE and "clustered".
- That is, "clustered" implies "unique", which seems inappropriate for salary.
- In InnoDB, a "secondary key" implicitly includes the column(s) of the PK (ssn?), which it uses to reach over into the data.
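
To make these points concrete, here is a sketch of an InnoDB table definition. The column types and the index name are assumptions; the question gives only the column names.

```sql
CREATE TABLE Employee (
    ssn     CHAR(9)      NOT NULL,
    name    VARCHAR(100) NOT NULL,
    dept    VARCHAR(50),
    manager CHAR(9),
    salary  INT          NOT NULL,
    PRIMARY KEY (ssn),            -- unique; InnoDB clusters the rows on it
    INDEX idx_salary (salary)     -- secondary; entries are stored as (salary, ssn)
) ENGINE=InnoDB;

-- Both requested columns live in the secondary index, so this query
-- can be answered without touching the clustered data.
SELECT ssn, salary FROM Employee WHERE salary = 48000;
```

If you did want rows with equal salaries stored together, one option in InnoDB is a composite PRIMARY KEY (salary, ssn) plus a UNIQUE index on ssn. That keeps the key unique while ordering the data by salary. Whether the trade-off is worthwhile depends on the other queries that hit the table.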

"verified that the query plan" -- Have you learned about EXPLAIN SELECT ...?

More Tips on creating the optimal index for a given SELECT.

answered Sep 25 '22 10:09

Rick James
