Correct SQL index for Partition + Order to remove SORT

Tags:

I have a SQL Statement which i am trying to optimise to remove the sort operator

SELECT *,ROW_NUMBER() OVER (
        PARTITION BY RuleInstanceId 
        ORDER BY [Timestamp] DESC
   ) AS rn
FROM RuleInstanceHistoricalMembership

Everything I have read (eg. Optimizing SQL queries by removing Sort operator in Execution plan) suggests this is the correct index to add however it appears to have no effect at all.

CREATE NONCLUSTERED INDEX IX_MyIndex ON dbo.[RuleInstanceHistoricalMembership](RuleInstanceId, [Timestamp] DESC)

enter image description here

I must be missing something as I have read heaps of articles which all seem to sugguest an index spanning both columns should solve this issue

634

asked Feb 26 '14 04:02

Not loved

1 Answers

Technically the index you have added does allow you to avoid a sort.

However the index you have created is non covering so SQL Server would then also need to perform 60 million key lookups back to the base table.

Simply scanning the clustered index and sorting it on the fly is costed as being considerably cheaper than that option.

In order to get the index to be used automatically you would need to either.

Remove columns from the query SELECT list so the index covers it.
Add INCLUDE-d columns to the index.

BTW: For a table with 60 million rows you may well find that even if you were to try and force the issue with an index hint on the non covering index you still don't get the desired results of avoiding a sort.

CREATE TABLE RuleInstanceHistoricalMembership
  (
     ID             INT PRIMARY KEY,
     Col2           INT,
     Col3           INT,
     RuleInstanceId INT,
     [Timestamp]    INT
  )

CREATE NONCLUSTERED INDEX IX_MyIndex
  ON dbo.[RuleInstanceHistoricalMembership](RuleInstanceId, [Timestamp] DESC)

/*Fake small table*/
UPDATE STATISTICS RuleInstanceHistoricalMembership 
                  WITH ROWCOUNT = 600, 
                       PAGECOUNT = 10 

SELECT *,
       ROW_NUMBER() OVER ( PARTITION BY RuleInstanceId 
                               ORDER BY [Timestamp] DESC ) AS rn
FROM   RuleInstanceHistoricalMembership WITH (INDEX = IX_MyIndex)

Gives the plan

enter image description here

With no sort but up the row and page count

/*Fake large table*/
UPDATE STATISTICS RuleInstanceHistoricalMembership 
                  WITH ROWCOUNT = 60000000, 
                       PAGECOUNT = 10000000

And try again and you get

enter image description here

Now it has two sorts!

The scan on the NCI is in RuleInstanceId, Timestamp DESC order but then SQL Server reorders it into clustered index key order (Id ASC) per Optimizing I/O Performance by Sorting.

This step is to try and reduce the expected massive cost of 60 million random lookups into the clustered index. Then it gets sorted back into the original RuleInstanceId, Timestamp DESC order that the index delivered it in.

answered Oct 06 '22 20:10

Martin Smith

Related questions
                            
                                Best way to Insert Python NumPy array into PostgreSQL database
                            
                                Can someone give me a high overview of how lucene.net works?
                            
                                SQL SELECT FROM ... AS with data type specifier?
                            
                                SQL Server ALTER field NOT NULL takes forever
                            
                                Mysql deadlock explanation needed
                            
                                Representing ecommerce products and variations cleanly in the database
                            
                                Inverse of COALESCE
                            
                                How to combine two rows and calculate the time difference between two timestamp values in MySQL?
                            
                                SQL query on multiple databases
                            
                                Is it better to maintain a separate count table vs running count query every time?
                            
                                Oracle: Full text search with condition
                            
                                Is there a way to turn off implicit type conversion in SQL Server?
                            
                                Optimizing MySQL query with multiple left joins
                            
                                How do I extend this query to find valid combinations of three items?
                            
                                Automatically generate a user defined table type that matches an existing table
                            
                                Simplified/shorthand SQL Data Definition Languages?
                            
                                SQLite Bracket "don't work"
                            
                                SQL query for join table and multiple values
                            
                                How to query a join table so that multiple criteria are met?
                            
                                How to handle Python multiprocessing database concurrency, specifically with django?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Correct SQL index for Partition + Order to remove SORT

Tags:

performance

sql

sql-server

Not loved

People also ask

1 Answers

Martin Smith

Recent Activity

Donate For Us