If I know an index will have unique values, how will declaring it as unique affect performance on inserts or selects? If the optimiser knows the index is unique, how will that affect the query plan? I understand that specifying uniqueness can serve to preserve integrity, but leaving that discussion aside for the moment, what are the performance consequences?


Declaring an Index as unique in SQL Server

2 Answers

Long story short: if your data are intrinsically UNIQUE, you will benefit from creating a UNIQUE index on them.

See the article in my blog for detailed explanation:

Making an index UNIQUE

Now, the gory details.

As @Mehrdad said, UNIQUENESS affects the estimated row count in the plan builder.

A UNIQUE index has the maximum possible selectivity, which is why:

    SELECT  *
    FROM    table1 t1, table2 t2
    WHERE   t1.id = :myid
            AND t2.unique_indexed_field = t1.value

almost surely will use NESTED LOOPS, while

    SELECT  *
    FROM    table1 t1, table2 t2
    WHERE   t1.id = :myid
            AND t2.non_unique_indexed_field = t1.value

may benefit from a HASH JOIN if the optimizer thinks that non_unique_indexed_field is not selective.
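To see how uniqueness feeds into row-count estimates, here is a minimal sketch that uses SQLite (through Python's standard `sqlite3` module) as a stand-in, since it can be run anywhere. SQL Server keeps its own statistics in a different form, so treat this only as an illustration of the idea. The helper name `rows_per_key_estimates` and the sample data are assumptions made for the example. After `ANALYZE`, SQLite records in `sqlite_stat1` an estimate of how many rows match a single key of each index.

```python
import sqlite3


def rows_per_key_estimates(n_rows=1000, n_groups=10):
    """Return {index_name: estimated rows per key} as recorded by SQLite's ANALYZE.

    Hypothetical helper for illustration; SQLite stands in for SQL Server here.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE t_indexer ("
        "id INTEGER NOT NULL PRIMARY KEY, "
        "uval INTEGER NOT NULL, "
        "ival INTEGER NOT NULL)"
    )
    conn.execute("CREATE UNIQUE INDEX ux_indexer_ux ON t_indexer (uval)")
    conn.execute("CREATE INDEX ix_indexer_ux ON t_indexer (ival)")
    # uval is unique; ival repeats, with only n_groups distinct values.
    conn.executemany(
        "INSERT INTO t_indexer (id, uval, ival) VALUES (?, ?, ?)",
        ((i, i, i % n_groups) for i in range(1, n_rows + 1)),
    )
    conn.commit()
    conn.execute("ANALYZE")

    estimates = {}
    query = "SELECT idx, stat FROM sqlite_stat1 WHERE tbl = 't_indexer'"
    for idx, stat in conn.execute(query):
        if idx is None:
            continue
        # stat looks like "<rows in index> <avg rows per key> ..."
        estimates[idx] = int(stat.split()[1])
    conn.close()
    return estimates


if __name__ == "__main__":
    for name, per_key in sorted(rows_per_key_estimates().items()):
        print(name, "-> about", per_key, "row(s) per key")
```

The unique index is recorded with one row per key, while the non-unique one is recorded with many. That kind of difference is what steers an optimizer towards or away from nested loops.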

If your index is CLUSTERED (i.e. the rows themselves are contained in the index leaves) and non-UNIQUE, then a special hidden column called uniquifier is added to each index key, thus making the key larger and the index slower.

That's why a UNIQUE CLUSTERED index is in fact a little more efficient than a non-UNIQUE CLUSTERED one.

In Oracle, a join on a UNIQUE INDEX is required for so-called key preservation, which ensures that each row from a table will be selected at most once and makes a view updatable.

This query:

    UPDATE  (
            SELECT  *
            FROM    mytable t1, mytable t2
            WHERE   t2.reference = t1.unique_indexed_field
            )
    SET     value = other_value

will work in Oracle, while this one:

    UPDATE  (
            SELECT  *
            FROM    mytable t1, mytable t2
            WHERE   t2.reference = t1.non_unique_indexed_field
            )
    SET     value = other_value

will fail.

This is not an issue with SQL Server, though.
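The "selected at most once" property behind key preservation can be checked on a toy data set. The sketch below does not use Oracle and does not attempt the updatable view itself. It uses SQLite through Python's `sqlite3` module only to count how many times each `t2` row shows up in the join. The helper name `max_matches_per_row` and the three sample rows are assumptions made for the illustration.

```python
import sqlite3

ALLOWED_COLUMNS = ("unique_indexed_field", "non_unique_indexed_field")


def max_matches_per_row(join_column):
    """Join t2 to t1 on t1.<join_column> and return the largest number of
    times any single t2 row appears in the join result.

    Hypothetical helper; the table contents are made up for the example.
    """
    if join_column not in ALLOWED_COLUMNS:
        raise ValueError("unexpected column: " + join_column)

    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE mytable ("
        "id INTEGER NOT NULL PRIMARY KEY, "
        "reference INTEGER NOT NULL, "
        "unique_indexed_field INTEGER NOT NULL, "
        "non_unique_indexed_field INTEGER NOT NULL)"
    )
    conn.execute("CREATE UNIQUE INDEX ux_mytable ON mytable (unique_indexed_field)")
    conn.execute("CREATE INDEX ix_mytable ON mytable (non_unique_indexed_field)")
    # (id, reference, unique_indexed_field, non_unique_indexed_field)
    conn.executemany(
        "INSERT INTO mytable VALUES (?, ?, ?, ?)",
        [(1, 1, 1, 1), (2, 1, 2, 1), (3, 2, 3, 2)],
    )
    sql = (
        "SELECT MAX(cnt) FROM ("
        "SELECT t2.id, COUNT(*) AS cnt "
        "FROM mytable t1 JOIN mytable t2 "
        "ON t2.reference = t1." + join_column + " "
        "GROUP BY t2.id)"
    )
    (result,) = conn.execute(sql).fetchone()
    conn.close()
    return result


if __name__ == "__main__":
    for column in ALLOWED_COLUMNS:
        print(column, "-> max matches per t2 row:", max_matches_per_row(column))
```

When the join column is unique, no `t2` row can be matched more than once, so an update through the join has a single well-defined source row. With the non-unique column, some `t2` rows are matched several times and the update would be ambiguous.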

One more thing: for a table like this,

    CREATE TABLE t_indexer (
            id INT NOT NULL PRIMARY KEY,
            uval INT NOT NULL,
            ival INT NOT NULL
            );
    CREATE UNIQUE INDEX ux_indexer_ux ON t_indexer (uval);
    CREATE INDEX ix_indexer_ux ON t_indexer (ival);

this query:

    /* Sorts on the non-unique index first */
    SELECT  TOP 1 *
    FROM    t_indexer
    ORDER BY
            ival, uval

will use a TOP N SORT, while this one:

    /* Sorts on the unique index first */
    SELECT  TOP 1 *
    FROM    t_indexer
    ORDER BY
            uval, ival

will use just an index scan.

For the latter query, there is no point in additional sorting on ival, since the uval values are unique anyway, and the optimizer takes this into account.

On sample data of 200,000 rows (id == uval == ival), the former query runs for 15 seconds, while the latter one is instant.
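If you want to experiment with the two orderings without a SQL Server instance at hand, the following sketch rebuilds the same table in SQLite through Python's `sqlite3` module. It is only a stand-in: `LIMIT 1` replaces `TOP 1`, the plan text comes from SQLite's `EXPLAIN QUERY PLAN` rather than from SQL Server, and the timings quoted above will not carry over. The helper names `build_t_indexer` and `first_row_and_plan` and the smaller default row count are assumptions made for the example.

```python
import sqlite3


def build_t_indexer(n_rows=10000):
    """Create t_indexer in memory with id == uval == ival, as in the sample data."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE t_indexer ("
        "id INTEGER NOT NULL PRIMARY KEY, "
        "uval INTEGER NOT NULL, "
        "ival INTEGER NOT NULL)"
    )
    conn.execute("CREATE UNIQUE INDEX ux_indexer_ux ON t_indexer (uval)")
    conn.execute("CREATE INDEX ix_indexer_ux ON t_indexer (ival)")
    conn.executemany(
        "INSERT INTO t_indexer (id, uval, ival) VALUES (?, ?, ?)",
        ((i, i, i) for i in range(1, n_rows + 1)),
    )
    conn.commit()
    return conn


def first_row_and_plan(conn, order_by):
    """Run the TOP 1 style query for the given ORDER BY list.

    Returns the first row plus the plan lines SQLite reports for the query.
    """
    sql = "SELECT * FROM t_indexer ORDER BY " + order_by + " LIMIT 1"
    row = conn.execute(sql).fetchone()
    # The human-readable description is the last column of each plan row.
    plan = [r[-1] for r in conn.execute("EXPLAIN QUERY PLAN " + sql)]
    return row, plan


if __name__ == "__main__":
    conn = build_t_indexer()
    for order_by in ("ival, uval", "uval, ival"):
        row, plan = first_row_and_plan(conn, order_by)
        print("ORDER BY", order_by, "->", row)
        for line in plan:
            print("    ", line)
    conn.close()
```

Both queries return the same row. What is worth comparing is whether the reported plan for each ordering includes an extra sorting step.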


answered Sep 19 '22 06:09

Quassnoi

Of course the optimizer will take uniqueness into consideration. It affects the expected row count in query plans.

answered Sep 19 '22 06:09

mmx

