Are unique indexes better for column search performance? (PGSQL & MySQL)

I am curious as to whether

CREATE INDEX idx ON tbl (columns);

vs.

CREATE UNIQUE INDEX idx ON tbl (columns);

has a significant algorithmic performance benefit in PostgreSQL or MySQL implementations when scanning the indexed column(s), or whether the UNIQUE keyword simply introduces a unique constraint alongside the index.

I imagine there is probably a marginal benefit, insofar as indexes are likely to be implemented internally as some sort of hash¹-like structure, and collision handling by definition results in something other than O(1) performance. Given this premise, it is likely that if a large percentage of values are identical, then the structure degenerates into something linear.

So, for purposes of my question, assume that the distribution of values is relatively discrete and uniform.

Thanks in advance!

¹ Which is a matter of pure speculation for me, as I am not familiar with RDBMS internals.

asked Aug 18 '09 12:08

Alex Balashov

2 Answers

If your data are unique, you should create a UNIQUE index on them.

This implies no additional overhead and affects the optimizer's decisions in certain cases, so that it can choose a better algorithm.

In SQL Server and in PostgreSQL, for instance, if you sort on a UNIQUE key, the optimizer ignores any ORDER BY expressions that follow it (since they are irrelevant), i.e. this query:

SELECT  *
FROM    mytable
ORDER BY
        col_unique, other_col
LIMIT 10

will use an index on col_unique and won't sort on other_col because it's useless.
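The reason the second sort key is useless can be sketched outside the database. The snippet below is only an illustration in plain Python, not what any optimizer actually runs; the rows and their values are made up, with the first tuple element standing in for col_unique and the second for other_col.

```python
import random

# Hypothetical rows: (col_unique, other_col). col_unique never repeats.
random.seed(0)
rows = [(key, random.randint(0, 5)) for key in random.sample(range(1000), 50)]

# Equivalent of ORDER BY col_unique, other_col
by_both = sorted(rows, key=lambda row: (row[0], row[1]))

# Equivalent of ORDER BY col_unique
by_unique_only = sorted(rows, key=lambda row: row[0])

# With no ties on col_unique, the second sort key is never the deciding one,
# so both orderings are identical.
same_order = by_both == by_unique_only
print(same_order)  # True
```

Because the two orderings always coincide when the first key is unique, reading the first 10 entries straight from the index on col_unique is enough to answer the query.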

This query:

SELECT  *
FROM    mytable
WHERE   mycol IN
        (
        SELECT  othercol
        FROM    othertable
        )

will also be converted into an INNER JOIN (as opposed to a SEMI JOIN) if there is a UNIQUE index on othertable.othercol.
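Why that rewrite is only safe with a UNIQUE index can be shown with a small model. This is a sketch in plain Python, assuming single-column tables represented as lists; semi_join and inner_join are hypothetical helper names, not database functions.

```python
def semi_join(left, right):
    """Model of WHERE mycol IN (SELECT othercol ...): each left row at most once."""
    right_values = set(right)
    return [value for value in left if value in right_values]


def inner_join(left, right):
    """Model of INNER JOIN ON mycol = othercol: one output row per matching pair."""
    return [
        left_value
        for left_value in left
        for right_value in right
        if left_value == right_value
    ]


mycol = [1, 2, 3, 4]
unique_othercol = [2, 3, 5]          # what a UNIQUE index guarantees
duplicated_othercol = [2, 2, 3, 5]   # what a plain index allows

# With unique values on the right, both joins return the same rows.
print(semi_join(mycol, unique_othercol) == inner_join(mycol, unique_othercol))  # True

# With duplicates, the inner join repeats rows, so the rewrite would be wrong.
print(semi_join(mycol, duplicated_othercol))   # [2, 3]
print(inner_join(mycol, duplicated_othercol))  # [2, 2, 3]
```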

An index always contains some kind of pointer to the row (ctid in PostgreSQL, row pointer in MyISAM, primary key/uniquifier in InnoDB), and the leaves are ordered on these pointers, so in fact every index leaf is unique in some way (though it may not be obvious).
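A rough model of this, assuming an ordered B-tree-style index (the default index type in both PostgreSQL and InnoDB) rather than a hash: a sorted Python list of (value, row_pointer) pairs stands in for the leaf level. The names and data are hypothetical, and a real B-tree inserts far more cheaply than insort does on a list.

```python
from bisect import bisect_left, insort

# Hypothetical leaf entries: (indexed_value, row_pointer), kept in sorted order.
index = []
for row_pointer, value in enumerate(["b", "a", "b", "c", "b"]):
    insort(index, (value, row_pointer))


def lookup(index, value):
    """Binary-search to the first entry for `value`, then scan its matches."""
    # Row pointers are >= 0 here, so (value, -1) sorts before every real entry.
    position = bisect_left(index, (value, -1))
    matches = []
    while position < len(index) and index[position][0] == value:
        matches.append(index[position][1])
        position += 1
    return matches


# Duplicate values still produce distinct entries, because the pointer differs.
print(len(set(index)) == len(index))  # True
print(lookup(index, "b"))             # [0, 2, 4]
print(lookup(index, "z"))             # []
```

In this model, duplicate values do not degrade the search itself: finding the first match stays logarithmic, and the extra cost is only the scan over the matching entries.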

See this article in my blog for performance details:

Making an index UNIQUE

answered Sep 27 '22 20:09

Quassnoi

There is a small penalty during update/insert operations for having the unique constraint. It has to search before the insert/update operation to make sure the uniqueness constraint isn't violated.
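A minimal sketch of that check in action, using Python's built-in sqlite3 module as a stand-in because it needs no server. SQLite is an assumption here, not one of the engines in the question; PostgreSQL and MySQL reject the duplicate in the same spirit but report their own error types.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tbl (col INTEGER)")
conn.execute("CREATE UNIQUE INDEX idx ON tbl (col)")
conn.execute("INSERT INTO tbl (col) VALUES (1)")

try:
    # The engine must probe the index for an existing 1 before accepting this.
    conn.execute("INSERT INTO tbl (col) VALUES (1)")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

row_count = conn.execute("SELECT COUNT(*) FROM tbl").fetchone()[0]
print(duplicate_rejected, row_count)  # True 1
conn.close()
```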

answered Sep 27 '22 18:09

Eric

Tags: indexing, mysql, postgresql, hash
