<p>I have a many-to-many join table in Postgres that I would like to index to A) increase performance (obviously) and B) enforce uniqueness. For example:</p> <pre class="prettyprint"><code>a_id | b_id 1 | 2 <- okay 1 | 3 <- okay 2 | 3 <- okay 1 | 3 <- not okay (same as row 2) </code></pre> <p>Is it possible to have a single index on two columns that enforces uniqueness in the values? What type of index should I use?</p>

<h3>As Primary Key</h3> <p>Do this if that unique is primary key:</p> <pre class="prettyprint"><code>create table tbl( a_id int not null, b_id int not null, constraint tbl_pkey primary key(a_id,b_id) ); </code></pre> <h3>Not Primary Key</h3> <p>Do this if that unique is non-primary key:</p> <pre class="prettyprint"><code>create table tbl( -- other primary key here, e.g.: -- id serial primary key, a_id int not null, b_id int not null, constraint tbl_unique unique(a_id,b_id) ); </code></pre> <h3>Existing Table</h3> <p>If you have existing table, do this instead:</p> <pre class="prettyprint"><code>alter table tbl add constraint tbl_unique unique(a_id, b_id) </code></pre> <p>That alter table display this message:</p> <pre class="prettyprint"><code>NOTICE: ALTER TABLE / ADD UNIQUE will create implicit index "tbl_unique" for table "tbl" Query returned successfully with no result in 22 ms. </code></pre> <h3>Drop</h3> <p>If you wanted to drop that constraint(you might want to make unique a combination of 3 fields):</p> <pre class="prettyprint"><code>ALTER TABLE tbl DROP CONSTRAINT tbl_unique; </code></pre> <h3>Index & Constraint & Nulls</h3> <p>Regarding index, from Postgres doc:</p> <blockquote> <p>PostgreSQL automatically creates a unique index when a unique constraint or primary key is defined for a table</p> </blockquote> <p>Source: http://www.postgresql.org/docs/9.1/static/indexes-unique.html</p> <hr> <p>If uniqueness depends on some rules, you shall use <code>CREATE UNIQUE INDEX</code>, for example:</p> <p>Given this:</p> <pre class="prettyprint"><code>CREATE TABLE tbl ( a_id integer NOT NULL, b_id integer NULL ); alter table tbl add constraint tbl_unique unique(a_id, b_id); </code></pre> <p>That unique can catch these duplicates, this will be rejected by database:</p> <pre class="prettyprint"><code>insert into tbl values (1,1), (1,1); </code></pre> <p>Yet that UNIQUE CONSTRAINT cannot catch duplicate nulls. Nulls serves as unknown, they serves as wildcard, that's why it's allowed to have multiple nulls in unique constraint. This will be accepted by database:</p> <pre class="prettyprint"><code>insert into tbl values (1,1), (1,null), -- think of this null as wildcard, some real value can be assigned later. (1,null); -- and so is this. that's why both of these nulls are allowed </code></pre> <p>Think of <code>UNIQUE CONSTRAINT</code> that it allows deferred uniqueness, hence the acceptance of null values above. </p> <p>If you want only one wildcard(null b_id) per a_id, aside from the unique constraint, you need to add a <code>UNIQUE INDEX</code>. UNIQUE CONSTRAINT can't have an expression on them. <code>INDEX</code> and <code>UNIQUE INDEX</code> can. This will be your complete DDL for rejecting multiple null;</p> <p>This will be your complete DDL:</p> <pre class="prettyprint"><code>CREATE TABLE tbl ( a_id integer NOT NULL, b_id integer NULL ); alter table tbl add constraint tbl_unique unique(a_id, b_id); create unique index tbl_unique_a_id on tbl(a_id) where b_id is null; </code></pre> <p>This will be rejected by your database now:</p> <pre class="prettyprint"><code>insert into tbl values (1,1), (1,null), (1,null); </code></pre> <p>This will be allowed:</p> <pre class="prettyprint"><code>insert into tbl values (1,1), (1,null); </code></pre> <p>Related to http://www.ienablemuch.com/2010/12/postgresql-said-sql-server2008-said-non.html</p>

Postgres unique multi-column index for join table

Tags:

sql

join

postgresql

I have a many-to-many join table in Postgres that I would like to index to A) increase performance (obviously) and B) enforce uniqueness. For example:

a_id | b_id
1    | 2     <- okay
1    | 3     <- okay
2    | 3     <- okay
1    | 3     <- not okay (same as row 2)

Is it possible to have a single index on two columns that enforces uniqueness in the values? What type of index should I use?

606

asked May 06 '12 06:05

bloudermilk

1 Answers

As Primary Key

Do this if that unique is primary key:

create table tbl(
   a_id int not null,
   b_id int not null,
   constraint tbl_pkey primary key(a_id,b_id)
);

Not Primary Key

Do this if that unique is non-primary key:

create table tbl(

   -- other primary key here, e.g.:
   -- id serial primary key,

   a_id int not null,
   b_id int not null,
   constraint tbl_unique unique(a_id,b_id)
);

Existing Table

If you have existing table, do this instead:

alter table tbl
      add constraint tbl_unique unique(a_id, b_id)

That alter table display this message:

NOTICE:  ALTER TABLE / ADD UNIQUE will create implicit index "tbl_unique" for table "tbl"


Query returned successfully with no result in 22 ms.

Drop

If you wanted to drop that constraint(you might want to make unique a combination of 3 fields):

ALTER TABLE tbl DROP CONSTRAINT tbl_unique;

Index & Constraint & Nulls

Regarding index, from Postgres doc:

PostgreSQL automatically creates a unique index when a unique constraint or primary key is defined for a table

Source: http://www.postgresql.org/docs/9.1/static/indexes-unique.html

If uniqueness depends on some rules, you shall use CREATE UNIQUE INDEX, for example:

Given this:

CREATE TABLE tbl
(
  a_id integer NOT NULL,
  b_id integer NULL  
);

alter table tbl
    add constraint tbl_unique unique(a_id, b_id);

That unique can catch these duplicates, this will be rejected by database:

insert into tbl values
(1,1),
(1,1);

Yet that UNIQUE CONSTRAINT cannot catch duplicate nulls. Nulls serves as unknown, they serves as wildcard, that's why it's allowed to have multiple nulls in unique constraint. This will be accepted by database:

insert into tbl values
(1,1),
(1,null), -- think of this null as wildcard, some real value can be assigned later.
(1,null); -- and so is this. that's why both of these nulls are allowed

Think of UNIQUE CONSTRAINT that it allows deferred uniqueness, hence the acceptance of null values above.

If you want only one wildcard(null b_id) per a_id, aside from the unique constraint, you need to add a UNIQUE INDEX. UNIQUE CONSTRAINT can't have an expression on them. INDEX and UNIQUE INDEX can. This will be your complete DDL for rejecting multiple null;

This will be your complete DDL:

CREATE TABLE tbl
(
  a_id integer NOT NULL,
  b_id integer NULL  
);
alter table tbl
    add constraint tbl_unique unique(a_id, b_id);

create unique index tbl_unique_a_id on tbl(a_id) where b_id is null;

This will be rejected by your database now:

insert into tbl values
(1,1),
(1,null),
(1,null);

This will be allowed:

insert into tbl values
(1,1),
(1,null);

Related to http://www.ienablemuch.com/2010/12/postgresql-said-sql-server2008-said-non.html

179

answered Oct 19 '22 23:10

Michael Buen

Related questions
                            
                                PostgreSQL: Using subquery abbreviation ('AS') in the WHERE clause
                            
                                SQL Query to get records of parent table that have a list of child records
                            
                                Linq to SQL - what's better?
                            
                                SQL SELECT multiple INNER JOINs
                            
                                Self join to a table
                            
                                T-SQL How to select rows without duplicate values from one column?
                            
                                User Defined Function Best Practice
                            
                                Hashing passwords before sending to server
                            
                                Update SQL with two tables in Oracle
                            
                                SQL multiple SETs in one UPDATE?
                            
                                sqlite: alias column name can't contains a dot "."
                            
                                Using .query() in t-sql to get only inner text
                            
                                Order by Clause conflicts with distinct in access?
                            
                                SQL try-catch statement not handling error (SQL Server 2008)
                            
                                How to append two columns into one column in SQL?
                            
                                SQL Server DISTINCT pagination with ROW_NUMBER() not distinct
                            
                                Rollback in PLSQL Exception
                            
                                How can I use Proc SQL to find all the records that only exist in one table but not the other?
                            
                                SQL Count in Hibernate HQL
                            
                                why using LIKE with TIMESTAMPS do not work in DB2

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With