PostgreSQL - Continue on unique_violation (plpgsql)

I have a PostgreSQL table with a unique index on some fields to prevent duplicates. The inserts go through a PL/pgSQL function that inserts all the rows and catches the unique_violation exception, although it stops inserting records even if there is just one duplicate.

I can't issue separate INSERTs due to performance issues (some of these batches run to hundreds of rows). The problem is that a single duplicate aborts the whole batch, like the first two rows in the following example.

CREATE OR REPLACE FUNCTION easy_import() RETURNS VOID AS
  $$
  BEGIN
    BEGIN
      INSERT INTO things ("title", "uniq1", "uniq2") VALUES
      ('title 1', 100, 102),
      ('title 2', 100, 102),
      ('title 3', 101, 102),
      ('title 4', 102, 102),
      ('title 5', 103, 102),
      ('title 6', 104, 102),
      ('title 7', 105, 102),
      ('title 8', 106, 102),
      ('title 9', 107, 102),
      ('title 10', 108, 102);
      RETURN;
    EXCEPTION WHEN unique_violation THEN
      -- do nothing
    END;
  END;
  $$
  LANGUAGE plpgsql;

Is there a way to ignore the unique_violation just for one record and prevent it from stopping further INSERTs?

Thank you.

Update

  • The unique index is on the "uniq1" and "uniq2" fields; sorry about the confusion.
  • While @cdhowie's solution seems to be the best, it somehow misses the fact that running the same query again still triggers an error. It's strange, because the JOIN is there precisely to filter those rows out. Still working on it.
asked Jul 25 '12 by metrobalderas

People also ask

Is PL/pgSQL fast?

PL/pgSQL is faster because you don't have to fetch the data, process it, and then submit a new query. All the processing is done inside the server, and the function is precompiled, which further boosts performance.
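
As a rough illustration, here is a hypothetical helper (not part of the question) where both the existence check and the insert run inside the server, so no intermediate result ever travels to the client:

CREATE OR REPLACE FUNCTION add_thing_if_missing(p_title text, p_uniq1 int, p_uniq2 int)
RETURNS boolean AS
$$
BEGIN
  -- the check runs server-side; nothing is fetched back to the client
  IF EXISTS (SELECT 1 FROM things WHERE uniq1 = p_uniq1 AND uniq2 = p_uniq2) THEN
    RETURN false;
  END IF;
  INSERT INTO things (title, uniq1, uniq2) VALUES (p_title, p_uniq1, p_uniq2);
  RETURN true;
END;
$$
LANGUAGE plpgsql;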

What does RETURN NEXT do in PL/pgSQL?

RETURN NEXT and RETURN QUERY do not actually return from the function — they simply append zero or more rows to the function's result set. Execution then continues with the next statement in the PL/pgSQL function. As successive RETURN NEXT or RETURN QUERY commands are executed, the result set is built up.
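
For illustration, a hypothetical set-returning function (not part of the question) that builds its result with both commands:

CREATE OR REPLACE FUNCTION list_uniq1(p_uniq2 int) RETURNS SETOF int AS
$$
DECLARE
  v int;
BEGIN
  FOR v IN SELECT uniq1 FROM things WHERE uniq2 = p_uniq2 LOOP
    RETURN NEXT v;            -- appends one value to the result set
  END LOOP;
  RETURN QUERY SELECT -1;     -- appends all rows of a query
  RETURN;                     -- only now does the function actually end
END;
$$
LANGUAGE plpgsql;

-- usage: SELECT * FROM list_uniq1(102);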

What is DO $$ in PostgreSQL?

“lang_name” is simply the name of the procedural language. If the language is not mentioned, PostgreSQL will use the default procedural language, PL/pgSQL. $$ (dollar quoting) is a PostgreSQL substitute for single quotes, used to avoid quoting issues inside the function body.
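
For example, in a DO block (an anonymous PL/pgSQL block, shown here only to illustrate the quoting), the body is delimited by $$ instead of single quotes, so single quotes inside it need no escaping:

DO $$
BEGIN
  -- the quotes around the notice text need no escaping,
  -- because the surrounding body is dollar-quoted
  RAISE NOTICE 'single quotes in here are no problem';
END;
$$ LANGUAGE plpgsql;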


2 Answers

Assuming that the unique constraint is composite around uniq1 and uniq2, this will work:

INSERT INTO things (title, uniq1, uniq2)
WITH new_rows (title, uniq1, uniq2) AS (VALUES
    ('title 1', 100, 102),
    ('title 2', 100, 102),
    ('title 3', 101, 102),
    ('title 4', 102, 102),
    ('title 5', 103, 102),
    ('title 6', 104, 102),
    ('title 7', 105, 102),
    ('title 8', 106, 102),
    ('title 9', 107, 102),
    ('title 10', 108, 102)
)
SELECT
    -- collapse duplicates within the new rows themselves
    DISTINCT ON (n.uniq1, n.uniq2)
    n.title, n.uniq1, n.uniq2
FROM new_rows AS n
-- anti-join: keep only rows that do not already exist in things
LEFT JOIN things AS t
    ON n.uniq1 = t.uniq1 AND n.uniq2 = t.uniq2
WHERE t.uniq1 IS NULL;

This may actually wind up being less performant than individual INSERT statements, but it's about the only other thing that will do the trick. Benchmark each approach and see which works best for you.
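
If a single callable function like the question's easy_import() is still wanted, the anti-join can simply replace the exception handler. A sketch based on this answer, assuming the unique constraint is on (uniq1, uniq2):

CREATE OR REPLACE FUNCTION easy_import() RETURNS void AS
$$
BEGIN
  INSERT INTO things (title, uniq1, uniq2)
  WITH new_rows (title, uniq1, uniq2) AS (VALUES
      ('title 1', 100, 102),
      ('title 2', 100, 102)
      -- ... remaining rows ...
  )
  SELECT DISTINCT ON (n.uniq1, n.uniq2)
         n.title, n.uniq1, n.uniq2
  FROM new_rows AS n
  LEFT JOIN things AS t
         ON n.uniq1 = t.uniq1 AND n.uniq2 = t.uniq2
  WHERE t.uniq1 IS NULL;  -- only rows not already present are inserted
END;
$$
LANGUAGE plpgsql;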

answered by cdhowie

Your table is something like this:

CREATE TABLE t
(
  title text,
  uniq1 int not null,
  uniq2 int not null,
  CONSTRAINT t_pk_u1_u2 PRIMARY KEY (uniq1, uniq2)
);

so let me add a rule to that:

CREATE OR REPLACE RULE ignore_duplicate_inserts_on_t AS ON INSERT TO t
   WHERE (EXISTS ( SELECT 1 FROM t WHERE t.uniq1 = new.uniq1 and t.uniq2 = new.uniq2))
   DO INSTEAD NOTHING;

and after that, you can run this query:

insert into t(title,uniq1,uniq2) values 
    ('title 1', 100, 102),
    ('title 2', 100, 102),
    ...;
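
Applied to the question's things table, the equivalent rule would look something like this (a sketch, assuming the unique constraint is on (uniq1, uniq2)):

CREATE OR REPLACE RULE ignore_duplicate_inserts_on_things AS ON INSERT TO things
   WHERE (EXISTS (SELECT 1 FROM things
                  WHERE things.uniq1 = new.uniq1 AND things.uniq2 = new.uniq2))
   DO INSTEAD NOTHING;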

If your table is large, this approach performs better. I tested both this approach and the join approach mentioned above by cdhowie on a table with about 2 million rows; the results were:

Rule approach (this answer): 1400 rows per second
Join approach (cdhowie's answer): 650 rows per second
answered by Khalil