While executing an <code>INSERT</code> statement with many rows, I want to skip duplicate entries that would otherwise cause failure. After some research, my options appear to be the use of either: <ul> <li> <code>ON DUPLICATE KEY UPDATE</code> which implies an unnecessary update at some cost, or</li> <li> <code>INSERT IGNORE</code> implies an invitation for other kinds of failure to slip in unannounced.</li> </ul> Am I right in these assumptions? What's the best way to simply skip the rows that might cause duplicates and just continue on to the other rows?

I would recommend using <code>INSERT...ON DUPLICATE KEY UPDATE</code>. If you use <code>INSERT IGNORE</code>, then the row won't actually be inserted if it results in a duplicate key. But the statement won't generate an error. It generates a warning instead. These cases include: <ul> <li>Inserting a duplicate key in columns with <code>PRIMARY KEY</code> or <code>UNIQUE</code> constraints. </li> <li>Inserting a NULL into a column with a <code>NOT NULL</code> constraint.</li> <li>Inserting a row to a partitioned table, but the values you insert don't map to a partition.</li> </ul> If you use <code>REPLACE</code>, MySQL actually does a <code>DELETE</code> followed by an <code>INSERT</code> internally, which has some unexpected side effects: <ul> <li>A new auto-increment ID is allocated.</li> <li>Dependent rows with foreign keys may be deleted (if you use cascading foreign keys) or else prevent the <code>REPLACE</code>.</li> <li>Triggers that fire on <code>DELETE</code> are executed unnecessarily.</li> <li>Side effects are propagated to replicas too.</li> </ul> correction: both <code>REPLACE</code> and <code>INSERT...ON DUPLICATE KEY UPDATE</code> are non-standard, proprietary inventions specific to MySQL. ANSI SQL 2003 defines a <code>MERGE</code> statement that can solve the same need (and more), but MySQL does not support the <code>MERGE</code> statement. <hr> A user tried to edit this post (the edit was rejected by moderators). The edit tried to add a claim that <code>INSERT...ON DUPLICATE KEY UPDATE</code> causes a new auto-increment id to be allocated. It's true that the new id is generated, but it is not used in the changed row. See demonstration below, tested with Percona Server 5.5.28. The configuration variable <code>innodb_autoinc_lock_mode=1</code> (the default): <pre class="prettyprint"><code>mysql> create table foo (id serial primary key, u int, unique key (u)); mysql> insert into foo (u) values (10); mysql> select * from foo; +----+------+ | id | u | +----+------+ | 1 | 10 | +----+------+ mysql> show create table foo\G CREATE TABLE `foo` ( `id` bigint(20) unsigned NOT NULL AUTO_INCREMENT, `u` int(11) DEFAULT NULL, PRIMARY KEY (`id`), UNIQUE KEY `u` (`u`) ) ENGINE=InnoDB AUTO_INCREMENT=2 DEFAULT CHARSET=latin1 mysql> insert into foo (u) values (10) on duplicate key update u = 20; mysql> select * from foo; +----+------+ | id | u | +----+------+ | 1 | 20 | +----+------+ mysql> show create table foo\G CREATE TABLE `foo` ( `id` bigint(20) unsigned NOT NULL AUTO_INCREMENT, `u` int(11) DEFAULT NULL, PRIMARY KEY (`id`), UNIQUE KEY `u` (`u`) ) ENGINE=InnoDB AUTO_INCREMENT=3 DEFAULT CHARSET=latin1 </code></pre> The above demonstrates that the IODKU statement detects the duplicate, and invokes the update to change the value of <code>u</code>. Note the <code>AUTO_INCREMENT=3</code> indicates an id was generated, but not used in the row. Whereas <code>REPLACE</code> does delete the original row and inserts a new row, generating and storing a new auto-increment id: <pre class="prettyprint"><code>mysql> select * from foo; +----+------+ | id | u | +----+------+ | 1 | 20 | +----+------+ mysql> replace into foo (u) values (20); mysql> select * from foo; +----+------+ | id | u | +----+------+ | 3 | 20 | +----+------+ </code></pre>

"INSERT IGNORE" vs "INSERT ... ON DUPLICATE KEY UPDATE"

1 Answers

I would recommend using INSERT...ON DUPLICATE KEY UPDATE.

If you use INSERT IGNORE, then the row won't actually be inserted if it results in a duplicate key. But the statement won't generate an error. It generates a warning instead. These cases include:

Inserting a duplicate key in columns with PRIMARY KEY or UNIQUE constraints.
Inserting a NULL into a column with a NOT NULL constraint.
Inserting a row to a partitioned table, but the values you insert don't map to a partition.

If you use REPLACE, MySQL actually does a DELETE followed by an INSERT internally, which has some unexpected side effects:

A new auto-increment ID is allocated.
Dependent rows with foreign keys may be deleted (if you use cascading foreign keys) or else prevent the REPLACE.
Triggers that fire on DELETE are executed unnecessarily.
Side effects are propagated to replicas too.

correction: both REPLACE and INSERT...ON DUPLICATE KEY UPDATE are non-standard, proprietary inventions specific to MySQL. ANSI SQL 2003 defines a MERGE statement that can solve the same need (and more), but MySQL does not support the MERGE statement.

A user tried to edit this post (the edit was rejected by moderators). The edit tried to add a claim that INSERT...ON DUPLICATE KEY UPDATE causes a new auto-increment id to be allocated. It's true that the new id is generated, but it is not used in the changed row.

See demonstration below, tested with Percona Server 5.5.28. The configuration variable innodb_autoinc_lock_mode=1 (the default):

mysql> create table foo (id serial primary key, u int, unique key (u)); mysql> insert into foo (u) values (10); mysql> select * from foo; +----+------+ | id | u    | +----+------+ |  1 |   10 | +----+------+  mysql> show create table foo\G CREATE TABLE `foo` (   `id` bigint(20) unsigned NOT NULL AUTO_INCREMENT,   `u` int(11) DEFAULT NULL,   PRIMARY KEY (`id`),   UNIQUE KEY `u` (`u`) ) ENGINE=InnoDB AUTO_INCREMENT=2 DEFAULT CHARSET=latin1  mysql> insert into foo (u) values (10) on duplicate key update u = 20; mysql> select * from foo; +----+------+ | id | u    | +----+------+ |  1 |   20 | +----+------+  mysql> show create table foo\G CREATE TABLE `foo` (   `id` bigint(20) unsigned NOT NULL AUTO_INCREMENT,   `u` int(11) DEFAULT NULL,   PRIMARY KEY (`id`),   UNIQUE KEY `u` (`u`) ) ENGINE=InnoDB AUTO_INCREMENT=3 DEFAULT CHARSET=latin1

The above demonstrates that the IODKU statement detects the duplicate, and invokes the update to change the value of u. Note the AUTO_INCREMENT=3 indicates an id was generated, but not used in the row.

Whereas REPLACE does delete the original row and inserts a new row, generating and storing a new auto-increment id:

mysql> select * from foo; +----+------+ | id | u    | +----+------+ |  1 |   20 | +----+------+ mysql> replace into foo (u) values (20); mysql> select * from foo; +----+------+ | id | u    | +----+------+ |  3 |   20 | +----+------+

157

answered Oct 22 '22 06:10

Bill Karwin

Related questions
                            
                                Create new user in MySQL and give it full access to one database
                            
                                Find duplicate records in MySQL
                            
                                MySQL Error 1093 - Can't specify target table for update in FROM clause
                            
                                When to use single quotes, double quotes, and backticks in MySQL
                            
                                SQL injection that gets around mysql_real_escape_string()
                            
                                How do I see what character set a MySQL database / table / column is?
                            
                                'IF' in 'SELECT' statement - choose output value based on column values
                            
                                How can I temporarily disable a foreign key constraint in MySQL?
                            
                                MySQL Query GROUP BY day / month / year
                            
                                How to truncate a foreign key constrained table?
                            
                                Disable ONLY_FULL_GROUP_BY
                            
                                What is the best collation to use for MySQL with PHP? [closed]
                            
                                Host 'xxx.xx.xxx.xxx' is not allowed to connect to this MySQL server
                            
                                How can I do a FULL OUTER JOIN in MySQL?
                            
                                Duplicating a MySQL table, indices, and data
                            
                                Finding duplicate values in MySQL
                            
                                How can I SELECT rows with MAX(Column value), PARTITION by another column in MYSQL?
                            
                                TINYTEXT, TEXT, MEDIUMTEXT, and LONGTEXT maximum storage sizes
                            
                                MySQL: Large VARCHAR vs. TEXT?
                            
                                MyISAM versus InnoDB [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

"INSERT IGNORE" vs "INSERT ... ON DUPLICATE KEY UPDATE"

Tags:

mysql

insert

Thomas G Henry

People also ask

1 Answers

Bill Karwin

Recent Activity

Donate For Us