I want to write a program add new item to table. This item has an unique key <code>name</code> and it can be created by one of 100 threads, so I need to make sure that it is inserted only once. I have two ideas: <ol> <li>Use <code>insert ignore</code></li> <li>Fetch it from database via <code>select</code> then <code>insert</code> it to table if no returned row.</li> </ol> Which option is better? Is there an even more superior idea?

Late to the party, but I'm pondering something similar. I created the following table to track active users on a license per day: <pre class="prettyprint"><code>CREATE TABLE `license_active_users` ( `license_active_user_id` int(11) NOT NULL AUTO_INCREMENT, `license_id` int(11) NOT NULL, `user_id` int(11) NOT NULL, `date` date NOT NULL, PRIMARY KEY (`license_active_user_id`), UNIQUE KEY `license_id` (`license_id`,`user_id`,`date`) ) ENGINE=InnoDB DEFAULT CHARSET=utf8 COLLATE=utf8_unicode_ci; </code></pre> In other words, 1 primary key and 1 unique index across the remaining 3 columns. I then inserted 1 million unique rows into the table. Attempting to re-insert a subset (10,000 rows) of the same data yielded the following results: <ul> <li> <code>INSERT IGNORE</code>: 38 seconds</li> <li> <code>INSERT ... ON DUPLICATE KEY UPDATE</code>: 40 seconds</li> <li> <code>if (!rowExists("SELECT ..."))</code> <code>INSERT</code>: <2 seconds</li> </ul> If those 10,000 rows aren't already present in the table: <ul> <li> <code>INSERT IGNORE</code>: 34 seconds</li> <li> <code>INSERT ... ON DUPLICATE KEY UPDATE</code>: 41 seconds</li> <li> <code>if (!rowExists("SELECT ..."))</code> <code>INSERT</code>: 21 seconds</li> </ul> So the conclusion must be <code>if (!rowExists("SELECT ..."))</code> <code>INSERT</code> is fastest by far - at least for this particular table configuration. The missing test is <code>if (rowExists("SELECT ...")){</code> <code>UPDATE</code> <code>} else {</code> <code>INSERT</code> <code>}</code>, but I'll assume <code>INSERT ... ON DUPLICATE KEY UPDATE</code> is faster for this operation. For your particular case, however, I would go with <code>INSERT IGNORE</code> because (as far as I'm aware) it's an atomic operation and that'll save you a lot of trouble when working with threads.

"Insert ignore" vs "select and insert"

1 Answers

Late to the party, but I'm pondering something similar.

I created the following table to track active users on a license per day:

CREATE TABLE `license_active_users` (
  `license_active_user_id` int(11) NOT NULL AUTO_INCREMENT,
  `license_id` int(11) NOT NULL,
  `user_id` int(11) NOT NULL,
  `date` date NOT NULL,
  PRIMARY KEY (`license_active_user_id`),
  UNIQUE KEY `license_id` (`license_id`,`user_id`,`date`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COLLATE=utf8_unicode_ci;

In other words, 1 primary key and 1 unique index across the remaining 3 columns.

I then inserted 1 million unique rows into the table.

Attempting to re-insert a subset (10,000 rows) of the same data yielded the following results:

INSERT IGNORE: 38 seconds
INSERT ... ON DUPLICATE KEY UPDATE: 40 seconds
if (!rowExists("SELECT ...")) INSERT: <2 seconds

If those 10,000 rows aren't already present in the table:

INSERT IGNORE: 34 seconds
INSERT ... ON DUPLICATE KEY UPDATE: 41 seconds
if (!rowExists("SELECT ...")) INSERT: 21 seconds

So the conclusion must be if (!rowExists("SELECT ...")) INSERT is fastest by far - at least for this particular table configuration.

The missing test is if (rowExists("SELECT ...")){ UPDATE } else { INSERT }, but I'll assume INSERT ... ON DUPLICATE KEY UPDATE is faster for this operation.

For your particular case, however, I would go with INSERT IGNORE because (as far as I'm aware) it's an atomic operation and that'll save you a lot of trouble when working with threads.

106

answered Sep 28 '22 15:09

Woodgnome

Related questions
                            
                                Return order of MySQL SHOW COLUMNS
                            
                                Best way to store span on time in a MySQL database?
                            
                                High Number of MySQL Temporary Disk Tables
                            
                                MySQL: dates before 1970
                            
                                What type should I store IP addresses for MySQL?
                            
                                Count number of rows that are not within 10 seconds of each other
                            
                                MySQL and PDO: Could PDO::lastInsertId theoretically fail?
                            
                                How can you repair all tables in all databases from the MySQL command prompt when MYI file is corrupted or missing?
                            
                                Is it conceptually right to do a SELECT MAX(id) etc. for finding the last inserted row?
                            
                                Serialized array breaks on retrieval from database
                            
                                mysqldump not creating create database syntax
                            
                                How to count the number of groups returned by a group by?
                            
                                mysql count word in sql syntax [duplicate]
                            
                                Django MySQL distinct query for getting multiple values
                            
                                Selecting primary keys that do not have foreign keys in another table
                            
                                Wordpress update mysql table
                            
                                AUTO_INCREMENT and LAST_INSERT_ID
                            
                                SUM of history table in database to show user's total credit (reputation)
                            
                                Create view or use innerjoins?
                            
                                MySQL CONCAT("string",longtext) results in hex string

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

"Insert ignore" vs "select and insert"

Tags:

performance

database

optimization

mysql

insert

user3245050

People also ask

1 Answers

Woodgnome

Recent Activity

Donate For Us