If I set the primary key to be <code>INT</code> type (<code>AUTO_INCREMENT</code>) or set it in <code>UUID</code>, what is the difference between these two in the database performance (<code>SELECT</code>, <code>INSERT</code> etc) and why?

<code>UUID</code> returns a universal unique identifier (hopefuly also unique if imported to another DB as well). To quote from MySQL doc (emphasis mine): <blockquote> A UUID is designed as a number that is globally unique in space and time. Two calls to UUID() are expected to generate two different values, even if these calls are performed on two separate computers that are not connected to each other. </blockquote> On the other hand a simply <code>INT</code> primary id key (e.g. AUTO_INCREMENT) will return a unique integer for the specific DB and DB table, but which is not universally unique (so if imported to another DB chances are there will be primary key conflicts). In terms of performance, there shouldn't be any noticeable difference using <code>auto-increment</code> over <code>UUID</code>. Most posts (including some by the authors of this site), state as such. Of course <code>UUID</code> may take a little more time (and space), but this is not a performance bottleneck for most (if not all) cases. Having a column as <code>Primary Key</code> should make both choices equal wrt to performance. See references below: <ol> <li>To <code>UUID</code> or not to <code>UUID</code>?</li> <li>Myths, <code>GUID</code> vs <code>Autoincrement</code></li> <li>Performance: <code>UUID</code> vs <code>auto-increment</code> in cakephp-mysql</li> <li><code>UUID</code> performance in MySQL?</li> <li>Primary Keys: <code>ID</code>s versus <code>GUID</code>s (coding horror)</li> </ol> (<code>UUID</code> vs <code>auto-increment</code> performance results, adapted from Myths, <code>GUID</code> vs <code>Autoincrement</code>) <img src="https://i.stack.imgur.com/hHrt7.png" alt="enter image description here"> <code>UUID</code> pros / cons (adapted from Primary Keys: <code>ID</code>s versus <code>GUID</code>s) <blockquote> <code>GUID</code> Pros <ul> <li>Unique across every table, every database, every server</li> <li>Allows easy merging of records from different databases</li> <li>Allows easy distribution of databases across multiple servers</li> <li>You can generate <code>ID</code>s anywhere, instead of having to roundtrip to the database</li> <li>Most replication scenarios require <code>GUID</code> columns anyway</li> </ul> <code>GUID</code> Cons <ul> <li>It is a whopping 4 times larger than the traditional 4-byte index value; this can have serious performance and storage implications if you're not careful</li> <li>Cumbersome to debug (<code>where userid='{BAE7DF4-DDF-3RG-5TY3E3RF456AS10}'</code>)</li> <li>The generated <code>GUID</code>s should be partially sequential for best performance (eg, <code>newsequentialid()</code> on SQL 2005) and to enable use of clustered indexes.</li> </ul> </blockquote> <h3>Note</h3> I would read carefully the mentioned references and decide whether to use <code>UUID</code> or not depending on my use case. That said, in many cases <code>UUID</code>s would be indeed preferable. For example one can generate <code>UUID</code>s without using/accessing the database at all, or even use <code>UUID</code>s which have been pre-computed and/or stored somewhere else. Plus you can easily generalise/update your database schema and/or clustering scheme without having to worry about <code>ID</code>s breaking and causing conflicts. In terms of possible collisions, for example using v4 UUIDS (random), the probability to find a duplicate within 103 trillion version-4 UUIDs is one in a billion.

The differences between INT and UUID in MySQL

1 Answers

UUID returns a universal unique identifier (hopefuly also unique if imported to another DB as well).

To quote from MySQL doc (emphasis mine):

A UUID is designed as a number that is globally unique in space and time. Two calls to UUID() are expected to generate two different values, even if these calls are performed on two separate computers that are not connected to each other.

On the other hand a simply INT primary id key (e.g. AUTO_INCREMENT) will return a unique integer for the specific DB and DB table, but which is not universally unique (so if imported to another DB chances are there will be primary key conflicts).

In terms of performance, there shouldn't be any noticeable difference using auto-increment over UUID. Most posts (including some by the authors of this site), state as such. Of course UUID may take a little more time (and space), but this is not a performance bottleneck for most (if not all) cases. Having a column as Primary Key should make both choices equal wrt to performance. See references below:

To UUID or not to UUID?
Myths, GUID vs Autoincrement
Performance: UUID vs auto-increment in cakephp-mysql
UUID performance in MySQL?
Primary Keys: IDs versus GUIDs (coding horror)

(UUID vs auto-increment performance results, adapted from Myths, GUID vs Autoincrement)

enter image description here

UUID pros / cons (adapted from Primary Keys: IDs versus GUIDs)

GUID Pros

Unique across every table, every database, every server

Allows easy merging of records from different databases

Allows easy distribution of databases across multiple servers

You can generate IDs anywhere, instead of having to roundtrip to the database

Most replication scenarios require GUID columns anyway

GUID Cons

It is a whopping 4 times larger than the traditional 4-byte index value; this can have serious performance and storage implications if you're not careful

Cumbersome to debug (where userid='{BAE7DF4-DDF-3RG-5TY3E3RF456AS10}')

The generated GUIDs should be partially sequential for best performance (eg, newsequentialid() on SQL 2005) and to enable use of clustered indexes.

Note

I would read carefully the mentioned references and decide whether to use UUID or not depending on my use case. That said, in many cases UUIDs would be indeed preferable. For example one can generate UUIDs without using/accessing the database at all, or even use UUIDs which have been pre-computed and/or stored somewhere else. Plus you can easily generalise/update your database schema and/or clustering scheme without having to worry about IDs breaking and causing conflicts.

In terms of possible collisions, for example using v4 UUIDS (random), the probability to find a duplicate within 103 trillion version-4 UUIDs is one in a billion.

answered Oct 04 '22 14:10

Nikos M.

Related questions
                            
                                MySQL pid ended (cannot start mysql)
                            
                                mysql equivalent data types
                            
                                Running migrations with Rails in a Docker container with multiple container instances
                            
                                Using LIKE vs. = for exact string match
                            
                                MySQL: select * from table where col IN (null, "") possible without OR
                            
                                MySQL: set field default value to other column
                            
                                failed to open stream: No such file or directory in [duplicate]
                            
                                Creating new database in DataGrip JetBrains
                            
                                What's the best way to store html code in mysql? [duplicate]
                            
                                How to program a MySQL trigger to insert row into another table?
                            
                                How to know if when using "on duplicate key update" a row was inserted or updated?
                            
                                What is the disadvantage to using a MySQL longtext sized field when every entry will fit within a mediumtext sized field?
                            
                                MySQL Group Results by day using timestamp
                            
                                Slick 3.0 bulk insert or update (upsert)
                            
                                "Authentication plugin 'caching_sha2_password'
                            
                                Getting time difference between two times in PHP [duplicate]
                            
                                Why INSERT IGNORE increments the auto_increment primary key?
                            
                                Set Auto Increment field start from 1000 in migration laravel 5.1
                            
                                Multiple and single indexes
                            
                                How to export MySQL schema with data?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

The differences between INT and UUID in MySQL

Tags:

performance

mysql

primary-key

孙为强

People also ask

1 Answers

Note

Nikos M.

Recent Activity

Donate For Us