When using UUIDs, should I also use AUTO_INCREMENT?

Tags:

We're building a new web app that will have an offline iPad/Android app version on a number of local devices that will involve inserts of new data. As such we require the use of UUIDs to allow for the necessary two-way synchronization with the master database. For this we'll be storing the UUID as a BINARY(16) primary key.

The problem I've learned after researching is that the time required for non-sequential primary key inserts will increase over time and that these inserts will lead to fragmentation (as answered here). The benefit to AUTO_INCREMENT is that new rows will usually just be added to the end of the table and so will not run into the speed problems with UUIDs.

My question is whether or not it is a better idea to use an AUTO_INCREMENT column as the primary key and then have the UUID column as a non-null unique index? Presumably this will have the speed benefits of sequential inserts whilst retaining the necessary UUIDs required for synchronizing distributed databases.

The one issue I can see with this is that the UUID needs to be used as a reference (using foreign key constraints) to other tables (i.e. a list of problems attached to an inspection which in turn is attached to a site, all of which are involved in inserts and so all of which require UUIDs). Semantically, it makes more sense for the primary key to be the reference, but as its a distributed system we can't use AUTO_INCREMENTS for these. Are there drawbacks to using a (non-null) unique index, rather than primary key, for these references (and, of course, the JOINs that will come with them)?

It might also be worth noting that the master (online) database uses MySQL (InnoDB) and the distributed (offline) databases use SQLite.

Edit:

Considering that it is perhaps better to have the UUID as a primary key (as that's semantically what it is), would I gain the benefit of sequential inserts if I set the UUID as a primary key and the AUTO_INCREMENT column as a non-null unique index? Or is it only the primary key that is of relevance when determining where to insert a new row?

898

asked Mar 12 '13 11:03

MichaelRushton

2 Answers

Using autoincrements as primary plus an uuid column is a valid model, but you would still have to struggle with some problems the autoincrements brings, it all depends on how you do the synchros.

Anyway I've been working with uuid's as primary keys (my current database have half a million records) and it's still pretty fast, it only slow downs a bit on the inserts, but unless you have very high volumes of inserts daily it shouldn't scare you.

If you use Sql-Server another solution you could have a look at is the Sequential UUIDs, which have a slightly greater collision chances than normal UUID's, but the absolute collision chances are still pretty low, and as they are partially sequential that covers the problems with the fragmentation.

answered Oct 21 '22 15:10

aleation

Once you have a big distributed data warehouse, if you use UUID or GUID as unique key and use it in join later on, it is not good.

Instead of using UUID or GUID, please create sequential surrogate key in your master database or in your data pipeline.

Share our project experience as a reference. We have 300 Billion records saved in parallel data warehouse, in our system, auto incremental key even not supported. We use 8 bytes bigint as primary key (actually unique key in our system is not supported either, but that's not affect logical uniqueness), when we processing file and load file, we use 3 bytes to generate file ID, which is 2^24 files, we have about 2,000 files need to load per day, so, 2^24 can support about 25 years, if it is not wrong.

We use the rest of 4 bytes as row id, which is 4 billion rows, we don't have 4 billion rows in any file. We reserve 1 byte. During the ETL processing, we only need to track the file ID in the master database, which support auto incremental ID, when we need to generate record ID when processing file, we combine FileID+reserve 1 byte+4 bytes rowID.

answered Oct 21 '22 16:10

ljh

Related questions
                            
                                mysql circular dependency in foreign key constraints
                            
                                How to remove bad characters that are not suitable for utf8 encoding in MySQL?
                            
                                mongodb equivalent of SELECT field AS `anothername`
                            
                                How to merge local and live databases?
                            
                                MySQL: Split comma separated list into multiple rows
                            
                                How to determine if a column is unsigned?
                            
                                How To Left Join 2 Tables On 2 Different Databases?
                            
                                Two Generated Values in Doctrine
                            
                                Low-priority query in Mysql
                            
                                Operation not allowed when innodb_forced_recovery > 0 [SqlYog]
                            
                                MySQL error 2013
                            
                                How do I determine maximum transaction size in MySQL?
                            
                                Solving "MySQL server has gone away" errors
                            
                                Package libapache2-mod-auth-mysql is not available
                            
                                How to join multiple tables using CakePHP 3?
                            
                                How to import .xls file to .sql with a foreign key
                            
                                How to create a sitemap using PHP & MySQL
                            
                                How do I use an IF statement in an MySQL join query?
                            
                                APACHE Crashing: Parent: child process exited with status 3221225477 -- Restarting
                            
                                SQL Select only rows where multiple relationships exist

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

When using UUIDs, should I also use AUTO_INCREMENT?

Tags:

sql

uuid

sqlite

mysql

auto-increment

MichaelRushton

People also ask

2 Answers

aleation

ljh

Recent Activity

Donate For Us