Using GUIDs in Primary Keys / Clusted Indexes

Tags:

I'm fairly well versed in SQL server performace but I constanly have to argue down the idea that GUIDs should be used as the default type for Clusterd Primary Keys.

Assuming that the table has a fairly low amount of inserts per day (5000 +/- rows / day), what kind of performace issues could we run into? How will page splits affect our seek performance? How often should I reindex (or should I defrag)? What should I set the fill factors to (100, 90, 80, ect)?

What if I were inserting 1,000,000 rows per day?

I apologize beforhand for all of the questions, but i'm looking to get some backup for not using GUIDs as our default for PKs. I am however completely open to having my mind changed by the overwehlming knowledge from the StackOverflow user base.

217

asked Sep 24 '09 03:09

NTDLS

3 Answers

If you are doing any kind of volume, GUIDs are extremely bad as a PK bad unless you use sequential GUIDs, for the exact reasons you describe. Page fragmentation is severe:

                 Average                    Average
                 Fragmentation  Fragment    Fragment   Page     Average 
Type             in Percent     Count       Size       Count    Space Used

id               4.35           7           16.43      115      99.89
newidguid        98.77          162         1          162      70.90 
newsequentualid  4.35           7           16.43      115      99.89

And as this comparison between GUIDs and integers shows:

Test1 caused a tremendous amount of page splits, and had a scan density around 12% when I ran a DBCC SHOWCONTIG after the inserts had completed. The Test2 table had a scan density around 98%

If your volume is very low, however, it just doesn't matter that much.

If you do really need a globally unique ID but have high volume (and can't use sequential IDs), just put the GUIDs in an indexed column.

answered Oct 21 '22 19:10

Rex M

Drawbacks of using GUID as primary key:

No meaningful ordering, means indexing doesn't give performance boost as it does with an integer.
Size of a GUID 16 bytes, versus 2, 4 or 8 bytes for an integer.
Very difficult for humans to remember, so no good as a reference id.

Advantages:

Allow non-guessable primary keys that can therefore be less dangerous when displayed in a web page query string or in the application.
Useful in Databases that don't provide an auto increment or identity data type.
Useful when you need to join data between two disparate data sources across platforms or environments.

I thought the decision as to whether to use GUIDs was pretty simple, but maybe I'm unaware of other issues.

answered Oct 21 '22 21:10

Ash

With such a low inserts per day, I doubt that page splitting should be a significant factor. The real question is how does 5,000 compares with the existing row count, as this would be the main information needed to decide on an appropriate initial fill factor to deffer splits.

This said, I'm personally not a big fan of GUIDs. I understand that they can serve well in some contexts but in many cases they are just "in the way" [of efficiency, of ease of use, of ...]

I find the following questions useful to narrow down on deciding whether GUID should be used or not.

Will the PK be shared/published ? (i.e. will it be used beyond its internal use within SQL, will applications need these keys in a somewhat persistent fashion? Will users somehow see these keys?
Could the PK be used to help merge disparate data sources ?
Does the table have a primary -possibly composite- made from column(s) in the data ? What is the size of this possible this key
How do the primary keys sort? If composite, are the first few columns selective ?

answered Oct 21 '22 19:10

mjv

Related questions
                            
                                Add filestream to an existing table column
                            
                                SQL Server 2016 Management Studio - missing 'Edit top 200' options
                            
                                How are Azure BACPAC backup files different than SQL Server BAK files?
                            
                                How to apply WITH (NOLOCK) to an entire query
                            
                                DataGrip / DBeaver cannot resolve table / column names
                            
                                Show SQL result in horizontal format
                            
                                How to access Temp Table in the stored procedure which is created in another stored procedure?
                            
                                C# ASP.NET Core building dynamic forms
                            
                                Powershell restore SQL Server database to new database
                            
                                EF Core 2.0 Migration - dotnet ef migrations 'script' command outputting blank file
                            
                                Entity Framework - The object 'PK_AspNetUserTokens' is dependent on column 'UserId'
                            
                                Accumulating previous rows with grouping
                            
                                Can SQL Begin Try/Catch be lying to me (in the profiler)?
                            
                                Convert SQL Query to Hibernate Criteria and Projections
                            
                                What does (Synchronized) after the database name mean in SSMS.
                            
                                Is it possible to prevent UPDATE or DELETE statements being executed without a WHERE clause?
                            
                                STIntersection result is STIntersects = 0
                            
                                How should I approach migrating data from a "bad" database design to a usable design?
                            
                                In a Data Warehouse scenario is there any disadvantage to using WITH(NOLOCK)
                            
                                Database design to hold a person's information that changes with time?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Using GUIDs in Primary Keys / Clusted Indexes

Tags:

sql-server

uniqueidentifier

sql-server-performance

NTDLS

People also ask

3 Answers

Rex M

Ash

mjv

Recent Activity

Donate For Us