Performance value of COMB guids

Tags:

Jimmy Nilsson discusses his COMB guid concept here. This concept is popular in NHibernate, among other circles, for its supposed performance value over standard GUIDs which are typically far more random.

However, in testing, this does not appear to be the case. Am I missing something?

Test case:

I have a table called temp (not a temp table, just a table named "temp") with 585,000 rows in it. I have a new table called Codes, and wish to copy all 585,000 code values from the temp table to the codes table. The test SQL I executed was:

set statistics time on;

truncate table codes;
DBCC DBREINDEX ('codes', '', 90);

insert into codes (codeid, codevalue)
select newid(), codevalue from temp

truncate table codes;
DBCC DBREINDEX ('codes', '', 90);

insert into codes (codeid, codevalue)
select CAST(CAST(NEWID() AS BINARY(10)) + CAST(GETDATE() AS BINARY(6)) AS UNIQUEIDENTIFIER), codevalue from temp

Performance with standard GUID values:

SQL Server Execution Times: CPU time = 17250 ms, elapsed time = 15735 ms.

(585000 row(s) affected)

Performance with COMB GUID values:

SQL Server Execution Times: CPU time = 17500 ms, elapsed time = 16419 ms.

(585000 row(s) affected)

What am I missing? the COMB GUID values resulted in slightly longer times, presumably because of the additional conversions. I thought the point was to reduce the insert time by semi-ordering the GUIDS using the date for the last 6 bytes, but the performance gain appears non-existent.

893

asked Jul 20 '09 19:07

Chris

2 Answers

I'd suggest that you're not seeing the order benefit because the target table has no PK. So, it's the conversion overhead you're seeing. IF it has a PK, the 585k rows must still be sorted on insert. How does SQL know it's semi-sorted?

Now, if it was 5,850 x 100 row inserts, then you may see some benefit because the new rows will go "at the end" not "in the middle" so reducing page splits and overhead.

I'd go further and say that the article is dated 2002, and is for SQL 2000, and has been overtaken by real life.

In SQL Server 2005 we have SEQUENTIAL GUIDs to allow strictly monotonic GUIDs to solve some issues. The GUID as PK has been done here too: recent example: INT vs Unique-Identifier for ID field in database with 3rd party links.

If an ORM dictates GUID as a PK rather than a natural key or standard int-based surrogate key, that's a severe limitation of the ORM. And a case of the client tail wagging the database dog.

119

answered Sep 19 '22 23:09

gbn

I second that you'll see differences only when you have indexes (PK, FK or other kind of indexes, clustered or not clustered) on the Guid colume, because cost of standard guid versus newguid or comb guid is due to the high cost of re-ordering the index data every time an insert is performed.

See my question in which I corroborate this with some real life data from both SQL Server and Oracle.

answered Sep 21 '22 23:09

massimogentilini

Related questions
                            
                                Good Resources for Relational Database Design [closed]
                            
                                NHibernate update on single property updates all properties in sql
                            
                                Can I define an in-cycle variable in T-SQL SELECT (like LET in LINQ)?
                            
                                Best practices for inserting/updating large amount of data in SQL Server 2008
                            
                                SQL How do I query a many-to-many relationship
                            
                                Ideal database for geo (map) data
                            
                                Conditional Where Clause in SQL Query
                            
                                SQL select rows with only a certain value in them
                            
                                JPA @ManyToMany join table indexing
                            
                                Postgres: order data by part of string
                            
                                H2 database CREATE TABLE with constraint
                            
                                IF function in PostgreSQL as in MySQL
                            
                                Set limit to array_agg()
                            
                                How to pass multiple values to single parameter in stored procedure
                            
                                Three dimensional database table
                            
                                Do I need a database reference for a linked server in a SQL Server database project?
                            
                                Oracle sql merge to insert and delete but not update
                            
                                dbms_metadata.get_ddl not working
                            
                                How do I use PostgreSQL JSON(B) operators containing a question mark "?" via JDBC
                            
                                Select constant in JOOQ union

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Performance value of COMB guids

Tags:

performance

sql

guid

sql-server

tsql

Chris

People also ask

2 Answers

gbn

massimogentilini

Recent Activity

Donate For Us