I would like optimize the performance of a database that my team is using for an application. I have been looking for areas to add foreign keys, and in turn index those columns to improve the performance of joins. However, many of our tables are joined on an id that is a <code>GUID</code> type, generated upon insertion of an item, and the data associated with that item in other tables is generally has column <code>item_id</code> containing the GUID. I have read that adding clustered indexes to GUID type columns is a very bad decision because the index will need to be constantly reconstructed in order to be effective. However, I was wondering, is there any detriment to utilizing a non-clustered index in the scenario described above? Or is it reasonable to assume that it would help performance? I can provide more information if needed.

An index on a <code><anytype></code> is by far the best option you have to improve joins and singleton lookups. Lacking this index the query will always have to scan the entire table end-to-end with (often) abysmal performance results and concurrency gone down the drain. It is true that <code>uniqueidentifier</code> makes poor choice for indexes for the reasons you mention, but by no means does that implies that you should not create these indexes. Changing the data type to INT or BIGINT would be advisable, if possible. Using <code>NEWSEQUENTIALID()</code> or <code>UuidCreateSequential</code> to generate them would help with fragmentation issues. If all alternatives fail you may have to do index maintenance (Rebuild, reorganize) operations more often than for other indexes. But by no means do any of these drawbacks outweigh the benefit of having the index in the first place!

Use of non-clustered index on guid type column in SQL Server

Tags:

optimization

sql-server

tsql

indexing

foreign-keys

I would like optimize the performance of a database that my team is using for an application.

I have been looking for areas to add foreign keys, and in turn index those columns to improve the performance of joins. However, many of our tables are joined on an id that is a GUID type, generated upon insertion of an item, and the data associated with that item in other tables is generally has column item_id containing the GUID.

I have read that adding clustered indexes to GUID type columns is a very bad decision because the index will need to be constantly reconstructed in order to be effective. However, I was wondering, is there any detriment to utilizing a non-clustered index in the scenario described above? Or is it reasonable to assume that it would help performance? I can provide more information if needed.

685

asked Dec 10 '12 14:12

Christian

2 Answers

An index on a <anytype> is by far the best option you have to improve joins and singleton lookups. Lacking this index the query will always have to scan the entire table end-to-end with (often) abysmal performance results and concurrency gone down the drain.

It is true that uniqueidentifier makes poor choice for indexes for the reasons you mention, but by no means does that implies that you should not create these indexes. Changing the data type to INT or BIGINT would be advisable, if possible. Using NEWSEQUENTIALID() or UuidCreateSequential to generate them would help with fragmentation issues. If all alternatives fail you may have to do index maintenance (Rebuild, reorganize) operations more often than for other indexes. But by no means do any of these drawbacks outweigh the benefit of having the index in the first place!

answered Oct 23 '22 05:10

Remus Rusanu

Two performance:
- insert
- select

An index should improve select

An index will slow slow down insert.
If the inserts are in order the index does not fragment.
If the inserts are not in order the index will fragment.
Index fragmentation slows down both insert and select.
Via maintenance can defragment the index.

Adding an non-clustered index to the column that references a FK will help the joins.
Since that column is most likely not ordered that fact it is a GUID is of no loss.

On the FK table itself is where GUID is not a good candidate for a PK (clustered index).
With GUID as PK that index fragments on insert.
Int or sequential ID are better candidates as they would not fragment the PK on insert.
But no big deal just defragment those tables.

answered Oct 23 '22 05:10

paparazzo

Related questions
                            
                                Million inserts: SqlBulkCopy timeout
                            
                                How to automate report delivery in SSRS
                            
                                Preserve SQL Indexes While Altering Column Datatype
                            
                                Creating stored procedure with declare and set variables
                            
                                Decimal(19,4) or Decimal(19.2) - which should I use?
                            
                                JSON without array wrapper on lower levels
                            
                                Database triggers
                            
                                Can someone explain DBCC DROPCLEANBUFFERS?
                            
                                Using Full-Text Search in SQL Server 2008 across multiple tables, columns
                            
                                SQL Server, C#: Timeout exception on Transaction Rollback
                            
                                Stop SQL Server from running until needed [closed]
                            
                                Why CTE (Common Table Expressions) in some cases slow down queries comparing to temporary tables in SQL Server
                            
                                Is SQL Server Bulk Insert Transactional?
                            
                                Sql Server Services - Overview anyone?
                            
                                Average of grouped rows in Sql Server
                            
                                How to find the name of stored procedure, based on table name search, using SQL Server 2008?
                            
                                Invoke-SqlCmd doesn't return long string?
                            
                                SQL Data Compare - Some tables missing
                            
                                SSIS Catalog - All Executions report only shows #Error in all headings and hyperlinks
                            
                                How to change a normal column to "computed" column

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With