I'm considering altering some tables to use nvarchar(50) as primary key instead of an int primary key. Using an int ID for a key really is irrelevant data, it's the string I'm interested in. What sort of performance hit will occur, or where do you research this? Other than cut and try that is.
nvarchar [ ( n | max ) ]: n defines the string size in byte-pairs and can be a value from 1 through 4,000. max indicates that the maximum storage size is 2^30-1 characters (2 GB). The storage size is two times n bytes + 2 bytes.
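As a quick sanity check on that storage formula, DATALENGTH reports the bytes a value actually occupies (the variable below is purely illustrative):

    -- N'Alice' is 5 characters; nvarchar stores 2 bytes per character
    DECLARE @name nvarchar(50) = N'Alice';
    SELECT DATALENGTH(@name) AS bytes_used,      -- 10 (5 characters * 2 bytes)
           LEN(@name)        AS character_count; -- 5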
Use varchar when the sizes of the column vary considerably, varchar(max) when the string length might exceed 8,000 bytes, char when the column size is fixed, and nvarchar when you need to store Unicode or multilingual data.
In some instances, an entity will have more than one attribute that could serve as a primary key. Any key, or minimal set of attributes, that could be a primary key is called a candidate key. Once candidate keys are identified, choose one, and only one, primary key for each entity.
Every table can have (but does not have to have) a primary key. The column or columns defined as the primary key ensure uniqueness in the table; no two rows can have the same key. The primary key of one table may also help to identify records in other tables, and be part of the second table's primary key.
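For example, in the following minimal sketch (table names are hypothetical), the parent's primary key also forms part of the child's composite primary key:

    CREATE TABLE Orders (
        OrderID int NOT NULL PRIMARY KEY
    );

    CREATE TABLE OrderLines (
        -- OrderID identifies the parent order and is half of this table's key
        OrderID    int NOT NULL REFERENCES Orders (OrderID),
        LineNumber int NOT NULL,
        PRIMARY KEY (OrderID, LineNumber)
    );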
You have hit upon one of the major "holy wars" of database design. The debate you're referring to is the "surrogate vs. natural key" argument, which has been raging for as long as there have been RDBMSs (as near as I can tell).
The debate essentially boils down to whether a representative key (surrogate, for example an IDENTITY column) should be used versus using the actual data that uniquely describes a record (natural key).
I will say that there is no "right" answer. Performance measures are an artifact of the platform, and should be assessed by experimentation, but performance is not likely to be the major concern.
What I consider to be the primary argument for surrogate keys is the immutability of primary keys. If you choose to use a natural key, you give up the option of altering that key after it is established, and you accept the risk that it may become non-unique at some point in the future. For those reasons, I typically (not always) use surrogate keys for most of my tables.
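To make the immutability point concrete, here is a hypothetical natural-key schema (names invented for illustration). If the key value ever has to change, the change ripples into every table that references it, whereas a surrogate key would let the business value be corrected in one place:

    CREATE TABLE Department (
        DeptCode nvarchar(50) NOT NULL PRIMARY KEY
    );

    CREATE TABLE Employee (
        EmployeeID int IDENTITY(1,1) NOT NULL PRIMARY KEY,
        DeptCode   nvarchar(50) NOT NULL
            REFERENCES Department (DeptCode) ON UPDATE CASCADE
    );

    -- Renaming the department now rewrites the key in every referencing row
    UPDATE Department SET DeptCode = N'Engineering' WHERE DeptCode = N'Eng';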
However, as I mentioned, there is a very long-standing debate, filled with discussions of indexing strategies and normal-form adherence, to be read if you are so inclined.
I would Google "surrogate vs. natural keys". Here are a few links to get you started:
Systems Engineering and RDBMS
Techrepublic
Tony Rogerson's blog
Hope this helps.
Consider using a surrogate key (an int primary key) as the primary key/clustered index key. The trouble with using nvarchar(50) as a primary key/clustered index key is that your table will be physically ordered by that key, which means it is likely to become highly fragmented, and every other index on the table will carry the burden of referencing this wide key.
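If you would rather measure that fragmentation than take it on faith, one option is to compare avg_fragmentation_in_percent for a clustered index on an int IDENTITY column versus one on the nvarchar(50) column after a realistic volume of inserts (dbo.MyTable is a placeholder):

    SELECT i.name AS index_name,
           ips.avg_fragmentation_in_percent,
           ips.page_count
    FROM sys.dm_db_index_physical_stats(DB_ID(), OBJECT_ID(N'dbo.MyTable'),
                                        NULL, NULL, 'LIMITED') AS ips
    JOIN sys.indexes AS i
      ON i.object_id = ips.object_id
     AND i.index_id  = ips.index_id;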
Another issue is that you will presumably need to JOIN to other tables on this value, which becomes a more expensive operation as the size of the key grows.
I think there are very few situations where an nvarchar(50) primary key would make sense.
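A common compromise, sketched here with invented names, is to keep a narrow int surrogate as the clustered primary key while still enforcing uniqueness on the string with a nonclustered unique constraint. The string stays searchable, but other indexes and foreign keys only have to carry the 4-byte key:

    CREATE TABLE dbo.Customer (
        CustomerID   int IDENTITY(1,1) NOT NULL,
        CustomerCode nvarchar(50)      NOT NULL,
        CONSTRAINT PK_Customer      PRIMARY KEY CLUSTERED (CustomerID),
        CONSTRAINT UQ_Customer_Code UNIQUE NONCLUSTERED (CustomerCode)
    );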
Generally, primary keys should be surrogates UNLESS you have a small, immutable natural key. An SSN, for example, could arguably be considered a natural immutable key.
For performance, I normally ask the following:
how many rows? 1,000, 1,000,000, or 10,000,000?
what server is it sitting on? (memory, disk space)
I would profile it and then see. In my experience, the bottleneck is usually not the database but poorly written or badly deployed code.
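If you do want a quick measurement before committing, one lightweight approach (the table and column names below are placeholders) is to compare logical reads and CPU time for the same join performed on the int key and on the nvarchar(50) key:

    SET STATISTICS IO ON;
    SET STATISTICS TIME ON;

    SELECT COUNT(*)
    FROM dbo.Orders   AS o
    JOIN dbo.Customer AS c
      ON c.CustomerID = o.CustomerID;  -- re-run joining on the nvarchar(50) columns to compare

    SET STATISTICS IO OFF;
    SET STATISTICS TIME OFF;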