How significant is the performance difference when joining on nvarchar versus on int

Tags:

I understand that join on nvarchar is slower because index is bigger as nvarchar using 2 bytes for each character but int is 4 bytes all the time. Is the join performance difference significant? Is there any strong reason to avoid join on nvarchar? I couldn't find any MSDN article about the topic.

278

asked Jun 26 '11 12:06

Andras Csehi

1 Answers

At least 8x CPU. This is the measurable increase in comparing nvarchar over varchar: unicode sorting and comparison rules are more complex that straight varchar.

What are the main performance differences between varchar and nvarchar SQL Server data types?
SQL Server uses high CPU when searching inside nvarchar strings

So, assuming varchar and int are equal (they aren't) nvarchar will have overhead compared to int

Then, byte for byte ('1234' vs 1234) you're comparing 10 bytes vs 4 bytes. This also means a wider key for less index and data entries per page = more IO.

Finally, if your nvarchar is more then 450 characters, you can't index it because index key is max 900 bytes wide.

161

answered Oct 24 '22 22:10

gbn

Related questions
                            
                                How can I convert a pyspark.sql.dataframe.DataFrame back to a sql table in databricks notebook
                            
                                Redshift: Executing a dynamic query from a string
                            
                                SQL LIKE in Spark SQL
                            
                                MySQL procedure's cursor stops after first iteration
                            
                                Reset Running Total based on another column
                            
                                How to group results to boolean value in PostgreSQL
                            
                                Sequelize hasMany, belongsTo, or both?
                            
                                How to return different result in the same query?
                            
                                Is it okay to have a lot of database views?
                            
                                How to drop IDENTITY property of column in SQL Server 2005
                            
                                Caching Strategy for queried data
                            
                                Is it better to filter a resultset using a WHERE clause or using application code?
                            
                                Liquibase drop constraint without knowing it's name
                            
                                SQL Server - pull X random records per state
                            
                                How to unit test an SQL query?
                            
                                Left Join that always includes null records
                            
                                JDBC Lock a row using SELECT FOR UPDATE, doesn't work
                            
                                SQL - Update with a CASE statement, do I need to repeat the same CASE multiple times?
                            
                                Data modelling draft/quote/order/invoice
                            
                                Update with sub select - How to handle NULL values?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How significant is the performance difference when joining on nvarchar versus on int

Tags:

sql

sql-server

sql-server-2005

sql-server-2008

Andras Csehi

People also ask

1 Answers

gbn

Recent Activity

Donate For Us