Best default collation for a multilingual database

Tags:

sql-server

I am a bit confused about the default collations when creating a database. The data stored in the database will be in different languages. The main users of the database will be working in Spanish, but it will also be used in English, French and other languages. The default collation for Spanish is Modern_Spanish_CI_AS, while English, French, Italian and others default to Latin1_General_CI_AS. Which collation should I use, and are there any drawbacks to choosing one over the other?

Many thanks for your help. Regards,

Javier

447

asked Sep 06 '10 10:09

javier

1 Answer

A collation has two effects:

For non-Unicode data types (char, varchar, text), it determines the code page of the data, i.e. which characters you can and cannot store in the column or variable
For all character data types, it determines how data is sorted and compared, e.g. in ORDER BY and equality tests

To avoid problems with the first issue, always store and manipulate text as Unicode using the nchar/nvarchar data types, because then the collation's code page no longer limits which characters you can store. It requires more disk space, but it avoids some really awkward issues, so for most people it's probably a good tradeoff.
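As a rough illustration of the code page point, the sketch below stores the same Greek word in two varchar columns with different collations and in one nvarchar column. The table and column names are made up for the example, and it assumes the standard Windows collations Latin1_General_CI_AS and Greek_CI_AS are available. Characters that a varchar column's code page cannot represent are typically replaced (often with a question mark), while the nvarchar column keeps them.

```sql
-- Hypothetical temp table: same word, three different storage choices
create table #cp (
    latin_text   varchar(20)  collate Latin1_General_CI_AS, -- Western European code page
    greek_text   varchar(20)  collate Greek_CI_AS,          -- Greek code page
    unicode_text nvarchar(20)                               -- Unicode, no code page limit
)

insert into #cp (latin_text, greek_text, unicode_text)
values (N'Ελληνικά', N'Ελληνικά', N'Ελληνικά')

-- Compare what actually got stored in each column
select latin_text, greek_text, unicode_text from #cp

drop table #cp
```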

For the second issue, use the collation that makes the most sense for your database, i.e. which collation sorts and compares the data in the way that you want to do it most of the time? For example, if you know that case-sensitive comparisons will be important then Latin1_General_CS_AS might be a better choice.
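A minimal sketch of how the case (CI/CS) and accent (AI/AS) parts of a collation name change comparisons. The literals and column aliases are just for illustration; the COLLATE clause is applied to one side of each comparison so the result does not depend on your database default.

```sql
select
    -- case-insensitive: 'abc' and 'ABC' compare as equal
    case when N'abc' = N'ABC' collate Latin1_General_CI_AS
         then 'equal' else 'different' end as case_insensitive,
    -- case-sensitive: 'abc' and 'ABC' compare as different
    case when N'abc' = N'ABC' collate Latin1_General_CS_AS
         then 'equal' else 'different' end as case_sensitive,
    -- accent-insensitive: 'cancion' and 'canción' compare as equal
    case when N'cancion' = N'canción' collate Latin1_General_CI_AI
         then 'equal' else 'different' end as accent_insensitive,
    -- accent-sensitive: 'cancion' and 'canción' compare as different
    case when N'cancion' = N'canción' collate Latin1_General_CI_AS
         then 'equal' else 'different' end as accent_sensitive
```

For a database used mainly in Spanish, the accent setting is probably the one to think about first, since it decides whether a search for "cancion" also finds "canción".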

And you can always use COLLATE to specify the collation explicitly if you need more control over specific queries:

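-- Temporary table with a Unicode column
create table #t (name nvarchar(100))

insert into #t select N'Che'
insert into #t select N'Carlos'
insert into #t select N'Cruz'

-- Modern Spanish treats "ch" as two letters: Carlos, Che, Cruz
select name from #t order by name collate Modern_Spanish_CI_AS
-- Traditional Spanish treats "ch" as a single letter after "c": Carlos, Cruz, Che
select name from #t order by name collate Traditional_Spanish_CI_AS

drop table #t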

If you don't know how text data will be sorted or compared, and your users don't know either, then I would just stay with your default collation (and use Unicode!); in the worst case, you can always move the data to a new table with the correct collation. Books Online also has a lot of documentation on collations that is worth reading.
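A small sketch of both suggestions, under some assumptions: the first query only reports the current defaults, and the temp tables #people_old and #people_new are hypothetical stand-ins for a real table and its replacement. Moving nvarchar data between collations does not change the stored characters, only how they are later sorted and compared.

```sql
-- Check the server default and the current database's collation
select serverproperty('Collation') as server_collation,
       databasepropertyex(db_name(), 'Collation') as database_collation

-- Hypothetical existing table with the "wrong" collation
create table #people_old (name nvarchar(100) collate Latin1_General_CI_AS not null)
insert into #people_old (name) values (N'Che')
insert into #people_old (name) values (N'Carlos')
insert into #people_old (name) values (N'Cruz')

-- New table with the collation you actually want, then copy the rows across
create table #people_new (name nvarchar(100) collate Traditional_Spanish_CI_AS not null)
insert into #people_new (name)
select name from #people_old

-- ORDER BY now uses the new column collation without an explicit COLLATE
select name from #people_new order by name

drop table #people_old
drop table #people_new
```

On a real table you would also need to recreate indexes and constraints on the new table before swapping it in.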

196

answered Oct 23 '22 05:10

Pondlife
