Is it possible to have SQL Server convert collation to UTF-8 / UTF-16

Tags:

In a project I am working on my data is stored in SQL Server, with the collation Danish_Norwegian_CI_AS. The data is output'ed through FreeTDS and ODBC, to python that handles the data as UTF-8. Some of the characters, like å, ø and æ, are not being coded correctly, causing the project progress to grind to a halt.

I spent a couple of hours reading about the confusing world of encodings, collation and code-pages, and feel like I have gotten a better understanding of the entire picture.

Some of the articles I have read, makes me think that it would be possible to: Specify in the SQL select statement, that the collation data should be encoded to UTF-8 when it is output'ed.

The reason I am thinking this is possible is this article which shows an example of how to get to tables, with different collations, to play nice together.

Any pointers in the direction of converting collation to UTF-8 / UTF-16, would be greatly appreciated!

EDIT: I have read that SQL Server provides a unicode option through nchar, nvarchar and ntext, and that the other string variables char, varchar and text are coded according to set collation. I have also read that the above mentioned unicode options are coded in utf-16 variant ucs-2 (I hope I am remembering that right). So; in order to allow tables of locale collation and unicode, to play nice, there should be a conversion function, no?

714

asked May 16 '15 21:05

Rookie

1 Answers

It seems that SQL does not support UTF-8 (see here) but you can try changing the collation in the select like:

SELECT Account COLLATE SQL_Latin1_General_CP1_CI_AS
from Data

You can also strip the accents using this solution: How to remove accents and all chars <> a..z in sql-server?

Another solution could be casting your column to nvarchar

SELECT cast (Account as nvarchar) as NewAccount 
from Data

where Account is varchar on your initial table.

If for example you try:

SELECT cast(cast(N'ţ' as varchar) as nvarchar)

the end result will be "ţ"

157

answered Oct 28 '22 12:10

sbiz

Related questions
                            
                                How to test your query first before running it sql server
                            
                                SQL won't insert null values with BULK INSERT
                            
                                How to calculate Running Multiplication
                            
                                Passing DataTable to stored procedure as an argument
                            
                                What is the fastest way to clear a SQL table?
                            
                                What is the execution order of the PARTITION BY clause compared to other SQL clauses?
                            
                                Entity Framework Core 3.1 Return value (int) from stored procedure
                            
                                Remove the date/time in SQL script generated using SSMS?
                            
                                How do I monitor and find unused indexes in sql database
                            
                                Visual Studio - Open a SQL file with SQL Management Studio in an existing SSMS window?
                            
                                How to convert a byte[] into datetime in C#?
                            
                                replace null values in sql pivot
                            
                                select an xml type column in select query with group by SQL Server 2008
                            
                                SQL Server 100% CPU Utilization - One database shows high CPU usage than others
                            
                                SET DATEFIRST in FUNCTION
                            
                                Whats wrong with this SQL statement for table variable bulk insert
                            
                                What is the impact of WAITFOR on other processes and transactions?
                            
                                Connect Pentaho to MS SQL Server (Native)
                            
                                SQL Server project executing multiple script post deploy
                            
                                SQL Server for xml path add attributes and values

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is it possible to have SQL Server convert collation to UTF-8 / UTF-16

Tags:

sql-server

unicode

utf-8

collation

pyodbc

Rookie

People also ask

1 Answers

sbiz

Recent Activity

Donate For Us