Can I recover international characters mistakenly stored in a varchar field?

Tags: sql-server, sql-server-2000, unicode, utf-8

My client has an old MS SQL 2000 database that uses varchar(50) fields to store names. He tried to use this database to capture some data (via a web form). Some of the form-fillers are from other countries, and the varchar fields went nutty when some of these folks entered their names. Is it possible to recover the data somehow? Maybe by guessing what the character should be based on what it resolved to in ASCII/varchar and the country the person is from? Some of the data:

Name / Country / First or Last Name?
JiÅ™Ã / CZE / F
TorbjÃ¶rn / FIN / F
HuszÃ¡r / HUN / L
JÃ¼rgen / DEU / F
MÃ¼ller / CHE / L
BumbÃ¡lkovÃ¡ / CZE / L
DoleÅ¾al / CZE / L
Loïc / DEU / L

By the way, the web form specified this content-type:

<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

asked Oct 28 '08 00:10

Chris

1 Answer

Working from the fifth example, MÃ¼ller:

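Ã is character 195 (hex C3) and ¼ is character 188 (hex BC). Both are above 127, so strictly speaking they are not ASCII; they are code points in a single-byte code page such as Latin-1 or Windows-1252.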

I'd guess that MÃ¼ller is meant to be Müller.

If this is UTF-8, then based on the description at http://en.wikipedia.org/wiki/UTF-8#Description:

We've got C3 BC = 1100 0011 1011 1100

Applying the UTF-8 mapping:

(110) 00011 (10) 11 1100

0000 0000 1111 1100

00FC which is Unicode ü

U+00FC (see http://en.wikipedia.org/wiki/Latin_characters_in_Unicode)
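The bit shuffling above can be sketched in a few lines of Python. This is only an illustration of the two-byte case (110xxxxx 10xxxxxx); the function name `decode_two_byte_utf8` is made up for this example.

```python
def decode_two_byte_utf8(lead: int, cont: int) -> int:
    """Return the code point encoded by a two-byte UTF-8 sequence."""
    # A two-byte sequence has the shape 110xxxxx 10xxxxxx.
    if lead & 0b1110_0000 != 0b1100_0000:
        raise ValueError("lead byte is not of the form 110xxxxx")
    if cont & 0b1100_0000 != 0b1000_0000:
        raise ValueError("continuation byte is not of the form 10xxxxxx")
    # Keep the 5 payload bits of the lead byte and the 6 of the continuation byte.
    return ((lead & 0b0001_1111) << 6) | (cont & 0b0011_1111)


code_point = decode_two_byte_utf8(0xC3, 0xBC)
print(f"U+{code_point:04X} {chr(code_point)}")  # U+00FC ü
```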

Seems to me that you could work through this programmatically.
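As a rough sketch of what "programmatically" could look like, assuming the varchar column uses Windows code page 1252 (the question doesn't say which collation is in use, so that is a guess): turn each stored character back into its single byte, then decode the bytes as UTF-8. The helper name `repair_mojibake` is hypothetical.

```python
def repair_mojibake(stored: str, codepage: str = "cp1252") -> str:
    """Undo UTF-8 bytes that were misread as single-byte characters."""
    try:
        # Step 1: recover the original bytes. Step 2: read them as UTF-8.
        return stored.encode(codepage).decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # Not a clean UTF-8 round trip: leave the value alone for manual review.
        return stored


for name in ["MÃ¼ller", "TorbjÃ¶rn", "DoleÅ¾al", "Loïc"]:
    print(name, "->", repair_mojibake(name))
```

With the sample data this gives Müller, Torbjörn and Doležal. Loïc comes back unchanged, because its bytes are not valid UTF-8 and so it was presumably stored correctly in the first place. Note that repaired values such as Doležal contain characters that don't exist in code page 1252, so they would need to be written to an nvarchar column.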

Now solving the first example:

The stored value JiÅ™Ã actually has one more character at the end that is not shown: byte AD, which is an invisible soft hyphen in Windows-1252.

Ignoring the Ji, which is correct, the remaining bytes are:

C5 99 C3 AD

(110)0 0101 (10)01 1001 (110)0 0011 (10)10 1101

0159 00ED

ří
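The same result can be checked quickly in Python. This sketch assumes code page 1252 (where ™ is byte 99) and writes the invisible soft hyphen as an explicit escape so it can't get lost in copy and paste.

```python
# Stored text after "Ji", with the soft hyphen (U+00AD) spelled out.
stored_tail = "Å™Ã\u00ad"

raw = stored_tail.encode("cp1252")  # assumption: the column uses code page 1252
print(raw.hex(" ").upper())         # C5 99 C3 AD
print(raw.decode("utf-8"))          # ří
```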

So the name is: Jiří. Wikipedia says that special r is Czech and so is the i. Furthermore if I google Jiří (http://www.google.com/search?q=Ji%C5%99%C3%AD&ie=utf-8&oe=utf-8) I get plenty of hits. We're on a winner here.

The second example, TorbjÃ¶rn, maps nicely to Torbjörn which sounds convincing.

IMHO there's no great need for human checking of these; they seem to just work.

answered Sep 21 '22 14:09

8 revs, 2 users 98%
