 

How to convert a Unicode character to its ASCII equivalent

Here's the problem:

In C# I'm getting information from a legacy ACCESS database. .NET converts the content of the database (in the case of this problem a string) to Unicode before handing the content to me.

How do I convert this Unicode string back to its ASCII equivalent?


Edit
Unicode char 710 is indeed MODIFIER LETTER CIRCUMFLEX ACCENT. Here's the problem a bit more precisely (the sketch below shows how to verify the code points):
 -> (Extended) ASCII character ê (Extended ASCII 136) was inserted in the database.
 -> Either Access or the reading component in .NET converted this to U+02C6 U+0065
    (MODIFIER LETTER CIRCUMFLEX ACCENT + LATIN SMALL LETTER E).
 -> I need the (Extended) ASCII character 136 back.
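
For reference, a minimal sketch (not from the original post) that dumps the code points actually handed back by .NET; the string literal below is a stand-in for the value read from Access:

// assumes "using System;"
string valueFromAccess = "\u02C6e";   // stand-in for the string read from the database
foreach (char c in valueFromAccess)
    Console.WriteLine("U+{0:X4}", (int)c);
// prints U+02C6 (MODIFIER LETTER CIRCUMFLEX ACCENT) followed by U+0065 (e)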


Here's what I've tried (I see now why this did not work...):
string myInput = Convert.ToString(Convert.ToChar(710)); // "ˆ", i.e. U+02C6
byte[] asBytes = Encoding.ASCII.GetBytes(myInput);      // assumes "using System.Text;"

But this does not result in 94; it results in a byte with value 63...
Here's a new try but it still does not work:

byte[] bytes = Encoding.ASCII.GetBytes("ê");
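
(For reference, a small sketch of what is going on here: Encoding.ASCII silently substitutes '?' for anything above U+007F, so both attempts come back as byte 63 rather than 94 or 136.)

// assumes "using System;" and "using System.Text;"
byte[] asciiBytes = Encoding.ASCII.GetBytes("\u02C6e");  // "ˆe", as returned by .NET
Console.WriteLine(asciiBytes[0]);                        // 63, i.e. '?', the replacement character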


Solution
Thanks to both csgero and bzlm for pointing in the right direction; I solved the problem as described in my answer below.
asked Sep 26 '08 by Huppie


People also ask

How do I convert Unicode to ASCII?

In general you can't convert from Unicode to ASCII: almost every Unicode character cannot be expressed in ASCII, and those that can have exactly the same code points in ASCII as in UTF-8, which is probably what you have.

What function converts a character to its Unicode equivalent?

The Python ord() function converts a character into an integer that represents the Unicode code of the character.
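
In C#, the language of this question, the equivalent is a plain cast from char to int; a minimal sketch:

char c = '\u02C6';            // MODIFIER LETTER CIRCUMFLEX ACCENT
Console.WriteLine((int)c);    // 710, the character's Unicode code point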

How does Unicode relate to ASCII?

Unicode is the universal character encoding used to process, store, and facilitate the interchange of text data in any language, while ASCII is used for the representation of text such as symbols, letters, digits, etc.

How do I convert a character to its ASCII code?

Very simple: just cast your char to an int.

char character = 'a';
int ascii = (int) character;

In your case, you need to get the specific character from the string first and then cast it.


2 Answers

Okay, let's elaborate. Both csgero and bzlm pointed in the right direction.

Because of bzlm's reply I looked up the Windows-1252 page on Wikipedia and found that it's called a code page. The Wikipedia article for Code page states the following:

No formal standard existed for these ‘extended character sets’; IBM merely referred to the variants as code pages, as it had always done for variants of EBCDIC encodings.

This led me to codepage 437:

In ASCII-compatible code pages, the lower 128 characters maintained their standard US-ASCII values, and different pages (or sets of characters) could be made available in the upper 128 characters. DOS computers built for the North American market, for example, used code page 437, which included accented characters needed for French, German, and a few other European languages, as well as some graphical line-drawing characters.

So, code page 437 is the code page I was calling 'extended ASCII'. It has ê as character 136, and the other characters I looked up also seem right.

csgero came with the Encoding.GetEncoding() hint, which I used to create the following statement that solves my problem:

byte[] bytes = Encoding.GetEncoding(437).GetBytes("ê");
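
For completeness, a slightly fuller sketch of the same approach. Note (my addition, not from the original answer): on .NET Core / .NET 5+ the legacy code pages are not available by default and must first be registered via the System.Text.Encoding.CodePages package; the original .NET Framework code does not need that line.

// assumes "using System;" and "using System.Text;"
// Encoding.RegisterProvider(CodePagesEncodingProvider.Instance); // uncomment on .NET Core / .NET 5+
byte[] cp437Bytes = Encoding.GetEncoding(437).GetBytes("ê");
Console.WriteLine(cp437Bytes[0]);   // 136, the original 'extended ASCII' value for ê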
answered Sep 19 '22 by Huppie


You cannot use the default ASCII encoding (Encoding.ASCII) here; you must create the encoding for the appropriate code page using Encoding.GetEncoding(...). You might try code page 1252, which is a superset of ISO 8859-1.
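
A quick sketch of that suggestion (the byte value assumes Windows-1252, where ê is 0xEA), which also shows why code page 437 rather than 1252 was the right choice for this particular database:

// assumes "using System;" and "using System.Text;"
byte[] cp1252Bytes = Encoding.GetEncoding(1252).GetBytes("ê");
Console.WriteLine(cp1252Bytes[0]);  // 234 (0xEA) in Windows-1252, not the 136 the asker needed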

answered Sep 19 '22 by csgero