C# - Comparing strings of different encodings

Using C#, I fetch a TextBox.Text value from an .ascx page. When I compare that value for equality with a regular string object inside a LINQ query, the comparison always returns false.

I have come to the conclusion that they are differently encoded, but have so far had no luck in converting or comparing them.

string docname = "Testdoc 1.docx"; // regular string created in C#
string fetchedVal = ((TextBox)e.Item.FindControl("txtSelectedDocs")).Text; // UTF-8

The two strings above look identical when printed, but their byte[] representations differ, which I assume is due to the encoding.

I've tried a lot of different things, such as:

System.Text.Encoding.Default.GetString(utf8.GetBytes(fetchedVal));

but that will return the value "TestdocÂ 1.docx".

If I instead try

System.Text.Encoding.Default.GetString(System.Text.Encoding.Default.GetBytes(fetchedVal));

it returns "Testdoc 1.docx" but an Equals()-check still returns false.

I have also tried the following, which seems to be the recommended approach, but with no luck:

byte[] utf8Bytes = Encoding.UTF8.GetBytes(fetchedVal);
byte[] unicodeBytes = Encoding.Convert(Encoding.UTF8, Encoding.Unicode, utf8Bytes);
string fetchedValConverted = Encoding.Unicode.GetString(unicodeBytes);

The culprit appears to be the whitespace, because when examining the byte sequence it's always the seventh byte that differs.

How do you properly convert from UTF-8 to default string encoding in C#?

241

asked Sep 29 '14 15:09

Daniel B

1 Answer

Strings don't have encodings or byte arrays. Encodings only come into play when you convert a string into a byte array; you can only do that by specifying which encoding to use to pick bytes.
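As a rough illustration of that point, here is a minimal sketch. It assumes the fetched value contains a non-breaking space (U+00A0) where the literal has a plain space; that is only a guess, though the "Â" in the question's output would be consistent with it. The class and method names are made up for the example, and ISO-8859-1 stands in for Encoding.Default, which depends on the machine.

```csharp
using System;
using System.Text;

static class EncodingDemo
{
    // Hypothetical helper: encodes as UTF-8, then decodes with a single-byte
    // encoding. ISO-8859-1 is a stand-in here for Encoding.Default.
    public static string DecodeUtf8BytesAsLatin1(string s) =>
        Encoding.GetEncoding("iso-8859-1").GetString(Encoding.UTF8.GetBytes(s));

    // The conversion attempted in the question. It encodes and decodes
    // consistently, so it returns a string equal to its input.
    public static string ConvertLikeQuestion(string s)
    {
        byte[] utf8Bytes = Encoding.UTF8.GetBytes(s);
        byte[] unicodeBytes = Encoding.Convert(Encoding.UTF8, Encoding.Unicode, utf8Bytes);
        return Encoding.Unicode.GetString(unicodeBytes);
    }

    static void Main()
    {
        // Assumption: the fetched text has U+00A0 instead of U+0020.
        string fetchedVal = "Testdoc\u00A01.docx";

        // One string, two encodings, two different byte arrays.
        Console.WriteLine(BitConverter.ToString(Encoding.UTF8.GetBytes(fetchedVal)));
        Console.WriteLine(BitConverter.ToString(Encoding.Unicode.GetBytes(fetchedVal)));

        // A consistent round trip changes nothing.
        Console.WriteLine(ConvertLikeQuestion(fetchedVal) == fetchedVal); // True

        // Mismatched encode/decode inserts an extra U+00C2 character.
        Console.WriteLine(DecodeUtf8BytesAsLatin1(fetchedVal).Length - fetchedVal.Length); // 1
    }
}
```

Under that assumption, neither attempt can make the strings equal: the consistent round trip hands back the same characters, and the mismatched one only adds a stray character.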

It sounds like you actually simply have different characters in your strings. You might have an invisible character in one of them, or they might have different characters that look the same.
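A small sketch of the look-alike case, again assuming (not confirmed by the question) that the differing character is a non-breaking space, U+00A0. NormalizeSpaces is a hypothetical helper, not a framework method.

```csharp
using System;

static class LookAlikeDemo
{
    // Hypothetical helper: replaces non-breaking spaces with ordinary spaces.
    public static string NormalizeSpaces(string s) => s.Replace('\u00A0', ' ');

    static void Main()
    {
        string docname = "Testdoc 1.docx";          // plain space, U+0020
        string fetchedVal = "Testdoc\u00A01.docx";  // assumed: non-breaking space

        // The strings render alike but hold different characters.
        Console.WriteLine(docname == fetchedVal);                  // False

        // Replacing the look-alike character makes the comparison succeed.
        Console.WriteLine(docname == NormalizeSpaces(fetchedVal)); // True
    }
}
```

A replacement like this is only appropriate once you have confirmed which character actually differs in your data.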

To find out, look at the Unicode code point value of each character in each string (e.g., (int)str[0]).
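One way to do that inspection, as a sketch: the helpers below (Dump and FirstDifference are names invented for this example) print every char as U+XXXX and report the first index where two strings differ. The sample value with U+00A0 is an assumption about what the TextBox returned.

```csharp
using System;
using System.Linq;

static class CodePointDump
{
    // Formats each char (UTF-16 code unit) as U+XXXX. Characters outside the
    // Basic Multilingual Plane would show up as two surrogate values.
    public static string Dump(string s) =>
        string.Join(" ", s.Select(c => "U+" + ((int)c).ToString("X4")));

    // Returns the zero-based index of the first differing char, or -1 if
    // the strings are equal.
    public static int FirstDifference(string a, string b)
    {
        int n = Math.Min(a.Length, b.Length);
        for (int i = 0; i < n; i++)
        {
            if (a[i] != b[i]) return i;
        }
        return a.Length == b.Length ? -1 : n;
    }

    static void Main()
    {
        string docname = "Testdoc 1.docx";
        string fetchedVal = "Testdoc\u00A01.docx"; // assumed content

        Console.WriteLine(Dump(docname));
        Console.WriteLine(Dump(fetchedVal));
        Console.WriteLine(FirstDifference(docname, fetchedVal)); // 7
    }
}
```

Running this against the real values should show exactly which character differs, without involving any byte arrays.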

111

answered Oct 12 '22 21:10

SLaks

Tags: string, c#, encoding
