What does .NET's String.Normalize do?

2 Answers

One difference between form C and form D is how letters with accents are represented: form C uses a single letter-with-accent codepoint, while form D separates that into a letter and an accent.

For instance, an "à" can be codepoint 224 ("Latin small letter A with grave"), or codepoint 97 ("Latin small letter A") followed by codepoint 768 ("Combining grave accent"). A char-by-char comparison would see these as different. Normalising both strings to the same form lets the comparison succeed.
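As a minimal sketch of the comparison described above, the snippet below builds both representations of "à" and compares them before and after normalisation. The class name `NormalizationDemo` and the helper `EqualsNormalized` are hypothetical names introduced for illustration; only `String.Normalize`, `NormalizationForm` and `string.Equals` come from the framework. It assumes a runtime where Unicode normalisation data is available.

```csharp
using System;
using System.Text;

public static class NormalizationDemo
{
    // Precomposed: U+00E0 (decimal 224), LATIN SMALL LETTER A WITH GRAVE.
    public const string Composed = "\u00E0";

    // Decomposed: U+0061 (decimal 97) followed by U+0300 (decimal 768), COMBINING GRAVE ACCENT.
    public const string Decomposed = "a\u0300";

    // Hypothetical helper: bring both sides to the same normalization form,
    // then compare the resulting chars one by one (ordinal comparison).
    public static bool EqualsNormalized(
        string left,
        string right,
        NormalizationForm form = NormalizationForm.FormC)
    {
        return string.Equals(
            left.Normalize(form),
            right.Normalize(form),
            StringComparison.Ordinal);
    }
}
```

With these definitions, `string.Equals(NormalizationDemo.Composed, NormalizationDemo.Decomposed, StringComparison.Ordinal)` is false, while `NormalizationDemo.EqualsNormalized(NormalizationDemo.Composed, NormalizationDemo.Decomposed)` is true. It should not matter whether form C or form D is used for the comparison, as long as both sides use the same one.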

A side-effect is that this makes it possible to easily create a "remove accents" method.

    using System.Globalization;
    using System.Linq;

    public static string RemoveAccents(string input) {
        // Normalizing to FormD splits accented letters into letter + accent.
        // The filter then removes those accents (and other non-spacing marks),
        // and a new string is created from the remaining chars.
        return new string(input
            .Normalize(System.Text.NormalizationForm.FormD)
            .ToCharArray()
            .Where(c => CharUnicodeInfo.GetUnicodeCategory(c) != UnicodeCategory.NonSpacingMark)
            .ToArray());
    }
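To show how such a method behaves, here is a self-contained variant that can be compiled on its own. It is a sketch, not the answer's exact code: the class name `AccentStripper` is hypothetical, and the final `Normalize(NormalizationForm.FormC)` call is an addition that recomposes whatever form D split apart but the filter did not remove (for example Hangul syllables, which form D breaks into jamo that are letters rather than non-spacing marks).

```csharp
using System.Globalization;
using System.Linq;
using System.Text;

public static class AccentStripper
{
    public static string RemoveAccents(string input)
    {
        // Step 1: decompose, so "é" becomes "e" + U+0301.
        // Step 2: drop every char whose category is NonSpacingMark.
        char[] kept = input
            .Normalize(NormalizationForm.FormD)
            .Where(c => CharUnicodeInfo.GetUnicodeCategory(c) != UnicodeCategory.NonSpacingMark)
            .ToArray();

        // Step 3 (an addition to the original): recompose anything that
        // was decomposed in step 1 but not removed in step 2.
        return new string(kept).Normalize(NormalizationForm.FormC);
    }
}
```

For example, `AccentStripper.RemoveAccents("crème brûlée")` returns `"creme brulee"`. Letters that have no canonical decomposition are left alone: `AccentStripper.RemoveAccents("Søren")` returns `"Søren"`, because "ø" is a single letter in its own right rather than an "o" plus a combining mark.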

answered Sep 28 '22 04:09

Hans Keﬆing

It makes sure that Unicode strings can be compared for equality (even if they represent the same text with different sequences of code points).

From Unicode Standard Annex #15:

Essentially, the Unicode Normalization Algorithm puts all combining marks in a specified order, and uses rules for decomposition and composition to transform each string into one of the Unicode Normalization Forms. A binary comparison of the transformed strings will then determine equivalence.
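The "specified order" part of that quote can be illustrated with a short sketch. The two strings below contain the same base letter and the same two combining marks, entered in opposite orders; after normalisation a binary comparison treats them as equal. The class name `CanonicalOrderDemo` and the helper `AreCanonicallyEquivalent` are hypothetical names, and the sketch assumes a runtime where Unicode normalisation data is available.

```csharp
using System;
using System.Text;

public static class CanonicalOrderDemo
{
    // "q" + COMBINING DOT ABOVE (U+0307) + COMBINING DOT BELOW (U+0323).
    public const string DotAboveFirst = "q\u0307\u0323";

    // The same marks in the opposite order.
    public const string DotBelowFirst = "q\u0323\u0307";

    // Hypothetical helper: normalize both strings to the same form,
    // then compare them char by char (ordinal comparison).
    public static bool AreCanonicallyEquivalent(string a, string b)
    {
        return string.Equals(
            a.Normalize(NormalizationForm.FormD),
            b.Normalize(NormalizationForm.FormD),
            StringComparison.Ordinal);
    }
}
```

Compared directly, the two constants differ. `CanonicalOrderDemo.AreCanonicallyEquivalent(CanonicalOrderDemo.DotAboveFirst, CanonicalOrderDemo.DotBelowFirst)` returns true, because normalisation puts the dot below before the dot above in both strings.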

answered Sep 28 '22 05:09

Oded

Tags:

string

.net

GeReV
