I want to know if I found a bug in the .NET Framework, or if I don't understand something. After running this piece of code: <pre class="prettyprint"><code>var text = "مباركُ وبعض أكثر من نص"; var word = "مبارك"; bool exist = text.Contains(word); int index = text.IndexOf(word); </code></pre> The results are the "exists = true" and "index = -1" How can it be?

<code>Contains</code> is culture-insensitive: <blockquote> This method performs an ordinal (case-sensitive and culture-insensitive) comparison. </blockquote> <code>IndexOf</code> is culture-sensitive: <blockquote> This method performs a word (case-sensitive and culture-sensitive) search using the current culture. </blockquote> That's the difference. If you use <pre class="prettyprint"><code>int index = text.IndexOf(word, StringComparison.Ordinal); </code></pre> then you'll get an index of 0 instead of -1 (so it's consistent with <code>Contains</code>). There's no culture-sensitive overload of <code>Contains</code>; it's unclear to me whether you can use <code>IndexOf</code> reliably for this, but the <code>CompareInfo</code> class gives some more options. (I really don't know much about the details of cultural comparisons, particularly with RTL text. I just know it's complicated!)

Why are String.IndexOf and String.Contains disagreeing when provided with Arabic text?

Tags:

.net

arabic

I want to know if I found a bug in the .NET Framework, or if I don't understand something. After running this piece of code:

var text = "مباركُ وبعض أكثر من نص";
var word = "مبارك";
bool exist = text.Contains(word);
int index = text.IndexOf(word);

The results are the "exists = true" and "index = -1"

How can it be?

236

asked Sep 11 '13 06:09

gil kr

1 Answers

Contains is culture-insensitive:

This method performs an ordinal (case-sensitive and culture-insensitive) comparison.

IndexOf is culture-sensitive:

This method performs a word (case-sensitive and culture-sensitive) search using the current culture.

That's the difference. If you use

int index = text.IndexOf(word, StringComparison.Ordinal);

then you'll get an index of 0 instead of -1 (so it's consistent with Contains).

There's no culture-sensitive overload of Contains; it's unclear to me whether you can use IndexOf reliably for this, but the CompareInfo class gives some more options. (I really don't know much about the details of cultural comparisons, particularly with RTL text. I just know it's complicated!)

145

answered Oct 15 '22 02:10

Jon Skeet

Related questions
                            
                                .NET SVCUTIL does not generate namespaces properly
                            
                                Access an XSLT file as resource from same project?
                            
                                Are basic .NET Windows Forms controls native Win32 controls?
                            
                                Await Inside Foreach Keyword
                            
                                HttpWebRequest Response Stream only returns 64k of data
                            
                                Autofac dependency injection implementation
                            
                                Issue with Code Coverage in VS 2012
                            
                                Configuring Quartz.NET with SQL Server AdoJobStore
                            
                                What does declaring and instantiating a c# array actually mean?
                            
                                Where does the .NET framework come from? Visual Studio or Windows?
                            
                                Azure storage sdk v1.3 to v2 => SetConfigurationSettingPublisher
                            
                                Searching list C# by containing letters
                            
                                Visualize object properties in a WPF control
                            
                                How to retrieve HTTP header information from a C# RESTful Service Method
                            
                                Is there a difference between Assembly.ExportedTypes and Assembly.GetExportedTypes()
                            
                                Could not determine JSON object type for type System.Char
                            
                                How to find usb devices in c#?
                            
                                "The maximum message size quota for incoming messages (65536) has been exceeded.". Even after setting greater size
                            
                                How many HTTP request can we make simultaneously?
                            
                                Removing color cast from image

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With