The string I want to convert into a character array is ষ্টোর, a Bengali word stored as Unicode. When I convert it in Visual Studio I get 6 characters, but when I convert it in Android Studio I get 5.

In Visual Studio I am using char[] arrayOfChars = someString.ToCharArray(); and in Android Studio char[] arrayOfChars = someString.toCharArray();

[Image: Visual Studio debugging info]

[Image: Android Studio debugging info]

N.B. My Android Studio IDE and project encoding are both UTF-8. I expect the same result in Android Studio as in Visual Studio.


String to character array returning different result in Visual Studio and Android Studio

1 Answer

Those two arrays are Unicode-equivalent, but they are represented in different normalization forms. What seems to be happening is that the string on the Java side is held in one normalization form, while the string on the C# side is held in another. Java's toCharArray and C#'s ToCharArray themselves only copy out the UTF-16 code units already in the string; neither one normalizes.

This page contains a chart of the different normalization forms for Bengali text. The fourth row there describes exactly what you're seeing:

[Image: Bengali table]

I am only learning about this now, but it seems to me that the motivation is to let Unicode implementations remain compatible with pre-existing encodings wherever possible and practical.

For example, one pre-existing encoding may have used a single character, while another may have used two characters combined to write the same thing. The solution settled on by the Unicode folks is thus to support both, at the cost of a given piece of text not having a single representation unless you normalize it, as you've encountered here.
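To make the "one character versus two combined characters" point concrete, here is a minimal sketch that prints the UTF-16 code units of both spellings. It assumes the word in the question is made up of the code points written out below; the class name CodeUnitDump and the helper dump are hypothetical names used only for this illustration.

```java
public class CodeUnitDump {
    // Formats each UTF-16 code unit of s as U+XXXX, separated by spaces.
    // (Hypothetical helper, not part of any library.)
    static String dump(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("U+%04X", (int) c));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Assumed composed spelling: the vowel sign O is the single code point U+09CB.
        String composed = "\u09B7\u09CD\u099F\u09CB\u09B0";
        // Assumed decomposed spelling: the same vowel sign written as U+09C7 followed by U+09BE.
        String decomposed = "\u09B7\u09CD\u099F\u09C7\u09BE\u09B0";

        System.out.println(composed.length() + ": " + dump(composed));
        System.out.println(decomposed.length() + ": " + dump(decomposed));

        // The two strings render the same but are not equal code unit by code unit.
        System.out.println(composed.equals(decomposed)); // false
    }
}
```

The first line lists 5 code units and the second lists 6, which would match the counts reported for Android Studio and Visual Studio respectively.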

If you wish for your Java array to be normalized under the "D" normalization form (NFD, canonical decomposition) that your C# array seems to be using, it appears that java.text.Normalizer provides such a function. You may be looking for something like:

someString = Normalizer.normalize(someString, Normalizer.Form.NFD);
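A slightly fuller sketch of the same call, under the assumption that the Java string arrives in composed form with the code points shown. NormalizeDemo, toDecomposedChars, and toComposedChars are hypothetical names for this example; only Normalizer.normalize and Normalizer.Form come from java.text.

```java
import java.text.Normalizer;
import java.util.Arrays;

public class NormalizeDemo {
    // Code units of s after canonical decomposition (NFD).
    static char[] toDecomposedChars(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFD).toCharArray();
    }

    // Code units of s after canonical composition (NFC).
    static char[] toComposedChars(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFC).toCharArray();
    }

    public static void main(String[] args) {
        // Assumption: the word in composed form, with the vowel sign O as U+09CB.
        String someString = "\u09B7\u09CD\u099F\u09CB\u09B0";

        char[] asIs = someString.toCharArray();
        char[] decomposed = toDecomposedChars(someString);

        System.out.println(asIs.length);       // 5
        System.out.println(decomposed.length); // 6

        // Normalizing back to NFC recovers the original code units.
        char[] roundTrip = toComposedChars(new String(decomposed));
        System.out.println(Arrays.equals(asIs, roundTrip)); // true
    }
}
```

Whichever form you pick, comparing strings between the two programs is only reliable if both sides normalize to the same form before converting to a char array.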

Unicode Standard Annex #15 (UAX #15, "Unicode Normalization Forms") is the official document that describes these normalization forms.

141

answered Oct 19 '22 06:10

Jeremy


Tags:

java

string

c#

visual-studio

android-studio

bluetoothfx
