Counting unicode characters in Javascript [duplicate]

1 Answer

Your example “द्ध” is a string of three Unicode characters, and the length property correctly indicates this.
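To see where the three comes from, here is a small sketch that lists the code numbers in the string. The variable names are only illustrative. Note that `length` counts UTF-16 code units, which coincide with characters here because all three are in the Basic Multilingual Plane.

```javascript
// "द्ध" written with escapes so the three characters are visible in the source.
const s = "\u0926\u094D\u0927";

// length counts UTF-16 code units; each character here fits in one unit.
console.log(s.length); // 3

// Array.from iterates a string by code point, so this lists each character's code number.
const codePoints = Array.from(s, (ch) =>
  "U+" + ch.codePointAt(0).toString(16).toUpperCase().padStart(4, "0")
);
console.log(codePoints.join(" ")); // U+0926 U+094D U+0927
```

The three characters are DEVANAGARI LETTER DA, DEVANAGARI SIGN VIRAMA, and DEVANAGARI LETTER DHA.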

What you apparently want to count is “characters” in some other sense, something like “what a speaker of a language intuitively sees as one character”. This is a vague and mutable concept. Unicode Standard Annex #29 (UAX #29), Unicode Text Segmentation, tries to analyze the concept, calling it a “grapheme cluster”, and describes some algorithms for working with it.
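In engines that implement `Intl.Segmenter` from the ECMAScript Internationalization API, grapheme clusters can be counted directly. The following is a minimal sketch under that assumption; `countGraphemes` is a hypothetical helper name, not something from the question.

```javascript
// Hypothetical helper: counts grapheme clusters as the runtime's Intl.Segmenter sees them.
// Assumes Intl.Segmenter is available; it is missing in older engines.
function countGraphemes(text, locale) {
  const segmenter = new Intl.Segmenter(locale, { granularity: "grapheme" });
  // segment() returns an iterable with one entry per grapheme cluster.
  return [...segmenter.segment(text)].length;
}

console.log(countGraphemes("abc"));      // 3
console.log(countGraphemes("e\u0301"));  // 1 (e + combining acute accent)

// The result for "द्ध" depends on the Unicode data version the engine ships.
console.log(countGraphemes("\u0926\u094D\u0927", "hi"));
```

For “द्ध” the count may come out as 1 or 2: newer versions of the segmentation rules keep a conjunct joined by a virama together, while older ones break after the virama. This is one concrete way in which the concept is mutable.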

Unfortunately, JavaScript has traditionally had no built-in tools for recognizing whether a character is, e.g., a combining mark and thus should be regarded as part of a cluster (newer engines add Unicode property escapes in regular expressions and `Intl.Segmenter`). However, if you can limit yourself to handling just one writing system, you can probably code the operations manually, referring to the relevant Unicode characters by their code numbers.
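As a rough sketch of the manual approach for one writing system, the code below handles Devanagari only. The function names are hypothetical, and the rules are a simplification rather than the UAX #29 algorithm: dependent signs attach to what precedes them, and a consonant right after a virama joins the same cluster. Joiner characters such as ZWJ and ZWNJ are not handled.

```javascript
const VIRAMA = 0x094d;

// Devanagari dependent signs by code number (vowel signs, nukta, virama,
// candrabindu, anusvara, visarga, stress marks). U+093D AVAGRAHA is a letter.
function isDevanagariMark(cp) {
  return (
    (cp >= 0x0900 && cp <= 0x0903) ||
    (cp >= 0x093a && cp <= 0x094f && cp !== 0x093d) ||
    (cp >= 0x0951 && cp <= 0x0957) ||
    cp === 0x0962 ||
    cp === 0x0963
  );
}

// Devanagari consonants: U+0915..U+0939 plus the nukta forms U+0958..U+095F.
function isDevanagariConsonant(cp) {
  return (cp >= 0x0915 && cp <= 0x0939) || (cp >= 0x0958 && cp <= 0x095f);
}

// Hypothetical helper: approximate count of user-perceived characters.
function countDevanagariClusters(text) {
  let count = 0;
  let prevWasVirama = false;
  for (const ch of text) {
    const cp = ch.codePointAt(0);
    if (isDevanagariMark(cp)) {
      // A mark belongs to the preceding cluster; a stray leading mark counts as one.
      if (count === 0) count = 1;
    } else if (!(prevWasVirama && isDevanagariConsonant(cp))) {
      // Anything that is not a consonant continuing a conjunct starts a new cluster.
      count += 1;
    }
    prevWasVirama = cp === VIRAMA;
  }
  return count;
}

console.log(countDevanagariClusters("\u0926\u094D\u0927")); // 1 ("द्ध")
console.log(countDevanagariClusters("\u0939\u093F\u0928\u094D\u0926\u0940")); // 2 ("हिन्दी")
```

Whether a conjunct should count as one “character” is itself a design decision; drop the virama rule if each consonant should be counted separately.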

Moreover, if the intent is to make the count match the way some input editor works (e.g. how the arrow keys move over characters), you would need to know the logic of that editor. It may implement Unicode grapheme clusters in some sense, or something else.

answered Oct 18 '22 05:10

Jukka K. Korpela


Tags:

javascript

unicode

pewpewlasers
