How do I split a string into an array of characters? [duplicate]

Do NOT use `.split('')`

You'll get weird results with non-BMP (non-Basic-Multilingual-Plane) character sets.

Reason is that methods like .split() and .charCodeAt() only respect the characters with a code point below 65536; bec. higher code points are represented by a pair of (lower valued) "surrogate" pseudo-characters.

'𝟙𝟚𝟛'.length     // —> 6
'𝟙𝟚𝟛'.split('')  // —> ["�", "�", "�", "�", "�", "�"]

'😎'.length      // —> 2
'😎'.split('')   // —> ["�", "�"]

Use ES2015 (ES6) features where possible:

Using the spread operator:

let arr = [...str];

Or Array.from

let arr = Array.from(str);

Or split with the new u RegExp flag:

let arr = str.split(/(?!$)/u);

Examples:

[...'𝟙𝟚𝟛']        // —> ["𝟙", "𝟚", "𝟛"]
[...'😎😜🙃']     // —> ["😎", "😜", "🙃"]

For ES5, options are limited:

I came up with this function that internally uses MDN example to get the correct code point of each character.

function stringToArray() {
  var i = 0,
    arr = [],
    codePoint;
  while (!isNaN(codePoint = knownCharCodeAt(str, i))) {
    arr.push(String.fromCodePoint(codePoint));
    i++;
  }
  return arr;
}

This requires knownCharCodeAt() function and for some browsers; a String.fromCodePoint() polyfill.

if (!String.fromCodePoint) {
// ES6 Unicode Shims 0.1 , © 2012 Steven Levithan , MIT License
    String.fromCodePoint = function fromCodePoint () {
        var chars = [], point, offset, units, i;
        for (i = 0; i < arguments.length; ++i) {
            point = arguments[i];
            offset = point - 0x10000;
            units = point > 0xFFFF ? [0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)] : [point];
            chars.push(String.fromCharCode.apply(null, units));
        }
        return chars.join("");
    }
}

Examples:

stringToArray('𝟙𝟚𝟛')     // —> ["𝟙", "𝟚", "𝟛"]
stringToArray('😎😜🙃')  // —> ["😎", "😜", "🙃"]

Note: str[index] (ES5) and str.charAt(index) will also return weird results with non-BMP charsets. e.g. '😎'.charAt(0) returns "�".

UPDATE: Read this nice article about JS and unicode.

It's as simple as:

s.split("");

The delimiter is an empty string, hence it will break up between each single character.

.split('') would split emojis in half.

Onur's solutions and the regex's proposed work for some emojis, but can't handle more complex languages or combined emojis. Consider this emoji being ruined:

[..."🏳️‍🌈"] // returns ["🏳", "️", "‍", "🌈"]  instead of ["🏳️‍🌈"]

Also consider this Hindi text "अनुच्छेद" which is split like this:

[..."अनुच्छेद"]  // returns   ["अ", "न", "ु", "च", "्", "छ", "े", "द"]

but should in fact be split like this:

["अ","नु","च्","छे","द"]

because some of the characters are combining marks (think diacritics/accents in European languages).

You can use the grapheme-splitter library for this:

https://github.com/orling/grapheme-splitter

It does proper standards-based letter split in all the hundreds of exotic edge-cases - yes, there are that many.

Related questions
                            
                                Scroll to element on click in Angular 4
                            
                                Detect Click into Iframe using JavaScript
                            
                                "Use Strict" needed in a TypeScript file?
                            
                                Event listener for when element becomes visible?
                            
                                Maintain model of scope when changing between views in AngularJS
                            
                                "Uncaught TypeError: Illegal invocation" in Chrome
                            
                                How can I `await` on an Rx Observable?
                            
                                Jest SecurityError: localStorage is not available for opaque origins
                            
                                How do I mock a service that returns promise in AngularJS Jasmine unit test?
                            
                                Gulps gulp.watch not triggered for new or deleted files?
                            
                                How to enable CORS in AngularJs
                            
                                setState() inside of componentDidUpdate()
                            
                                `npm build` doesn't run the script named "build" in package.json
                            
                                Mock dependency in Jest with TypeScript
                            
                                How do I capture response of form.submit
                            
                                Remove the string on the beginning of an URL
                            
                                How do I clear the content of a div using JavaScript? [closed]
                            
                                Convert normal date to unix timestamp
                            
                                How to determine one year from now in Javascript
                            
                                ChunkLoadError: Loading chunk node_modules_next_dist_client_dev_noop_js failed

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do I split a string into an array of characters? [duplicate]

Tags:

javascript

string

People also ask

Do NOT use `.split('')`

Use ES2015 (ES6) features where possible:

For ES5, options are limited:

Recent Activity

Donate For Us

How do I split a string into an array of characters? [duplicate]

Tags:

javascript

string

People also ask

Do NOT use .split('')

Use ES2015 (ES6) features where possible:

For ES5, options are limited:

Related questions

Recent Activity

Donate For Us

Do NOT use `.split('')`