I was trying to split a string into an array of single letters. Here's what I did,
String str = "abcddadfad";
System.out.println(str.length()); // output: 10
String[] strArr = str.split("");
System.out.println(strArr.length); // output: 11
System.out.println(strArr[0]); // output is nothing
The new array did contain all the letters, however it has nothing at index 0,not even a white space, but still incremented the size of my array. Can anyone explain why this is happening?
Consider the split expression ",1,2,3,4".split(",");
What would you expect? Right, an empty-string to start with. In your case you have a 'nothing' in front of the first 'a' as well as one behind it.
Update: comments indicate this explanation is not enough of an explanation (which it may not be)... but, it really is this simple: the engine starts at the beginning of the string, and it looks to see if what's in front of it matches the pattern. If it does, it assigns what's behind it to a new item in the split.
On the first character, it has "" (nothing behind it), and it looks to see if there's "" (the pattern) in front of it. There is, so it creates a "" match.
It then moves on, and it has 'a' behind it, and again, it again has "" in front of it. So the second result is an "a" string.
An interesting observation is that, if you use split("", -1)
you will also get an empty-string result in the last position of the result array.
Edit 2: If I wrack my brains further, and consider this to be an academic exercise (I would not recommend this in real life...) I can think of only one good way to do a regex split()
of a String into a String[]
array with 1 character in each string (as opposed to char[] - which other people have given great answers for....).
String[] chars = str.split("(?<=.)", str.length());
This will look behind each character, in a non-capturing group, and split on that, and then limit the array size to the number of characters (you can leave the str.length()
out, but if you put -1
you will get an extra space at the end)
Borrowing nitro2k01's alternative (below in the comments) which references the string beginning and end, you can split reliably on:
String[] chars = str.split("(?!(^|$))");
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With