I have a string in a Bash shell script that I want to split into an array of characters, not based on a delimiter but just one character per array index. How can I do this? <strike>Ideally it would not use any external programs.</strike> Let me rephrase that. My goal is portability, so things like <code>sed</code> that are likely to be on any POSIX compatible system are fine.

If your string is stored in variable x, this produces an array y with the individual characters: <pre class="prettyprint"><code>i=0 while [ $i -lt ${#x} ]; do y[$i]=${x:$i:1}; i=$((i+1));done </code></pre>

As an alternative to iterating over <code>0 .. ${#string}-1</code> with a for/while loop, there are two other ways I can think of to do this with only bash: using <code>=~</code> and using <code>printf</code>. (There's a third possibility using <code>eval</code> and a <code>{..}</code> sequence expression, but this lacks clarity.) With the correct environment and NLS enabled in bash these will work with non-ASCII as hoped, removing potential sources of failure with older system tools such as <code>sed</code>, if that's a concern. These will work from bash-3.0 (released 2005). Using <code>=~</code> and regular expressions, converting a string to an array in a single expression: <pre class="prettyprint"><code>string="wonkabars" [[ "$string" =~ ${string//?/(.)} ]] # splits into array printf "%s\n" "${BASH_REMATCH[@]:1}" # loop free: reuse fmtstr declare -a arr=( "${BASH_REMATCH[@]:1}" ) # copy array for later </code></pre> The way this works is to perform an expansion of <code>string</code> which substitutes each single character for <code>(.)</code>, then match this generated regular expression with grouping to capture each individual character into <code>BASH_REMATCH[]</code>. Index 0 is set to the entire string, since that special array is read-only you cannot remove it, note the <code>:1</code> when the array is expanded to skip over index 0, if needed. Some quick testing for non-trivial strings (>64 chars) shows this method is substantially faster than one using bash string and array operations. The above will work with strings containing newlines, <code>=~</code> supports POSIX ERE where <code>.</code> matches anything except NUL by default, i.e. the regex is compiled without <code>REG_NEWLINE</code>. (The behaviour of POSIX text processing utilities is allowed to be different by default in this respect, and usually is.) Second option, using <code>printf</code>: <pre class="prettyprint"><code>string="wonkabars" ii=0 while printf "%s%n" "${string:ii++:1}" xx; do ((xx)) && printf "\n" || break done </code></pre> This loop increments index <code>ii</code> to print one character at a time, and breaks out when there are no characters left. This would be even simpler if the bash <code>printf</code> returned the number of character printed (as in C) rather than an error status, instead the number of characters printed is captured in <code>xx</code> using <code>%n</code>. (This works at least back as far as bash-2.05b.) With bash-3.1 and <code>printf -v var</code> you have slightly more flexibility, and can avoid falling off the end of the string should you be doing something other than printing the characters, e.g. to create an array: <pre class="prettyprint"><code>declare -a arr ii=0 while printf -v cc "%s%n" "${string:(ii++):1}" xx; do ((xx)) && arr+=("$cc") || break done </code></pre>

I have found that the following works the best: <pre class="prettyprint"><code>array=( `echo string | grep -o . ` ) </code></pre> (note the backticks) then if you do: <code>echo ${array[@]}</code> , you get: <code>s t r i n g</code> or: <code>echo ${array[2]}</code> , you get: <code>r</code>

If the text can contain spaces: <pre class="prettyprint"><code>eval a=( $(echo "this is a test" | sed "s/$.$/'\1' /g") ) </code></pre>

<pre class="prettyprint"><code>$ echo hello | awk NF=NF FS= h e l l o </code></pre> Or <pre class="prettyprint"><code>$ echo hello | awk '$0=RT' RS=[[:alnum:]] h e l l o </code></pre>

Yet another on :), the stated question simply says 'Split string into character array' and don't say much about the state of the receiving array, and don't say much about special chars like and control chars. My assumption is that if I want to split a string into an array of chars I want the receiving array containing just that string and no left over from previous runs, yet preserve any special chars. For instance the proposed solution family like <pre class="prettyprint"><code>for (( i=0 ; i < ${#x} ; i++ )); do y[i]=${x:i:1}; done </code></pre> Have left overs in the target array. <pre class="prettyprint"><code>$ y=(1 2 3 4 5 6 7 8) $ x=abc $ for (( i=0 ; i < ${#x} ; i++ )); do y[i]=${x:i:1}; done $ printf '%s ' "${y[@]}" a b c 4 5 6 7 8 </code></pre> Beside writing the long line each time we want to split a problem, so why not hide all this into a function we can keep is a package source file, with a API like <pre class="prettyprint"><code>s2a "Long string" ArrayName </code></pre> I got this one that seems to do the job. <pre class="prettyprint"><code>$ s2a() > { [ "$2" ] && typeset -n __=$2 && unset $2; > [ "$1" ] && __+=("${1:0:1}") && s2a "${1:1}" > } $ a=(1 2 3 4 5 6 7 8 9 0) ; printf '%s ' "${a[@]}" 1 2 3 4 5 6 7 8 9 0 $ s2a "Split It" a ; printf '%s ' "${a[@]}" S p l i t I t </code></pre>

Bash: Split string into character array

Q: How do I echo an array in bash?

How to Echo a Bash Array? To echo an array, use the format echo ${Array[0]}. Array is your array name, and 0 is the index or the key if you are echoing an associative array. You can also use @ or * symbols instead of an index to print the entire array.

Tags:

string

bash

I have a string in a Bash shell script that I want to split into an array of characters, not based on a delimiter but just one character per array index. How can I do this? ~~Ideally it would not use any external programs.~~ Let me rephrase that. My goal is portability, so things like sed that are likely to be on any POSIX compatible system are fine.

721

asked Sep 28 '11 05:09

n s

12 Answers

Try

echo "abcdefg" | fold -w1

Edit: Added a more elegant solution suggested in comments.

echo "abcdefg" | grep -o .

answered Sep 30 '22 19:09

xdazz

You can access each letter individually already without an array conversion:

$ foo="bar"
$ echo ${foo:0:1}
b
$ echo ${foo:1:1}
a
$ echo ${foo:2:1}
r

If that's not enough, you could use something like this:

$ bar=($(echo $foo|sed  's/\(.\)/\1 /g'))
$ echo ${bar[1]}
a

If you can't even use sed or something like that, you can use the first technique above combined with a while loop using the original string's length (${#foo}) to build the array.

Warning: the code below does not work if the string contains whitespace. I think Vaughn Cato's answer has a better chance at surviving with special chars.

thing=($(i=0; while [ $i -lt ${#foo} ] ; do echo ${foo:$i:1} ; i=$((i+1)) ; done))

answered Sep 30 '22 20:09

Mat

If your string is stored in variable x, this produces an array y with the individual characters:

i=0
while [ $i -lt ${#x} ]; do y[$i]=${x:$i:1};  i=$((i+1));done

answered Sep 30 '22 18:09

Vaughn Cato

As an alternative to iterating over 0 .. ${#string}-1 with a for/while loop, there are two other ways I can think of to do this with only bash: using =~ and using printf. (There's a third possibility using eval and a {..} sequence expression, but this lacks clarity.)

With the correct environment and NLS enabled in bash these will work with non-ASCII as hoped, removing potential sources of failure with older system tools such as sed, if that's a concern. These will work from bash-3.0 (released 2005).

Using =~ and regular expressions, converting a string to an array in a single expression:

string="wonkabars"
[[ "$string" =~ ${string//?/(.)} ]]       # splits into array
printf "%s\n" "${BASH_REMATCH[@]:1}"      # loop free: reuse fmtstr
declare -a arr=( "${BASH_REMATCH[@]:1}" ) # copy array for later

The way this works is to perform an expansion of string which substitutes each single character for (.), then match this generated regular expression with grouping to capture each individual character into BASH_REMATCH[]. Index 0 is set to the entire string, since that special array is read-only you cannot remove it, note the :1 when the array is expanded to skip over index 0, if needed. Some quick testing for non-trivial strings (>64 chars) shows this method is substantially faster than one using bash string and array operations.

The above will work with strings containing newlines, =~ supports POSIX ERE where . matches anything except NUL by default, i.e. the regex is compiled without REG_NEWLINE. (The behaviour of POSIX text processing utilities is allowed to be different by default in this respect, and usually is.)

Second option, using printf:

string="wonkabars"
ii=0
while printf "%s%n" "${string:ii++:1}" xx; do 
  ((xx)) && printf "\n" || break
done

This loop increments index ii to print one character at a time, and breaks out when there are no characters left. This would be even simpler if the bash printf returned the number of character printed (as in C) rather than an error status, instead the number of characters printed is captured in xx using %n. (This works at least back as far as bash-2.05b.)

With bash-3.1 and printf -v var you have slightly more flexibility, and can avoid falling off the end of the string should you be doing something other than printing the characters, e.g. to create an array:

declare -a arr
ii=0
while printf -v cc "%s%n" "${string:(ii++):1}" xx; do 
    ((xx)) && arr+=("$cc") || break
done

answered Sep 30 '22 20:09

mr.spuratic

Pure Bash solution with no loop:

#!/usr/bin/env bash

str='The quick brown fox jumps over a lazy dog.'

# Need extglob for the replacement pattern
shopt -s extglob

# Split string characters into array (skip first record)
# Character 037 is the octal representation of ASCII Record Separator
# so it can capture all other characters in the string, including spaces.
IFS= mapfile -s1 -t -d $'\37' array <<<"${str//?()/$'\37'}"

# Strip out captured trailing newline of here-string in last record
array[-1]="${array[-1]%?}"

# Debug print array
declare -p array

answered Sep 30 '22 19:09

Léa Gris

The most simple, complete and elegant solution:

$ read -a ARRAY <<< $(echo "abcdefg" | sed 's/./& /g')

and test

$ echo ${ARRAY[0]}
  a

$ echo ${ARRAY[1]}
  b

Explanation: read -a reads the stdin as an array and assigns it to the variable ARRAY treating spaces as delimiter for each array item.

The evaluation of echoing the string to sed just add needed spaces between each character.

We are using Here String (<<<) to feed the stdin of the read command.

answered Sep 30 '22 18:09

Alexandro de Oliveira

I have found that the following works the best:

array=( `echo string | grep -o . ` )

(note the backticks)

then if you do: echo ${array[@]} , you get: s t r i n g

or: echo ${array[2]} , you get: r

answered Sep 30 '22 19:09

AZAhmed

string=hello123

for i in $(seq 0 ${#string})
    do array[$i]=${string:$i:1}
done

echo "zero element of array is [${array[0]}]"
echo "entire array is [${array[@]}]"

The zero element of array is [h]. The entire array is [h e l l o 1 2 3 ].

answered Sep 30 '22 20:09

0x00

If the text can contain spaces:

eval a=( $(echo "this is a test" | sed "s/\(.\)/'\1' /g") )

answered Sep 30 '22 19:09

Karoly Horvath

$ echo hello | awk NF=NF FS=
h e l l o

$ echo hello | awk '$0=RT' RS=[[:alnum:]]
h
e
l
l
o

answered Sep 30 '22 19:09

Zombo

Yet another on :), the stated question simply says 'Split string into character array' and don't say much about the state of the receiving array, and don't say much about special chars like and control chars.

My assumption is that if I want to split a string into an array of chars I want the receiving array containing just that string and no left over from previous runs, yet preserve any special chars.

For instance the proposed solution family like

for (( i=0 ; i < ${#x} ; i++ )); do y[i]=${x:i:1}; done

Have left overs in the target array.

$ y=(1 2 3 4 5 6 7 8)
$ x=abc
$ for (( i=0 ; i < ${#x} ; i++ )); do y[i]=${x:i:1}; done
$ printf '%s ' "${y[@]}"
a b c 4 5 6 7 8

Beside writing the long line each time we want to split a problem, so why not hide all this into a function we can keep is a package source file, with a API like

s2a "Long string" ArrayName

I got this one that seems to do the job.

$ s2a()
> { [ "$2" ] && typeset -n __=$2 && unset $2;
>   [ "$1" ] && __+=("${1:0:1}") && s2a "${1:1}"
> }

$ a=(1 2 3 4 5 6 7 8 9 0) ; printf '%s ' "${a[@]}"
1 2 3 4 5 6 7 8 9 0 

$ s2a "Split It" a        ; printf '%s ' "${a[@]}"
S p l i t   I t

answered Sep 30 '22 19:09

Phi

I know this is a "bash" question, but please let me show you the perfect solution in zsh, a shell very popular these days:

string='this is a string'
string_array=(${(s::)string})  #Parameter expansion. And that's it!

print ${(t)string_array}  -> type array
print $#string_array -> 16 items

answered Sep 30 '22 20:09

Frat Quintero

Related questions
                            
                                Is there any way to detect strings like putjbtghguhjjjanika?
                            
                                Truncate string on whole words in .NET C#
                            
                                Convert array to JSON string in swift
                            
                                Return the portion of a string before the first occurrence of a character in PHP [duplicate]
                            
                                Need a basename function in Javascript
                            
                                How can I truncate a string to the first 20 words in PHP?
                            
                                C# IPAddress from string [closed]
                            
                                Ruby: How to count the number of times a string appears in another string?
                            
                                Fast String Hashing Algorithm with low collision rates with 32 bit integer [closed]
                            
                                How to strip a specific word from a string?
                            
                                How to convert a 'string pointer' to a string in Golang?
                            
                                Javascript How to get first three characters of a string
                            
                                Why does one long string take MORE space than lots of small strings?
                            
                                How to Format dict string outputs nicely
                            
                                BigDecimal to string
                            
                                JavaScript string with new line - but not using \n
                            
                                Remove all non-"word characters" from a String in Java, leaving accented characters?
                            
                                Change String-Array in Strings.xml to ArrayList
                            
                                Best way to split string by last occurrence of character?
                            
                                How to use stringByAddingPercentEncodingWithAllowedCharacters() for a URL in Swift 2.0

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Bash: Split string into character array

Tags:

string

bash

n s

People also ask

12 Answers

xdazz

Mat

Vaughn Cato

mr.spuratic

Léa Gris

Alexandro de Oliveira

AZAhmed

0x00

Karoly Horvath

Zombo

Phi

Frat Quintero

Recent Activity

Donate For Us