I am trying to split of letter and number boundaries, but the solution with lookarounds fails: <pre class="prettyprint"><code>echo 50cats30dogs100squirrels | awk '{split($0,a,/(?<=\D)(.*)(?=\d)/); print a[1],a[2],a[3]}' awk: illegal primary in regular expression (?<=\D)(.*)(?=\d) at <=\D)(.*)(?=\d) source line number 1 context is >>> {split($0,a,/(?<=\D)(.*)(?=\d)/) <<< </code></pre> Is there a way to do this in Awk in other way? Edit: Sorry for not being clear. The expected output is to just add spaces like this: <pre class="prettyprint"><code>50 cats 30 dogs 100 squirrels </code></pre>

With your shown samples only. Could you please try following, if this is what you are looking for. Written and tested in GNU <code>awk</code>(should work in any <code>awk</code> I believe). <pre class="prettyprint"><code>echo "50cats30dogs100squirrels" | awk '{gsub(/[^0-9]+/," & ")} 1' </code></pre> Output will be as follows for shown samples: <pre class="prettyprint"><code>50 cats 30 dogs 100 squirrels </code></pre>

<blockquote> Is there a way to do this in Awk in other way? </blockquote> I would use GNU <code>AWK</code> for this task following way, let <code>file.txt</code> content be <pre class="prettyprint"><code>50cats30dogs100squirrels </code></pre> then <pre class="prettyprint"><code>awk 'BEGIN{FPAT="([[:alpha:]]+)|([[:digit:]]+)"}{$1=$1;print}' file.txt </code></pre> output <pre class="prettyprint"><code>50 cats 30 dogs 100 squirrels </code></pre> Explanation: I instruct AWK that column is (one or more letters) or (one or more digits) using <code>FPAT</code>. Then I do <code>$1=$1</code> to cause string rebuilt (without <code>$1=$1;</code> output would be same as input) and <code>print</code> it. (tested in gawk 4.2.1)

<code>(?<=\D)(.*)(?=\d)</code> is a PCRE. No mandatory Unix tools as defined by the POSIX standard support PCREs. awk in particular supports EREs. With GNU awk for FPAT: <pre class="prettyprint"><code>$ echo '50cats30dogs100squirrels' | awk -v FPAT='[0-9]+|[^0-9]+' '{$1=$1}1' 50 cats 30 dogs 100 squirrels </code></pre>

Awk split string into words and numbers

Tags:

awk

I am trying to split of letter and number boundaries, but the solution with lookarounds fails:

echo 50cats30dogs100squirrels | awk '{split($0,a,/(?<=\D)(.*)(?=\d)/); print a[1],a[2],a[3]}'

awk: illegal primary in regular expression (?<=\D)(.*)(?=\d) at <=\D)(.*)(?=\d)
 source line number 1
 context is
     >>> {split($0,a,/(?<=\D)(.*)(?=\d)/) <<<

Is there a way to do this in Awk in other way?

Edit:

Sorry for not being clear. The expected output is to just add spaces like this:

50 cats 30 dogs 100 squirrels

692

asked Mar 05 '21 13:03

Lechu

Video Answer

4 Answers

With your shown samples only. Could you please try following, if this is what you are looking for. Written and tested in GNU awk(should work in any awk I believe).

echo "50cats30dogs100squirrels" | awk '{gsub(/[^0-9]+/," & ")} 1'

Output will be as follows for shown samples:

50 cats 30 dogs 100 squirrels

124

answered Oct 21 '22 20:10

RavinderSingh13

Is there a way to do this in Awk in other way?

I would use GNU AWK for this task following way, let file.txt content be

50cats30dogs100squirrels

then

awk 'BEGIN{FPAT="([[:alpha:]]+)|([[:digit:]]+)"}{$1=$1;print}' file.txt

output

50 cats 30 dogs 100 squirrels

Explanation: I instruct AWK that column is (one or more letters) or (one or more digits) using FPAT. Then I do $1=$1 to cause string rebuilt (without $1=$1; output would be same as input) and print it.

(tested in gawk 4.2.1)

answered Oct 21 '22 21:10

Daweo

You could try this:

echo 50cats30dogs100squirrels | awk '{while (match($0, /[0-9]+|[a-zA-Z]+/)) {print substr($0, RSTART, RLENGTH);$0=substr($0, RSTART+RLENGTH)}}'

Which yields:

50
cats
30
dogs
100
squirrels

answered Oct 21 '22 19:10

Nick Mancuso

(?<=\D)(.*)(?=\d) is a PCRE. No mandatory Unix tools as defined by the POSIX standard support PCREs. awk in particular supports EREs.

With GNU awk for FPAT:

$ echo '50cats30dogs100squirrels' | awk -v FPAT='[0-9]+|[^0-9]+' '{$1=$1}1'
50 cats 30 dogs 100 squirrels

answered Oct 21 '22 21:10

Ed Morton

Related questions
                            
                                Parsing pipe delimited input in awk
                            
                                MySQL import from stdin
                            
                                using awk in tcl script
                            
                                Awk between two patterns with pattern in the middle
                            
                                Parsing Json data columnwise in shell
                            
                                Check if nth bit is set in bash
                            
                                Reformatting text file using awk and cut as a one liner
                            
                                AWK Split File every n-th Row but group IDs together
                            
                                Uniq in awk; removing duplicate values in a column using awk
                            
                                how to display all lines from one that match regex in linux
                            
                                Delete line from text file with line numbers from another file
                            
                                Using backticks or $() with xargs and sed or awk
                            
                                How to find files containing exactly 16 lines?
                            
                                reordering columns with AWK
                            
                                Search replace string in a file based on column in other file
                            
                                How to count the number of instances of entries in column 1 and print the value to a new column
                            
                                The differences between gawk and mawk (column width)
                            
                                count pattern occurrence per line
                            
                                Linux awk merge two files
                            
                                Sort rows in csv file without header & first column

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With