Am sure this is easy, so apologies. In Perl I might do something like <pre class="prettyprint"><code>my $str = "foo=23"; $str ~= m/foo=([0-9]+)/ print "foo value is " . $1 </code></pre> ie use parentheses in the regex to be able to refer to part of the match later as $1, $2 etc. What is the equivalent in awk?

Also with GNU awk, use the <code>match()</code> function and capture the parenthesis groups into an array. <pre class="prettyprint"><code>str = "foo=23" match(str, /foo=([0-9]+)/, ary) print "foo value is " ary[1] </code></pre>

In GNU awk that'd be: <pre class="prettyprint"><code>$ cat tst.awk BEGIN { str = "foo=23" val = gensub(/foo=([0-9]+)/,"\\1","",str) print "foo value is " val } $ $ gawk -f tst.awk foo value is 23 </code></pre> In other awk's you'd need to use [g]sub() and/or match() and/or substr() depending on what else you do/don't want to match on. For example: <pre class="prettyprint"><code>$ cat tst.awk BEGIN { str = "foo=23" val = substr(str,match(str,/foo=[0-9]+/)+length("foo=")) print "foo value is " val } $ awk -f tst.awk foo value is 23 </code></pre> You'd need a third arg of ',RLENGTH-length("foo=")' on the substr() call if the target pattern isn't at the end of a line. Make "foo=" a variable if you like and if it itself can contain an RE there's a few more steps necessary.

awk syntax for getting part of a matched regex

Tags:

regex

match

awk

perl

Am sure this is easy, so apologies. In Perl I might do something like

my $str = "foo=23";
$str ~= m/foo=([0-9]+)/
print "foo value is " . $1

ie use parentheses in the regex to be able to refer to part of the match later as $1, $2 etc. What is the equivalent in awk?

763

asked Oct 21 '12 15:10

gimmeamilk

2 Answers

Also with GNU awk, use the match() function and capture the parenthesis groups into an array.

str = "foo=23"
match(str, /foo=([0-9]+)/, ary)
print "foo value is " ary[1]

111

answered Nov 04 '22 05:11

glenn jackman

In GNU awk that'd be:

$ cat tst.awk
BEGIN {
   str = "foo=23"
   val = gensub(/foo=([0-9]+)/,"\\1","",str)
   print "foo value is " val
}
$
$ gawk -f tst.awk
foo value is 23

In other awk's you'd need to use [g]sub() and/or match() and/or substr() depending on what else you do/don't want to match on. For example:

$ cat tst.awk
BEGIN {
   str = "foo=23"
   val = substr(str,match(str,/foo=[0-9]+/)+length("foo="))
   print "foo value is " val
}
$ awk -f tst.awk
foo value is 23

You'd need a third arg of ',RLENGTH-length("foo=")' on the substr() call if the target pattern isn't at the end of a line. Make "foo=" a variable if you like and if it itself can contain an RE there's a few more steps necessary.

answered Nov 04 '22 07:11

Ed Morton

Related questions
                            
                                Word boundary won't match the beginning or end in Javascript
                            
                                python: how to interrupt a regex match
                            
                                How to split a string based on punctuation marks and whitespace?
                            
                                How do I fix this multiline regular expression in Ruby?
                            
                                Need help with Regular Expression for nine digit alphanumeric with minimum one space boundary
                            
                                Why does smartmatch return false when I match against a regex containing slashes?
                            
                                How to neatly match "x" and "[x]" with a regex without repeating?
                            
                                Pattern.matches doesn't work, while replaceAll does
                            
                                does java support if-then-else regexp constructs(Perl constructs)?
                            
                                Bug with re.split function and re.DOTALL flag in re module of Python 2.7.1
                            
                                What is the right way to get a grapheme?
                            
                                Python: How to prepend the string 'ub' to every pronounced vowel in a string?
                            
                                Python 3: Searching A Large Text File With REGEX
                            
                                get inner patterns recursively using regex c#
                            
                                Why order matters in this RegEx with alternation?
                            
                                Unicode, regular expressions and PyPy
                            
                                Calculate Number of Consecutive Characters in a String using Perl
                            
                                Find/Replace using grep and Textwrangler
                            
                                BASH - find specific folder with find and filter with regex
                            
                                replace() and replaceAll() in Java

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With