The built-in variables <code>@-</code> and <code>@+</code> hold the start and end positions, respectively, of the last successful match. <code>$-[0]</code> and <code>$+[0]</code> correspond to entire pattern, while <code>$-[N]</code> and <code>$+[N]</code> correspond to the <code>$N</code> (<code>$1</code>, <code>$2</code>, etc.) submatches. Forget my previous post, I've got a better idea. <pre class="prettyprint"><code>sub match_positions { my ($regex, $string) = @_; return if not $string =~ /$regex/; return ($-[0], $+[0]); } sub match_all_positions { my ($regex, $string) = @_; my @ret; while ($string =~ /$regex/g) { push @ret, [ $-[0], $+[0] ]; } return @ret } </code></pre> This technique doesn't change the regex in any way. Edited to add: to quote from perlvar on $1..$9. "These variables are all read-only and dynamically scoped to the current BLOCK." In other words, if you want to use $1..$9, you cannot use a subroutine to do the matching. The pos function gives you the position of the match. If you put your regex in parentheses you can get the length (and thus the end) using <code>length $1</code>. Like this <pre class="prettyprint"><code>sub match_positions { my ($regex, $string) = @_; return if not $string =~ /($regex)/; return (pos($string) - length $1, pos($string)); } sub all_match_positions { my ($regex, $string) = @_; my @ret; while ($string =~ /($regex)/g) { push @ret, [pos($string) - length $1, pos($string)]; } return @ret } </code></pre>

How can I find the location of a regex match in Perl?

Tags:

The built-in variables @- and @+ hold the start and end positions, respectively, of the last successful match. $-[0] and $+[0] correspond to entire pattern, while $-[N] and $+[N] correspond to the $N ($1, $2, etc.) submatches.

Forget my previous post, I've got a better idea.

sub match_positions {
    my ($regex, $string) = @_;
    return if not $string =~ /$regex/;
    return ($-[0], $+[0]);
}
sub match_all_positions {
    my ($regex, $string) = @_;
    my @ret;
    while ($string =~ /$regex/g) {
        push @ret, [ $-[0], $+[0] ];
    }
    return @ret
}

This technique doesn't change the regex in any way.

Edited to add: to quote from perlvar on $1..$9. "These variables are all read-only and dynamically scoped to the current BLOCK." In other words, if you want to use $1..$9, you cannot use a subroutine to do the matching.

The pos function gives you the position of the match. If you put your regex in parentheses you can get the length (and thus the end) using length $1. Like this

sub match_positions {
    my ($regex, $string) = @_;
    return if not $string =~ /($regex)/;
    return (pos($string) - length $1, pos($string));
}
sub all_match_positions {
    my ($regex, $string) = @_;
    my @ret;
    while ($string =~ /($regex)/g) {
        push @ret, [pos($string) - length $1, pos($string)];
    }
    return @ret
}

Related questions
                            
                                get string between two strings with javascript [duplicate]
                            
                                Regular Expression to replace " {" with "(newline){" in xcode
                            
                                How to use sed to replace regex capture group?
                            
                                Regex capitalize first letter every word, also after a special character like a dash
                            
                                Difference between ".+" and ".+?"
                            
                                My Vim replace with a regex is throwing a `E488: Trailing characters`
                            
                                Regular expression to select all whitespace that isn't in quotes?
                            
                                Regular expressions (RegEx) and dplyr::filter()
                            
                                Regex to match an optional '+' symbol followed by any number of digits
                            
                                Javascript: highlight substring keeping original case but searching in case insensitive mode
                            
                                Guide on how to use regex in Nginx location block section?
                            
                                Finding and removing Non-ASCII characters from an Oracle Varchar2
                            
                                Fully qualified domain name validation
                            
                                Remove square brackets from a string vector
                            
                                What are non-word boundary in regex (\B), compared to word-boundary?
                            
                                Regular expression for decimal number
                            
                                Regex for extracting filename from path
                            
                                java regular expression to extract content within square brackets
                            
                                HTML input for Positive Whole Numbers Only (Type=number)
                            
                                regular expression matching a 3 or 4 digit cvv of a credit card

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I find the location of a regex match in Perl?

Tags:

regex

perl

Related questions

Recent Activity

Donate For Us