I've got a regular expression with capture groups that matches what I want in a broader context. I then take capture group <code>$1</code> and use it for my needs. That's easy. But how to use capture groups with <code>s///</code> when I just want to replace the content of <code>$1</code>, not the entire regex, with my replacement? For instance, if I do: <pre class="prettyprint"><code>$str =~ s/prefix (something) suffix/42/ </code></pre> <code>prefix</code> and <code>suffix</code> are removed. Instead, I would like <code>something</code> to be replaced by <code>42</code>, while keeping <code>prefix</code> and <code>suffix</code> intact.

If you only need to replace one capture then using <code>@LAST_MATCH_START</code> and <code>@LAST_MATCH_END</code> (with <code>use English</code>; see <code>perldoc perlvar</code>) together with <code>substr</code> might be a viable choice: <pre class="prettyprint"><code>use English qw(-no_match_vars); $your_string =~ m/aaa (bbb) ccc/; substr $your_string, $LAST_MATCH_START[1], $LAST_MATCH_END[1] - $LAST_MATCH_START[1], "new content"; # replaces "bbb" with "new content" </code></pre>

As I understand, you can use look-ahead or look-behind that don't consume characters. Or save data in groups and only remove what you are looking for. Examples: With look-ahead: <pre class="prettyprint"><code>s/your_text(?=ahead_text)//; </code></pre> Grouping data: <pre class="prettyprint"><code>s/(your_text)(ahead_text)/$2/; </code></pre>

This is an old question but I found the below easier for replacing lines that start with <code>>something</code> to <code>>something_else</code>. Good for changing the headers for fasta sequences <pre class="prettyprint"><code> while ($filelines=~ />(.*)\s/g){ unless ($1 =~ /else/i){ $filelines =~ s/($1)/$1\_else/; } } </code></pre>

Replace specific capture group instead of entire regex in Perl

Tags:

regex

replace

perl

capture-group

I've got a regular expression with capture groups that matches what I want in a broader context. I then take capture group $1 and use it for my needs. That's easy.

But how to use capture groups with s/// when I just want to replace the content of $1, not the entire regex, with my replacement?

For instance, if I do:

$str =~ s/prefix (something) suffix/42/

prefix and suffix are removed. Instead, I would like something to be replaced by 42, while keeping prefix and suffix intact.

787

asked Aug 26 '12 14:08

flohei

3 Answers

If you only need to replace one capture then using @LAST_MATCH_START and @LAST_MATCH_END (with use English; see perldoc perlvar) together with substr might be a viable choice:

use English qw(-no_match_vars);
$your_string =~ m/aaa (bbb) ccc/;
substr $your_string, $LAST_MATCH_START[1], $LAST_MATCH_END[1] - $LAST_MATCH_START[1], "new content";
# replaces "bbb" with "new content"

172

answered Oct 12 '22 23:10

Moritz Bunkus

As I understand, you can use look-ahead or look-behind that don't consume characters. Or save data in groups and only remove what you are looking for. Examples:

With look-ahead:

s/your_text(?=ahead_text)//;

Grouping data:

s/(your_text)(ahead_text)/$2/;

answered Oct 12 '22 21:10

Birei

This is an old question but I found the below easier for replacing lines that start with >something to >something_else. Good for changing the headers for fasta sequences

  while ($filelines=~ />(.*)\s/g){
        unless ($1 =~ /else/i){
                $filelines =~ s/($1)/$1\_else/;
        }

  }

answered Oct 12 '22 21:10

Jabda

Related questions
                            
                                Regex to match a CSS class name
                            
                                Regex to match a string NOT surrounded by brackets
                            
                                High performance simple Java regular expressions
                            
                                Regular expressions converting into a diagram [closed]
                            
                                Javascript RegExp non-capturing groups
                            
                                java regex: find pattern of 1 or more numbers followed by a single
                            
                                Why is Perl lazy when regex matching with * against a group?
                            
                                JavaScript: avoiding empty strings with String.split, and regular expression precedence
                            
                                Have trouble understanding capturing groups and back references
                            
                                What's a regex that matches all numbers except 1, 2 and 25?
                            
                                Complex string splitting
                            
                                How do I convert mod_rewrite (QSA option) to Nginx equivalent?
                            
                                How to deal with Polish Characters while using regex?
                            
                                How do I compare Rpm versions in python
                            
                                How exactly do Regular Expression word boundaries work in PHP?
                            
                                /bb|[^b]{2}/ how does it work? [closed]
                            
                                In Python, how to list all characters matched by POSIX extended regex `[:space:]`?
                            
                                Python regex match text between quotes
                            
                                Ruby String#scan equivalent to return MatchData
                            
                                regex for money values in JavaScript

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With