
Regex: Matching 4-Digits within words

Tags:

regex

perl

I have a body of text from which I'm looking to pull repeated sets of 4-digit numbers.

For example:

The first is 1234 2) The Second is 2098 3) The Third is 3213

Now I know I'm able to get the first set of digits out by simply using:

    /\d{4}/

...returning 1234

But how do I match the second set of digits, or the third, and so on...?

Edit: How do I return 2098, or 3213?


asked Aug 24 '13 20:08

Andy 'Drew' Dodd

1 Answer

You don't appear to have a proper answer to your question yet.

The solution is to use the /g modifier on your regex. In list context it will find all of the numbers in your string at once, like this:

    my $str = 'The first is 1234 2) The Second is 2098 3) The Third is 3213';
    my @numbers = $str =~ /\b \d{4} \b/gx;
    print "@numbers\n";

Output:

    1234 2098 3213
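Because a list-context match returns an ordinary list, a single match such as 2098 or 3213 can be picked out by index, which is what the edit to the question asks about. This is only a minimal sketch: the names `$second` and `$third` are illustrative, and it assumes the string really contains at least three four-digit numbers (otherwise the values would be undefined).

```perl
use strict;
use warnings;

my $str = 'The first is 1234 2) The Second is 2098 3) The Third is 3213';

# Wrapping the match in parentheses and subscripting it is a list slice:
# the /g match runs in list context and the index selects one result.
# Indices start at 0, so [1] is the second match and [2] the third.
my $second = ( $str =~ /\b \d{4} \b/gx )[1];
my $third  = ( $str =~ /\b \d{4} \b/gx )[2];

print "$second $third\n";    # prints: 2098 3213
```

If several of the numbers are needed, capturing them once into an array and indexing that array avoids scanning the string repeatedly.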

Or you can iterate through them, using scalar context in a while loop, like this:

    while ($str =~ /\b (\d{4}) \b/gx) {
      my $number = $1;
      print $number, "\n";
    }

Output:

    1234
    2098
    3213

I have added the \b (word boundary) assertions to the regex so that it only matches whole four-digit numbers and doesn't, for example, find 1234 in 1234567. The /x modifier just allows me to add insignificant whitespace to the pattern so that it is more readable.
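To see what the \b assertions change, here is a small sketch that runs the pattern with and without them. The sample string and the names `@loose` and `@strict` are made up for illustration.

```perl
use strict;
use warnings;

# Hypothetical sample: one seven-digit run and one real four-digit number.
my $text = 'id 1234567 and code 4321';

# Without \b the pattern happily takes the first four digits of 1234567.
my @loose = $text =~ /\d{4}/g;

# With \b on both sides only a standalone four-digit number can match.
my @strict = $text =~ /\b \d{4} \b/gx;

print "@loose\n";     # prints: 1234 4321
print "@strict\n";    # prints: 4321
```

Note that \b treats letters, digits and underscores alike as word characters, so a number glued to a letter, as in abc1234, would not match the bounded pattern either.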

answered Oct 11 '22 03:10

Borodin
