I need to extract a string with only single or double digit number in them. my file (test) looks like <pre class="prettyprint"><code>test1correct test12something test123wrong </code></pre> In the above example, i want to grep only for test1correct and test12something I tried this grep "test[0-9]{1,2}" test but it gives me all the 3 lines.

use: <code>grep "test[0-9]{1,2}[^0-9]</code>"

Using lookaheads and lookbehinds you can specify "exactly one digit" or "exactly three digits" or whatever. This does exactly one digit: <pre class="prettyprint"><code>echo 'WB123_4' | grep -Po '(?<![[:digit:]])([[:digit:]]{1})(?![[:digit:]])' Result: 4 </code></pre> What it is doing is, find a digit that is not preceded by a digit, and also not followed by a digit. Also works for more than one digit. This does three digits, then at least one of anything else, then one digit: <pre class="prettyprint"><code>echo 'WB123_4' | grep -Po '(?<![[:digit:]])([[:digit:]]{3})(?![[:digit:]]).+(?<![[:digit:]])([[:digit:]]{1})(?![[:digit:]])' Result: 123_4 </code></pre> While I'm at it, this combination of grep and sed will find a string with three digits, then one or more of anything else, then one digit, and extract just those parts nicely. (There might have been another way to do that just in grep with groups.) <pre class="prettyprint"><code>echo 'WB123_4' | grep -Po '(?<![[:digit:]])([[:digit:]]{3})(?![[:digit:]]).+(?<![[:digit:]])([[:digit:]]{1})(?![[:digit:]])' | sed -r -e 's/[^[:digit:]]+/ /' Result: 123 4 </code></pre> Note: the -P flag to grep means to use Perl-style regular expressions, which lets you use lookaheads and lookbehinds.

regex and grep match only string with only single or double digit

Tags:

regex

grep

I need to extract a string with only single or double digit number in them. my file (test) looks like

test1correct
test12something
test123wrong

In the above example, i want to grep only for test1correct and test12something

I tried this grep "test[0-9]{1,2}" test but it gives me all the 3 lines.

622

asked Jul 31 '11 15:07

user238021

2 Answers

use: grep "test[0-9]{1,2}[^0-9]"

answered Oct 21 '22 03:10

Kaken Bok

Using lookaheads and lookbehinds you can specify "exactly one digit" or "exactly three digits" or whatever. This does exactly one digit:

echo 'WB123_4' | grep -Po '(?<![[:digit:]])([[:digit:]]{1})(?![[:digit:]])'
Result: 4

What it is doing is, find a digit that is not preceded by a digit, and also not followed by a digit. Also works for more than one digit. This does three digits, then at least one of anything else, then one digit:

echo 'WB123_4' | grep -Po '(?<![[:digit:]])([[:digit:]]{3})(?![[:digit:]]).+(?<![[:digit:]])([[:digit:]]{1})(?![[:digit:]])'
Result: 123_4

While I'm at it, this combination of grep and sed will find a string with three digits, then one or more of anything else, then one digit, and extract just those parts nicely. (There might have been another way to do that just in grep with groups.)

echo 'WB123_4' | grep -Po '(?<![[:digit:]])([[:digit:]]{3})(?![[:digit:]]).+(?<![[:digit:]])([[:digit:]]{1})(?![[:digit:]])' | sed -r -e 's/[^[:digit:]]+/ /'
Result: 123 4

Note: the -P flag to grep means to use Perl-style regular expressions, which lets you use lookaheads and lookbehinds.

answered Oct 21 '22 03:10

David M. Perlman

Related questions
                            
                                Can you use back references in the pattern part of a regular expression?
                            
                                Regex capture every occurrence of a word within two delimiters
                            
                                Regex elegant pattern match
                            
                                .Net Regex that Matches Strings With Any non-ASCII char in it
                            
                                Regular expression to remove any currency symbol from a string?
                            
                                sed - extract STRING between first occurrence of MATCH1 and next occurrence of MATCH2
                            
                                Java Regex - Extract Hashtags from String
                            
                                RegEx How to handle zero length strings?
                            
                                Java Regex: How detect a URL with file extension
                            
                                Regex PHP Only Match if Not Surrounded By Quotes
                            
                                Java support for non-BMP Unicode characters (i.e. codepoints > 0xFFFF) in their Regular Expression Library?
                            
                                Regex ignore underscores
                            
                                Using Java's Regex to extract a word from a path name
                            
                                Warning: preg_match() [function.preg-match]: Compilation failed: nothing to repeat at offset
                            
                                C# - Regex Match whole words
                            
                                How to redirect multiple domains to another domain except 1 directory using htaccess?
                            
                                RegEx to read multiple parameters of unknown multiplicity
                            
                                Two different regular expressions in one?
                            
                                Regex unordered matches
                            
                                Perl: add character to begin of a line

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With