How to match the Nth word of a line containing a specific word using regex

Tags:

regex

I'm trying to do to get the correct regular expression to match the Nth word of a line containing a specific word.

For example, if I have this input:

this is the first line - blue
this is the second line - green
this is the third line - red

I want to match the seventh word of the lines containing the word "second" and return green.

I'm using Rubular to test the regular expression.

I already tried out this regular expression without success - it is matching the next line:

(.*second.*)(?<data>.*?\s){7}(.*)

Another example input:

this is the Foo line - blue
this is the Bar line - green
this is the Test line - red

I want to match the fourth word of the lines containing the word "red" and return Test.

The word I want to match can come either before or after the word I use to select the line.

625

asked Jan 31 '14 16:01

Jorge

2 Answers

You can use this to match a line containing second and grab the 7th word:

^(?=.*\bsecond\b)(?:\S+ ){6}(\S+)

Make sure that the global and multiline flags are active.

^ matches the beginning of a line.

(?=.*\bsecond\b) is a positive lookahead to make sure there's the word second in that particular line.

(?:\S+ ){6} matches 6 words.

(\S+) will get the 7th.

regex101 demo

You can apply the same principle with other requirements.

With a line containing red and getting the 4th word...

^(?=.*\bred\b)(?:\S+ ){3}(\S+)

107

answered Oct 21 '22 13:10

Jerry

You asked for regex, and you got a very good answer.

Sometimes you need to ask for the solution, and not specify the tool.

Here is the one-liner that I think best suits your need:

awk '/second/ {print $7}' < inputFile.txt

Explanation:

/second/     - for any line that matches this regex (in this case, literal 'second')
print $7     - print the 7th field (by default, fields are separated by space)

I think it is much easier to understand than the regex - and it's more flexible for this kind of processing.

answered Oct 21 '22 11:10

Floris

Related questions
                            
                                Matching only one occurrence of a character from a given set
                            
                                Is there a database that can store regex as values?
                            
                                Lazy quantifier {,}? not working as I would expect
                            
                                Find overlapping Regexp matches
                            
                                Check for valid domain name in a string?
                            
                                Recognize URL in plain text
                            
                                how to replace last occurrence of a word in javascript?
                            
                                Wrap Text in P tag
                            
                                How to convert javascript regex to safe java regex?
                            
                                Regex in Python to find words that follow pattern: vowel, consonant, vowel, consonant
                            
                                JavaScript RegExp match text ignoring HTML
                            
                                Converting new lines to paragraph/br HTML tags, can this be a single regex?
                            
                                In Perl, what is the difference between s/^\s+// and s/\s+$//?
                            
                                Simple PHP Regex String Starts With
                            
                                Javascript replace() and $1 issue
                            
                                How to set up REGEX that doesn't match anything?
                            
                                Easiest way to test vim regex?
                            
                                sed error "Invalid range end"
                            
                                Regex to match nested json objects
                            
                                Bug in .net Regex.Replace?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With