I am new to regex and I am trying to come up with something that will match a text like below: ABC: (z) jan 02 1999 \n Notes: <ul> <li>text will always begin with "ABC:"</li> <li>there may be zero, one or more spaces between ':' and (z). </li> <li>Variations of (z) also possible - (zz), (zzzzzz).. etc but always a non-digit character enclosed in "()"</li> <li>there may be zero,one or more spaces between (z) and jan</li> <li>jan could be jan, january, etc</li> <li>date couldbe in any format and may/may not contain other text as part of it so I would really like to know if there is a regex I can use to capture anything and everything that is found between '(z)' and '\n'</li> </ul> Any help is greatly appreciated! Thank you

Without knowing the exact regex implementation you're making use of, I can only give general advice. (The syntax I will be perl as that's what I know, some languages will require tweaking) Looking at <code>ABC: (z) jan 02 1999 \n</code> <ul> <li> The first thing to match is ABC: So using our regex is <code>/ABC:/</code> </li> <li> You say ABC is always at the start of the string so <code>/^ABC/</code> will ensure that ABC is at the start of the string. </li> <li> You can match spaces with the <code>\s</code> (note the case) directive. With all directives you can match one or more with <code>+</code> (or 0 or more with <code>*</code>) </li> <li> You need to escape the usage of <code>(</code> and <code>)</code> as it's a reserved character. so <code></code> </li> <li> You can match any non space or newline character with <code>.</code> </li> <li> You can match anything at all with <code>.*</code> but you need to be careful you're not too greedy and capture everything. </li> </ul> So in order to capture what you've asked. I would use <code>/^ABC:\s*$.+?$\s*(.+)$/</code> Which I read as: <blockquote> Begins with ABC: May have some spaces has ( has some characters has ) may have some spaces then capture everything until the end of the line (which is <code>$</code>). </blockquote> I highly recommend keeping a copy of the following laying about http://www.cheatography.com/davechild/cheat-sheets/regular-expressions/

Regular Expression with wildcards to match any character

2 Answers

The following should work:

ABC: *\([a-zA-Z]+\) *(.+)

Explanation:

ABC:            # match literal characters 'ABC:'  *              # zero or more spaces \([a-zA-Z]+\)   # one or more letters inside of parentheses  *              # zero or more spaces (.+)            # capture one or more of any character (except newlines)

To get your desired grouping based on the comments below, you can use the following:

(ABC:) *(\([a-zA-Z]+\).+)

answered Sep 23 '22 14:09

Andrew Clark

Without knowing the exact regex implementation you're making use of, I can only give general advice. (The syntax I will be perl as that's what I know, some languages will require tweaking)

Looking at ABC: (z) jan 02 1999 \n

The first thing to match is ABC: So using our regex is /ABC:/
You say ABC is always at the start of the string so /^ABC/ will ensure that ABC is at the start of the string.
You can match spaces with the \s (note the case) directive. With all directives you can match one or more with + (or 0 or more with *)
You need to escape the usage of ( and ) as it's a reserved character. so 
You can match any non space or newline character with .
You can match anything at all with .* but you need to be careful you're not too greedy and capture everything.

So in order to capture what you've asked. I would use /^ABC:\s*$.+?$\s*(.+)$/

Which I read as:

Begins with ABC:

May have some spaces

has (

has some characters

has )

may have some spaces

then capture everything until the end of the line (which is $).

I highly recommend keeping a copy of the following laying about http://www.cheatography.com/davechild/cheat-sheets/regular-expressions/

answered Sep 25 '22 14:09

abablabab

Related questions
                            
                                Regex - match everything but forward slash
                            
                                Regex with -, ::, ( and )
                            
                                Escaping a parenthesis in grep/ack
                            
                                How can you detect if two regular expressions overlap in the strings they can match?
                            
                                How to make "grep" read patterns from a file?
                            
                                Convert SRE_Match object to string
                            
                                How do you capture a group with regex?
                            
                                Pattern matching field names with jq
                            
                                What does (?i) and ?@ in this regex mean [duplicate]
                            
                                Regex BNF Grammar
                            
                                php preg_match non greedy? [duplicate]
                            
                                Regex in spring controller
                            
                                Regular Expression Wildcard Matching
                            
                                Test if a string is regex
                            
                                RegEx for valid international mobile phone number [duplicate]
                            
                                Regex - check if input still has chances to become matching
                            
                                Python regex: matching a parenthesis within parenthesis
                            
                                How to measure similarity between strings?
                            
                                How to delete rows in python pandas DataFrame using regular expressions?
                            
                                Replacing the nth instance of a regex match in Javascript

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Regular Expression with wildcards to match any character

Tags:

regex

chapstick

People also ask

2 Answers

Andrew Clark

abablabab

Recent Activity

Donate For Us