JavaScript regular expression: match anything up until something (if it exists)

I am new to regular expressions, and this may be a very easy question (hopefully).

I am trying to use one solution for three kinds of strings:

"45%", expected result: "45"
"45", expected result: "45"
"", expected result: ""

What I am trying (let the string be str):

str.match(/(.*)(?!%*)/i)[1]

In my head, this reads as "match any instance of anything up until '%' if it is found, or else just match anything".

In Firebug's head, it seems to read more like "just match anything and completely disregard the negative lookahead". Making it lazy with (.*)? doesn't seem to help either.

Let's forget for a second that in this specific situation I am only matching numbers, so a /\d*/ would do. I am trying to understand a general rule so that I can apply it whenever.

Would anybody be so kind as to help me out?

asked Dec 21 '11 03:12

undefinederror

2 Answers

How about the simpler

str.match(/[^%]*/i)[0]

This means: match zero or more characters that are not a %.
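As a quick check, here is a minimal sketch that runs the pattern against the three strings from the question. The helper name upToPercent is made up for this illustration and is not part of the answer.

```javascript
// Hypothetical helper name, used only for this illustration.
function upToPercent(str) {
  // [^%]* matches zero or more characters that are not "%".
  // It can match the empty string, so match() does not return null here.
  return str.match(/[^%]*/)[0];
}

const results = ["45%", "45", ""].map(upToPercent);
console.log(results); // [ '45', '45', '' ]
```

Because the pattern is allowed to match nothing, the empty-string case needs no special handling.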

Edit: If you need to parse until </a>, then you could match a sequence of characters followed by </a> and then discard the </a>, which means you should use a positive lookahead instead of a negative one.

str.match(/.*?(?=<\/a>|$)/i)[0]

This means: match zero or more characters lazily, until reaching a </a> or the end of the string.
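A small sketch of how that pattern behaves on a few sample strings. The sample inputs and the helper name textBeforeClosingAnchor are assumptions made for this example.

```javascript
// Hypothetical helper name, used only for this illustration.
function textBeforeClosingAnchor(str) {
  // .*?          lazily consume characters (line terminators excluded)
  // (?=<\/a>|$)  stop once "</a>" or the end of the string comes next,
  //              without consuming it
  // i            also accept "</A>"
  return str.match(/.*?(?=<\/a>|$)/i)[0];
}

const samples = ["link text</a> tail", "Upper</A>", "no closing tag", ""];
const extracted = samples.map(textBeforeClosingAnchor);
console.log(extracted); // [ 'link text', 'Upper', 'no closing tag', '' ]
```

The |$ alternative is what covers the "if it exists" part of the question: without it, a string with no </a> would not match at all.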

Note that *? is a single operator: (.*)? is not the same as .*?. The former is an optional group whose .* is still greedy.
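To make the difference concrete, here is a short sketch. The input "a%b%c" and the variable names are invented for the example.

```javascript
const input = "a%b%c";

// (.*)? is an optional group around a greedy .*, so it grabs as much as
// it can and only backs off to the LAST "%".
const optionalGreedy = input.match(/(.*)?%/)[1];

// .*? is the lazy quantifier, so the group stops before the FIRST "%".
const lazy = input.match(/(.*?)%/)[1];

// On the question's own string with nothing after the quantifier:
const wholeString = "45%".match(/(.*)?/)[0]; // greedy: takes everything
const nothing = "45%".match(/.*?/)[0];       // lazy: takes as little as possible

console.log(optionalGreedy); // a%b
console.log(lazy);           // a
console.log(JSON.stringify(wholeString), JSON.stringify(nothing)); // "45%" ""
```

A lazy quantifier with nothing after it matches the empty string, which is why .*? only becomes useful once something (a literal or a lookahead) follows it.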

(And don't parse HTML with a single regex, as usual.)

answered Oct 29 '22 06:10

kennytm

I think this is what you're looking for:

/(?:(?!%).)*/

The . matches any character, but only after the negative lookahead, (?!%), confirms that the character is not %. Note that when the sentinel is a single character like %, you can use a negated character class instead, for example:

/[^%]*/

But for a multi-character sentinel like </a>, you have to use the lookahead approach:

/(?:(?!<\/a>).)*/i

This is actually saying "Match zero or more characters one at a time, but if the next character turns out to be the beginning of the sequence </a> or </A>, stop without consuming it".
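Here is a minimal sketch of both patterns in use. The sample strings and variable names are assumptions for illustration; note that inside a JavaScript regex literal the slash in </a> has to be written as \/.

```javascript
// Lookahead-guarded dot: consume one character at a time as long as
// "</a>" (in any case, because of the i flag) does not start here.
const beforeClosingTag = /(?:(?!<\/a>).)*/i;

const inputs = ["click here</a> and more", "click here</A>", "a < b", ""];
const matched = inputs.map((s) => s.match(beforeClosingTag)[0]);
console.log(matched); // [ 'click here', 'click here', 'a < b', '' ]

// For a single-character sentinel the two forms differ in one detail:
// "." does not match line terminators, while [^%] does.
const multiline = "45\n46%";
const withClass = multiline.match(/[^%]*/)[0];     // "45\n46"
const withDot = multiline.match(/(?:(?!%).)*/)[0]; // "45"
console.log(JSON.stringify(withClass), JSON.stringify(withDot));
```

A lone "<" is consumed normally, because the lookahead only fails where the whole sequence </a> begins.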

answered Oct 29 '22 06:10

Alan Moore
