First of all, I know how most RegExp questions go; and this is not one of those, "please write my code" questions. My confusion lies in the fact that my <code>RegExp</code> works on regexr, and in chrome's dev tools when polling the <code>document.body.textContent</code>, but not on an HTML file after I have read it in io.js. io.js is version 1.5.1, running on windows 8 Why would it work in both places listed, but not in io.js? Am I not taking something into consideration that io.js does to read files? My <code>RegExp</code> should be matching "<code>@{each ___->___} text and line breaks @{/each}</code>" as it does in the link below, but instead, it returns <code>null</code> Here is what I'm trying to use: http://regexr.com/3aldk RegExp: <code>/@\{each ([a-zA-Z0-9->.]*)\}([\s\S]*)@\{\/each}/g</code> JS (Example): <pre class="prettyprint"><code>fs.readFile('view.html', {encoding:'utf8'}, function(error, html) { console.log(html.match(myRegExp)); // null }); </code></pre> HTML: <pre class="prettyprint"><code><!doctype html> <html> <head> <title>@{title}</title> </head> <body> <h1>@{foo.bar}</h1> Lorem ipsum dolor sit amet, @{foo.baz.hoo} @{each people->person} <div> @{person.name}: @{person.age} </div> @{/each} </body> </html> </code></pre> Am I missing something obvious, like a character, that is present on the back side, but not once served?

The issue here lies on the fine line between specification and implementations. ECMAscript 5.1 Specification states that: <blockquote> A <code>-</code> character can be treated literally or it can denote a range. It is treated literally if it is the first or last character of ClassRanges, the beginning or end limit of a range specification, or immediately follows a range specification. </blockquote> Regular-Expressions.info notes that: <blockquote> Hyphens at other positions in character classes where they can't form a range may be interpreted as literals or as errors. Regex flavors are quite inconsistent about this. </blockquote> <h3>Conclusions:</h3> The safe way of including a dash <code>-</code> minus sign in a character class is by either: <ul> <li>escaping it (eg. <code>[a-zA-Z0-9\->.]</code>)</li> <li>placing it as the first char. in the class (eg. <code>[-.>a-zA-Z0-9]</code>) <ul> <li>exception: in a negated class it goes 2nd, right after <code>^</code> (eg. <code>[^-.>a-zA-Z0-9]</code>)</li> </ul> </li> <li>placing it last in the class (eg. <code>[a-zA-Z0-9.>-]</code>)</li> </ul> General coding guidelines suggest placing your ranges first and ending the character class with the hyphen, this avoids ambiguity and helps readability. <hr> Summing it up, your RegEx should become: <pre class="prettyprint"><code>/@\{each ([a-zA-Z0-9>.-]*)\}([\s\S]*)@\{\/each}/g </code></pre> <hr> As an additional tip: you could also rewrite <code>[\s\S]</code> (any whitespace char. or any non-whitespace char.) into <code>[^]</code> (not nothing) which would end you up with the following RegEx: <pre class="prettyprint"><code>/@\{each ([a-zA-Z0-9>.-]*)\}([^]*)@\{\/each}/g </code></pre> <blockquote> JavaScript ... treats <code>[^]</code> as a negated empty character class that matches any single character. - source </blockquote>

RegExp not working on read HTML file

Tags:

javascript

regex

node.js

First of all, I know how most RegExp questions go; and this is not one of those, "please write my code" questions.

My confusion lies in the fact that my RegExp works on regexr, and in chrome's dev tools when polling the document.body.textContent, but not on an HTML file after I have read it in io.js.

io.js is version 1.5.1, running on windows 8

Why would it work in both places listed, but not in io.js? Am I not taking something into consideration that io.js does to read files?

My RegExp should be matching "@{each ___->___} text and line breaks @{/each}" as it does in the link below, but instead, it returns null

Here is what I'm trying to use: http://regexr.com/3aldk

RegExp:

/@\{each ([a-zA-Z0-9->.]*)\}([\s\S]*)@\{\/each}/g

JS (Example):

Click to copy

fs.readFile('view.html', {encoding:'utf8'}, function(error, html) {
    console.log(html.match(myRegExp)); // null
});

HTML:

Click to copy

<!doctype html>
<html>
    <head>
        <title>@{title}</title>
    </head>
    <body>
        <h1>@{foo.bar}</h1>
        <p>
            Lorem ipsum dolor sit amet, @{foo.baz.hoo}
        </p>
        @{each people->person}
            <div>
                <b>@{person.name}:</b> @{person.age}
            </div>
        @{/each}
    </body>
</html>

Am I missing something obvious, like a character, that is present on the back side, but not once served?

562

asked Mar 22 '15 16:03

ndugger

1 Answers

The issue here lies on the fine line between specification and implementations.

ECMAscript 5.1 Specification states that:

A - character can be treated literally or it can denote a range. It is treated literally if it is the first or last character of ClassRanges, the beginning or end limit of a range specification, or immediately follows a range specification.

Regular-Expressions.info notes that:

Hyphens at other positions in character classes where they can't form a range may be interpreted as literals or as errors. Regex flavors are quite inconsistent about this.

Conclusions:

The safe way of including a dash - minus sign in a character class is by either:

escaping it (eg. [a-zA-Z0-9\->.])
placing it as the first char. in the class (eg. [-.>a-zA-Z0-9])
- _{exception: in a negated class it goes 2nd, right after ^ (eg. [^-.>a-zA-Z0-9])}
placing it last in the class (eg. [a-zA-Z0-9.>-])

General coding guidelines suggest placing your ranges first and ending the character class with the hyphen, this avoids ambiguity and helps readability.

Summing it up, your RegEx should become:

Click to copy

/@\{each ([a-zA-Z0-9>.-]*)\}([\s\S]*)@\{\/each}/g

As an additional tip:

you could also rewrite [\s\S] (any whitespace char. or any non-whitespace char.) into [^] (not nothing)

which would end you up with the following RegEx:

Click to copy

/@\{each ([a-zA-Z0-9>.-]*)\}([^]*)@\{\/each}/g

JavaScript ... treats [^] as a negated empty character class that matches any single character. - source

answered Oct 16 '22 08:10

CSᵠ

Related questions
                            
                                AngularJS $http.get difference between then and success callback
                            
                                ng-click not working in firefox
                            
                                How to get a random color in my CreateJS shape?
                            
                                How do I create a best-fit polynomial curve in Javascript?
                            
                                Ajax not post base64 data of large image
                            
                                Is it possible to build console application with nw.js?
                            
                                Changing Image colour through Javascript
                            
                                AngularJS Directive templateUrl returns 400 though file URL loads
                            
                                Listen keyboard events on Google Map
                            
                                Position an html element at any x/y coordinate in a page
                            
                                How does the browser recognize angularjs tags in html?
                            
                                Mouse position within HTML 5 responsive Canvas
                            
                                How to recompile a directive upon inserting into DOM (angularjs)
                            
                                Compare elements of two arrays
                            
                                In Karma Testing, ReferenceError: describe is not defined
                            
                                Avoiding the deferred anti-pattern [duplicate]
                            
                                Handling events for mouse click and keydown or keypress (for non-modifier keys)
                            
                                Is "class.create" part of standard JavaScript?
                            
                                Cannot use GLOB with JSHint in Windows?
                            
                                How to build an array from a jSon object

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

RegExp not working on read HTML file

Tags:

javascript

regex

node.js

ndugger

People also ask

1 Answers

Conclusions:

CSᵠ

Recent Activity

Donate For Us