Replace comment in JavaScript AST with subtree derived from the comment's content

I'm the author of doctest, quick and dirty doctests for JavaScript and CoffeeScript. I'd like to make the library less dirty by using a JavaScript parser rather than regular expressions to locate comments.

I'd like to use Esprima or Acorn to do the following:

1. Create an AST
2. Walk the tree, and for each comment node:
   1. Create an AST from the comment node's text
   2. Replace the comment node in the main tree with this subtree

Input:

!function() {

  // > toUsername("Jesper Nøhr")
  // "jespernhr"
  var toUsername = function(text) {
    return ('' + text).replace(/\W/g, '').toLowerCase()
  }

}()

Output:

!function() {

  doctest.input(function() {
    return toUsername("Jesper Nøhr")
  });
  doctest.output(4, function() {
    return "jespernhr"
  });
  var toUsername = function(text) {
    return ('' + text).replace(/\W/g, '').toLowerCase()
  }

}()

I don't know how to do this. Acorn provides a walker which takes a node type and a function, and walks the tree invoking the function each time a node of the specified type is encountered. This seems promising, but doesn't apply to comments.

With Esprima I can use esprima.parse(input, {comment: true, loc: true}).comments to get the comments, but I'm not sure how to update the tree.

asked Feb 06 '13 06:02

davidchambers

2 Answers

Most AST-producing parsers throw away comments. I don't know what Esprima or Acorn do, but that might be the issue.

In fact, Esprima lists comment capture as a current bug: http://code.google.com/p/esprima/issues/detail?id=197

Acorn's code is right there on GitHub. It appears to throw comments away, too.

So it looks like you either fix one of the parsers to capture the comments first, at which point your task should be straightforward, or you're stuck.

Our DMS Software Reengineering Toolkit has JavaScript parsers that capture comments in the tree. It also has language substring parsers that could be used to parse the comment text into JavaScript ASTs of whatever type the comment represents (e.g., function declaration, expression, variable declaration, ...), and the support machinery to graft such new ASTs into the main tree. If you are going to manipulate ASTs, this substring capability is likely important: most parsers won't parse arbitrary language fragments; they are wired only to parse "whole programs". For DMS, there are no comment nodes to replace; there are comments associated with AST nodes, so the grafting process is a little trickier than just "replace comment nodes". Still pretty easy.
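
To make the grafting idea concrete outside of DMS, here is a rough sketch that inserts statements derived from comments into a statement list by source position. Everything in it is an assumption for illustration: `graftByRange` is a hypothetical helper, the nodes are hand-written ESTree-style objects carrying a `range` of `[start, end]` offsets, and only a single flat `body` array is handled (nested blocks would need a recursive walk).

```javascript
// Hypothetical helper: merge statements built from comments into a flat
// statement list, ordered by where each one starts in the source.
function graftByRange(body, newNodes) {
  var result = body.slice(); // leave the original array untouched
  newNodes.forEach(function (node) {
    // Find the first existing statement that starts at or after the new node.
    var index = result.findIndex(function (stmt) {
      return stmt.range[0] >= node.range[0];
    });
    if (index === -1) index = result.length; // new node goes last
    result.splice(index, 0, node);
  });
  return result;
}

// Hand-written stand-ins for parsed statements and comment-derived statements.
var body = [
  {type: 'VariableDeclaration', range: [20, 30]},
  {type: 'ExpressionStatement', range: [40, 50]}
];
var fromComments = [
  {type: 'ExpressionStatement', range: [0, 10]},
  {type: 'ExpressionStatement', range: [32, 38]}
];
var grafted = graftByRange(body, fromComments);
```

The comment-derived statements end up interleaved with the existing ones in source order, which is roughly what "replace the comment with its subtree" amounts to when comments are not nodes in the tree.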

I'll observe that most parsers (including these) read the source and break it into tokens by applying the equivalent of regular expressions. So, if you are already using regular expressions to locate comments (which also means using them to locate *non*-comments to throw away, e.g., you need to recognize string literals that contain comment-like text and ignore them), you are doing as well as the parsers would do anyway in terms of finding the comments. And if all you want to do is replace the comments exactly with their content, echoing the source stream with the comment prefix/suffix /* */ stripped will apparently do exactly what you want, so all this parsing machinery seems like overkill.
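
As a rough illustration of the point about string literals, here is a minimal hand-rolled scanner that reports comments and skips quoted strings. It is a sketch under stated assumptions, not what Esprima or Acorn do internally: `findComments` is a hypothetical name, the returned objects merely imitate a `{type, value, range}` shape, and regex literals and template strings are deliberately not handled.

```javascript
// Minimal sketch: locate // and /* */ comments while skipping '...' and "..."
// string literals. Assumption: regex literals and template strings are NOT
// recognized, so comment-like text inside them would be misreported.
function findComments(source) {
  var comments = [];
  var i = 0;
  while (i < source.length) {
    var ch = source[i];
    var next = source[i + 1];
    if (ch === '"' || ch === "'") {
      // Skip the whole string literal, stepping over backslash escapes.
      var quote = ch;
      i += 1;
      while (i < source.length && source[i] !== quote) {
        i += source[i] === '\\' ? 2 : 1;
      }
      i += 1; // move past the closing quote
    } else if (ch === '/' && next === '/') {
      // Line comment: runs to the end of the line (newline excluded).
      var end = source.indexOf('\n', i);
      if (end === -1) end = source.length;
      comments.push({type: 'Line', value: source.slice(i + 2, end), range: [i, end]});
      i = end;
    } else if (ch === '/' && next === '*') {
      // Block comment: runs to the closing */ (or end of input if unterminated).
      var close = source.indexOf('*/', i + 2);
      var valueEnd = close === -1 ? source.length : close;
      var stop = close === -1 ? source.length : close + 2;
      comments.push({type: 'Block', value: source.slice(i + 2, valueEnd), range: [i, stop]});
      i = stop;
    } else {
      i += 1;
    }
  }
  return comments;
}

var sample = 'var s = "// not a comment"; // real\n';
var found = findComments(sample);
```

Here `found` should contain only the trailing comment, since the `//` inside the string literal is skipped.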

answered Sep 18 '22 15:09

Ira Baxter

You can already use Esprima to achieve what you want:

1. Parse the code and get the comments (as an array).
2. Iterate over the comments and check whether each one is of interest.
3. If a comment needs to be transformed, note its range. Collect all the transformations.
4. Apply the transformations from last to first so that the ranges are not shifted.

The trick here is not to change the AST. Simply apply the text change as if you were doing a typical search and replace on the source string directly. Because each replacement can shift the positions of everything after it, you need to collect all the replacements first and then apply them starting from the last one. For an example of how to carry out such a transformation, take a look at my blog post "From double-quotes to single-quotes" (it deals with string quotes, but the principle remains the same).
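
A minimal sketch of the approach described above, kept free of dependencies. The comment objects are written by hand to imitate what `esprima.parse(input, {comment: true, range: true, loc: true}).comments` reports; `rewriteComment` and `applyEdits` are hypothetical helper names, and the `doctest.input` / `doctest.output` wrappers follow the shape shown in the question. Indentation of the original comment is not carried over to the replacement lines.

```javascript
// Hypothetical helper: build the replacement text for one doctest comment.
// A leading ">" marks an input line; anything else is treated as expected output.
function rewriteComment(comment) {
  var text = comment.value.trim();
  if (text.charAt(0) === '>') {
    return 'doctest.input(function() {\n  return ' + text.slice(1).trim() + '\n});';
  }
  return 'doctest.output(' + comment.loc.start.line +
    ', function() {\n  return ' + text + '\n});';
}

// Apply [start, end) replacements from the last range to the first, so the
// offsets of edits that have not been applied yet remain valid.
function applyEdits(source, edits) {
  return edits
    .slice()
    .sort(function (a, b) { return b.range[0] - a.range[0]; })
    .reduce(function (out, edit) {
      return out.slice(0, edit.range[0]) + edit.text + out.slice(edit.range[1]);
    }, source);
}

var source = ['// > 1 + 1', '// 2', 'var x = 0'].join('\n');

// Hand-written stand-ins for the parser's comment list (an assumption here).
var comments = [
  {type: 'Line', value: ' > 1 + 1', range: [0, 10], loc: {start: {line: 1}}},
  {type: 'Line', value: ' 2', range: [11, 15], loc: {start: {line: 2}}}
];

var edits = comments.map(function (c) {
  return {range: c.range, text: rewriteComment(c)};
});
var rewritten = applyEdits(source, edits);
```

Sorting in descending order of start offset is what keeps the second range correct even though the first replacement is longer than the comment it replaces.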

Last but not least, you might want to use a slightly higher-level utility such as Rocambole.

answered Sep 19 '22 15:09

Ariya Hidayat

Tags: javascript, parsing, abstract-syntax-tree
