How do I reduce my parse tree into an abstract syntax tree?


What are the general strategies for reducing a parse tree (i.e., a concrete syntax tree) into an abstract syntax tree?

For example, I have the following grammar rule:

statement_list : statement
               | statement_list statement

which, if left as a parse tree, will generate fanning output that looks like

program
        statement_list
                statement_list
                        statement
                                definition
                                        p_type
                                        assignment
                statement
                        definition
        statement
                assign
                        assignment

If I concatenate the children of each node (since a statement list has no inherent meaning after parsing), I can achieve the following

program
        definition
                p_type
                assignment
        definition
        assign
                assignment

This worked well; however, I'm unaware of any "rules" for doing this. Are there specific kinds of grammar rules I should be looking to simplify? Is it a matter of feel, or is there a more mechanical process?
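
A minimal sketch of the flattening described above, in Python. The `Node` class and the `TRANSPARENT` set are hypothetical names chosen for illustration (they are not from any parser library); the sketch assumes the parse tree is a plain tree of named nodes and that `statement_list` and `statement` are the rules to splice out.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    # Hypothetical tree node: a rule name plus child nodes.
    name: str
    children: list = field(default_factory=list)


# Assumption: these rules carry no meaning of their own after parsing.
TRANSPARENT = {"statement_list", "statement"}


def flatten(node):
    """Return the list of nodes that replaces `node` in its parent."""
    new_children = []
    for child in node.children:
        new_children.extend(flatten(child))
    if node.name in TRANSPARENT:
        # Splice this node's children directly into the parent.
        return new_children
    return [Node(node.name, new_children)]


def dump(node, depth=0):
    """Render the tree one node per line, indented by depth."""
    lines = ["    " * depth + node.name]
    for child in node.children:
        lines.extend(dump(child, depth + 1))
    return lines


# The parse tree from the question.
tree = Node("program", [
    Node("statement_list", [
        Node("statement_list", [
            Node("statement", [
                Node("definition", [Node("p_type"), Node("assignment")]),
            ]),
        ]),
        Node("statement", [Node("definition")]),
    ]),
    Node("statement", [Node("assign", [Node("assignment")])]),
])

ast = flatten(tree)[0]  # the root is not transparent, so exactly one node comes back
print("\n".join(dump(ast)))
```

Because the splice happens on the way back up the recursion, nested list rules of any depth collapse into a single level without special handling for left or right recursion.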


asked Jul 30 '13 01:07

sdasdadas

1 Answer

It's not a matter of "feel". An abstract syntax tree depends on the meaning (semantics) of what's been parsed, and I think these would be the rules:

Remove nodes for tokens that don't add meaning. Those are intermediate keywords (like "then"), separators (like commas), and brackets (like parentheses).
Promote meaningful tokens (like "if") to be the parent of the other nodes in the same rule.

There's no single recipe. It depends on what the phrases in the target language mean.
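
As one illustration of the two rules above, here is a rough Python sketch. It assumes a parse tree built from `(name, children)` tuples for rules and `(name, text)` tuples for tokens; the token names in `NOISE` and `OPERATORS` are hypothetical, and a real grammar would define its own.

```python
# Assumption: tokens that add no meaning once parsing is done.
NOISE = {"THEN", "COMMA", "LPAREN", "RPAREN"}
# Assumption: tokens meaningful enough to become the parent node.
OPERATORS = {"IF"}


def to_ast(node):
    """Reduce a parse tree node to an AST node."""
    name, payload = node
    if isinstance(payload, str):
        return node  # a token leaf is kept unchanged
    # Rule 1: drop noise tokens, then reduce the remaining children.
    kept = [to_ast(child) for child in payload if child[0] not in NOISE]
    # Rule 2: if exactly one operator token is present, promote it.
    heads = [child for child in kept if child[0] in OPERATORS]
    if len(heads) == 1:
        head = heads[0]
        rest = [child for child in kept if child is not head]
        return (head[0], rest)
    return (name, kept)


# Parse tree for something like: if ( x ) then y
parse_tree = ("if_stmt", [
    ("IF", "if"),
    ("LPAREN", "("),
    ("expr", [("NAME", "x")]),
    ("RPAREN", ")"),
    ("THEN", "then"),
    ("stmt", [("NAME", "y")]),
])

ast = to_ast(parse_tree)
print(ast)
```

Which tokens count as noise and which get promoted is a decision about the language's semantics, which is why the two sets are written out by hand here and not derived from the grammar.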


answered Sep 21 '22 07:09

Apalala


Tags:

parsing

compiler-construction

grammar

abstract-syntax-tree

concrete-syntax-tree
