
Advice on Python Parser Generators

Tags:

python

parser-generator

I've been given a task where I have to create a parser for a simple C-like language. I can use any programming language and tools I wish to create the parser, but I'm learning Python at the same time so it would be my preferred choice.

There are a few requirements my parser has to meet. First, it must be able to read a text file in the following format:

kind1 : spelling1
kind2 : spelling2
kind3 : spelling3
      .
      .
      .
kindn : spellingn

Each kind and spelling pair gives the type and value of one token in the language. This file is the result of running a sample of source code through the language's lexical analyser.
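For illustration, reading such a file into a list of tokens could look roughly like the sketch below. It uses only the standard library and assumes that each line holds one token and that the first colon separates kind from spelling. The `Token` type, the `read_tokens` helper, and the sample kind names are hypothetical, not part of the assignment.

```python
from typing import Iterable, Iterator, NamedTuple


class Token(NamedTuple):
    kind: str
    spelling: str


def read_tokens(lines: Iterable[str]) -> Iterator[Token]:
    """Yield a Token for every 'kind : spelling' line, skipping blank lines."""
    for line_no, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue
        # Split on the first colon only, so a spelling that is itself ":" survives.
        kind, sep, spelling = line.partition(":")
        if not sep:
            raise ValueError(f"line {line_no}: expected 'kind : spelling', got {line!r}")
        yield Token(kind.strip(), spelling.strip())


# Hypothetical sample input; the real kind names come from the lexical analyser.
sample = """\
int : int
id : test
lparen : (
"""
tokens = list(read_tokens(sample.splitlines()))
for token in tokens:
    print(token.kind, token.spelling)
```

An open file object can be passed in place of `sample.splitlines()`, since `read_tokens` only needs an iterable of lines.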

Secondly, I must be able to customise the output of the parser. Ideally I would like to output a file that has converted the kind:spelling list into another sequence of tokens that would be passed to the language's compiler to be converted into MIPS Assembly code. Here's a little example of the kind of thing I would like the parser to be able to produce:

%function int test
  %variable int x
  %variable int y
%begin
  %if %id y , %id x > %do
  %begin
    %return %num 0
  %end
  %return %num 1
%end

It would be a great help if someone could advise me on existing Python parser generators, and on whether I could achieve the sort of thing shown in the examples above.
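As a rough illustration of the output side, independent of which parser generator is used: the parser's actions could write to a small helper that tracks indentation. The `Emitter` class below is a hypothetical sketch, and the calls at the bottom are written by hand to mimic part of the sample output. In a real parser they would be driven by the grammar rules.

```python
class Emitter:
    """Collects output lines and tracks the current indentation depth."""

    def __init__(self, indent="  "):
        self.indent = indent
        self.depth = 0
        self.lines = []

    def emit(self, *parts):
        # Join the parts with single spaces and prefix the current indentation.
        self.lines.append(self.indent * self.depth + " ".join(parts))

    def text(self):
        return "\n".join(self.lines) + "\n"


# Hand-written calls standing in for parser actions (hypothetical).
out = Emitter()
out.emit("%function", "int", "test")
out.depth += 1
out.emit("%variable", "int", "x")
out.emit("%variable", "int", "y")
out.depth -= 1
out.emit("%begin")
out.depth += 1
out.emit("%return", "%num", "1")
out.depth -= 1
out.emit("%end")

print(out.text(), end="")
```

Keeping the output logic in one place like this means the emitted format can be changed without touching the grammar.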

269

asked Nov 21 '09 17:11

greenie

2 Answers

PyParsing is a Python library for building parsers directly in Python code, and it comes with a lot of interesting examples.

It is easy to get started:

from pyparsing import Word, alphas

# define grammar
greet = Word( alphas ) + "," + Word( alphas ) + "!"

# input string
hello = "Hello, World!"

# parse input string
print(hello, "->", greet.parseString(hello))

113

answered Oct 04 '22 15:10

miku

I recommend that you check out Lark: https://github.com/erezsh/lark

It can parse any context-free grammar, it automatically builds a parse tree (with line and column numbers), and it accepts the grammar in an EBNF-style format, which is simple to write and widely regarded as the standard notation.

answered Oct 04 '22 14:10

Erez
