Writing a compiler; which VM?

Tags:

compiler-construction

I'm going to try to write a compiler for a dynamic language, preferably one that targets an existing virtual machine: I don't (yet) want to deal with garbage collection and the myriad other concerns a good VM handles for you. Which VMs do you suggest?

I'm on Linux, so I don't know if .NET (via Mono) is that good an idea. I've heard that Parrot is good for dynamic languages, but I haven't heard of any language that uses it. Should I invent my own? Does LLVM even count as a VM I should compile against, or is it as hard as targeting x86 directly?

Also, what are the pros and cons of stack-based vs register-based VMs?

Performance and tool support would be important. I'll be writing the compiler in Haskell, so a good interface with that is a plus.


asked Jul 20 '10 21:07

pavpanchekha

2 Answers

The JVM (Java) and the CLR (.NET) seem to be the two most common targets for this, since both handle garbage collection and most of the other runtime concerns for you. Both provide fairly straightforward instruction sets to work with.

The CLR has one advantage: it was designed from the start to support multiple languages, and it's (IMO) slightly easier to work with, especially if your language doesn't fit the "mold" of the languages that originally targeted the runtime. Mono works well enough that I wouldn't shy away from a CLR target because of it.
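Both the JVM and the CLR are stack-based machines, so generating code for an expression amounts to a post-order walk of the syntax tree: emit the operands, then the operator. Here is a minimal sketch of that idea, in Python for brevity rather than Haskell. The instruction names (`PUSH`, `ADD`, `MUL`) and the helper functions are hypothetical and are not real JVM or CIL opcodes.

```python
from dataclasses import dataclass

# Hypothetical mini AST; a real front-end would have many more node types.
@dataclass
class Num:
    value: int

@dataclass
class BinOp:
    op: str        # "+" or "*"
    left: object
    right: object

# Hypothetical opcode names, NOT real JVM or CIL instructions.
OPCODES = {"+": "ADD", "*": "MUL"}

def compile_expr(node):
    """Post-order walk: code for both operands first, then the operator."""
    if isinstance(node, Num):
        return [("PUSH", node.value)]
    return (compile_expr(node.left)
            + compile_expr(node.right)
            + [(OPCODES[node.op],)])

def run(code):
    """Tiny interpreter for the instruction list, to check the output."""
    stack = []
    for instr in code:
        if instr[0] == "PUSH":
            stack.append(instr[1])
        else:
            right = stack.pop()
            left = stack.pop()
            stack.append(left + right if instr[0] == "ADD" else left * right)
    return stack.pop()

# 1 + 2 * 3
expr = BinOp("+", Num(1), BinOp("*", Num(2), Num(3)))
code = compile_expr(expr)
result = run(code)
```

Note that the compiler never has to name a temporary: intermediate results simply live on the operand stack. A register-based target would instead need a destination chosen for every intermediate value.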

answered Sep 25 '22 20:09

Reed Copsey

LLVM gives you a much better programming model than straight x86 assembly. Yes, it's low-level. But you don't have to worry about register allocation or instruction scheduling, or about fully optimizing your output. Also, while you're still writing your front-end, you can take advantage of its type system to catch mistakes you might make.
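To give a feel for that programming model, here is a rough sketch that builds the textual LLVM IR for a two-argument integer function. It is written in Python only for brevity; the helper name `emit_binary_function` and the function name `add_ints` are made up for illustration. The IR uses named virtual registers (`%a`, `%b`, `%result`) of which you can create as many as you like, and every value carries a type (`i32` here).

```python
def emit_binary_function(name, opcode):
    """Return textual LLVM IR for an i32 function applying `opcode` to two args.

    Hypothetical helper: a real compiler would build IR through a library
    binding rather than by formatting strings.
    """
    lines = [
        f"define i32 @{name}(i32 %a, i32 %b) {{",
        "entry:",
        # %result is a virtual register; LLVM decides where it actually lives.
        f"  %result = {opcode} i32 %a, %b",
        "  ret i32 %result",
        "}",
    ]
    return "\n".join(lines)

ir_text = emit_binary_function("add_ints", "add")
print(ir_text)
```

Because each operand is typed, mixing an `i32` with, say, a `double` in the `add` would be rejected when the IR is checked, which is the kind of front-end mistake the type system helps catch.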

That said, you'll have to develop your own runtime layer to take care of the "dynamic" parts of your language. For that reason alone, I would lean toward the CLR.
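As an illustration of what such a runtime layer involves, here is a minimal sketch of a run-time `+` for a dynamic language, where every value carries a type tag and the operation dispatches on those tags when it executes. The tag names, the tuple layout, and the `dyn_add` helper are all assumptions made for this example; it is written in Python for brevity, whereas a runtime for LLVM-compiled code would more likely be written in C or similar.

```python
# Hypothetical type tags; a real runtime would use a compact binary layout.
INT, STR = "int", "str"

def make_int(n):
    """Box an integer as a (tag, payload) pair."""
    return (INT, n)

def make_str(s):
    """Box a string as a (tag, payload) pair."""
    return (STR, s)

def dyn_add(a, b):
    """Run-time '+': inspect both tags, then pick the matching operation."""
    tag_a, val_a = a
    tag_b, val_b = b
    if tag_a == INT and tag_b == INT:
        return (INT, val_a + val_b)
    if tag_a == STR and tag_b == STR:
        return (STR, val_a + val_b)
    # The compiler could not rule this out statically, so it fails at run time.
    raise TypeError(f"cannot add {tag_a} and {tag_b}")

total = dyn_add(make_int(2), make_int(3))
joined = dyn_add(make_str("ab"), make_str("cd"))
```

Compiled code would call a helper like this wherever the source program uses `+`. Memory management for the boxed values is a further piece you would have to supply yourself on LLVM, whereas the CLR and JVM provide it.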

answered Sep 24 '22 20:09

Karmastan
