Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How are classes implemented in compilers

I'd like to implement a class type for my own little language but what I thought at first wouldn't be too hard has got me stumped. I have the parser in place and it's the code generation side of things I'm having problems with. Can anyone shed any light on the best/correct way to go about this? Specifically I'd like to do this in LLVM so while I need to know the generalities of this any specific LLVM code I should be working with would be fantastic.

Thanks T.


N.B. The experience I have with LLVM is basically what comes from the Kaleidoscope tutorials and a little extra from playing around with it but I am far from having a full understanding of the LLVM API's.

like image 643
tjm Avatar asked Oct 28 '10 13:10

tjm


1 Answers

A very, very incomplete overview:

Class is a structure (you know C/C++ don't you?)

Methods are otherwise ordinary functions except they receive an extra implicit argument: the object itself. This argument is usually called 'this' or 'self' within the function. Class-scope symbols may (C++, JavaScript) or may not (PHP, Python) be accessible by default within methods.

Inheritance is essentially gluing together the structures and possibly also merging symbol tables as well, as normally symbols of the base class are accessible by default from within the methods of a class you are now parsing. When you encounter a symbol (field or method) within a method you need to do an ascending lookup, starting from the current class going up the hierarchy. Or you may implement it so that you look it up only in one symbol table which is a result of a merger.

Virtual methods are called indirectly. In some languages all methods are virtual by default. The implementation would depend on whether it's a fully dynamic language, in which case you always look up a function name within a class at run-time and thus all your methods become virtual automatically; or in case of static languages compilers usually build so called virtual method tables. I'm not sure if you need this at all, so I won't go into details here.

Constructors are special methods that are called either when constructing a new object (usually with 'new') or otherwise are called as part of the constructor call chain from within descendant constructors. Many different implementations are possible here, one being that a constructor takes an implicit 'this' argument, which may be NULL if an object hasn't been created yet, and returns it as well.

Destructiors are ordinary methods that are normally called implicitly when an object goes out of scope. Again you need to take into account a possibility of an ascending call chain for destructors.

Interfaces are tricky unless, again, your language is fully dynamic.

like image 130
mojuba Avatar answered Sep 17 '22 15:09

mojuba