Parsing and Modifying LLVM IR code

Tags:

I want to read (parse) LLVM IR code (which is saved in a text file) and add some of my own code to it. I need some example of doing this, that is, how this is done by using the libraries provided by LLVM for this purpose. So basically what I want is to read in the IR code from a text file into the memory (perhaps the LLVM library represents it in AST form, I dont know), make modifications, like adding some more nodes in the AST and then finally write back the AST in the IR text file.

Although I need to both read and modify the IR code, I would greatly appreciate if someone could provide or refer me to some example which just read (parses) it.

970

asked Feb 06 '12 21:02

MetallicPriest

2 Answers

First, to fix an obvious misunderstanding: LLVM is a framework for manipulating code in IR format. There are no ASTs in sight (*) - you read IR, transform/manipulate/analyze it, and you write IR back.

Reading IR is really simple:

int main(int argc, char** argv)
{
    if (argc < 2) {
        errs() << "Expected an argument - IR file name\n";
        exit(1);
    }

    LLVMContext &Context = getGlobalContext();
    SMDiagnostic Err;
    Module *Mod = ParseIRFile(argv[1], Err, Context);

    if (!Mod) {
        Err.print(argv[0], errs());
        return 1;
    }

    [...]
  }

This code accepts a file name. This should be an LLVM IR file (textual). It then goes on to parse it into a Module, which represents a module of IR in LLVM's internal in-memory format. This can then be manipulated with the various passes LLVM has or you add on your own. Take a look at some examples in the LLVM code base (such as lib/Transforms/Hello/Hello.cpp) and read this - http://llvm.org/docs/WritingAnLLVMPass.html.

Spitting IR back into a file is even easier. The Module class just writes itself to a stream:

 some_stream << *Mod;

That's it.

Now, if you have any specific questions about specific modifications you want to do to IR code, you should really ask something more focused. I hope this answer shows you how to parse IR and write it back.

(*) _{IR doesn't have an AST representation inside LLVM, because it's a simple assembly-like language. If you go one step up, to C or C++, you can use Clang to parse that into ASTs, and then do manipulations at the AST level. Clang then knows how to produce LLVM IR from its AST. However, you do have to start with C/C++ here, and not LLVM IR. If LLVM IR is all you care about, forget about ASTs.}

166

answered Oct 02 '22 06:10

Eli Bendersky

This is usually done by implementing an LLVM pass/transform. This way you don't have to parse the IR at all because LLVM will do it for you and you will operate on a object-oriented in-memory representation of the IR.

This is the entry point for writing an LLVM pass. Then you can look at any of the already implemented standard passes that come bundled with LLVM (look into lib/Transforms).

answered Oct 02 '22 04:10

CAFxX

Related questions
                            
                                Build C++ plugin for Unity
                            
                                Multithreaded image processing in C++
                            
                                C++ best way to define cross-file constants
                            
                                GCC/Make Build Time Optimizations
                            
                                Cross-platform primitive data types in C++
                            
                                How to initialize a shared library on Linux
                            
                                Static variable initialization?
                            
                                Which is more efficient: Bit, byte or int?
                            
                                Printing messages to console from C++ DLL
                            
                                C/C++ implementation of simplex method [closed]
                            
                                In C++, initialize a class member with 'this' pointer during construction
                            
                                C++ Class Copy (pointer copy)
                            
                                AfxGetInstanceHandle() triggers an assertion failure
                            
                                Convert iterator to int
                            
                                C++ MFC Get current date and time
                            
                                Windows 7 timing functions - How to use GetSystemTimeAdjustment correctly?
                            
                                Using std::move
                            
                                Why is this considered an extended initializer list?
                            
                                Alternative to using % operator and / Operator in C++
                            
                                building log4cxx in vs 2010 c++

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With