What kinds of optimization LLVM does and what kinds of optimizations its frontends have to implement themselves?

Tags:

llvm

Notice: I noticed this question is a lot related to this one, so if you're somebody interested in my question, you should definitely read that other one and its answers too.

I can think of some optimizations an OOP language frontend could do, such as creating temporary variables to hold values from const method calls called in sequence, without intermediary non-const calls to the object in question, to cut off function calls, but I can't think of many more. I'd like to ask people to create a longer list of examples.

I ask this because I want to create a small language as a pet project and I'm not sure how to study this subject very well. Maybe this is a case for the community wiki? A comprehensive list of optimizations the LLVM backend does and that frontends should do themselves, what do you think?

Oh, and I know different frontends can have widely different needs, but my focus is on procedural/OOP languages.

476

asked Sep 05 '11 17:09

Gui Prá

1 Answers

This probably varies a lot by language... clang (C/C++) is able to get away with doing very little in terms of optimizations in the frontend. The only optimization I can think of that is done for performance of the generated code is that clang does some devirtualization of C++ methods in the frontend. clang does some other optimizations like constant folding and dead code elimination, but that's primarily done to speed up compile-time, not for the performance of the generated code.

EDIT: Actually, thinking about it a bit more, I just remembered one more important optimization clang does for C++: clang knows a few tricks to elide copy constructors in C++ (google for NRVO).

In some cases, a language-specific IR optimization pass can be useful. There is a a SimplifyLibCalls pass which knows how to optimize calls into the C standard library. For the new Objective-C ARC language feature, clang puts some ARC-specific passes into the pipeline; those optimize out calls to various Objective-C runtime functions.

In general, implementing optimizations in the frontend is only generally helpful when code has properties which cannot be encoded into the IR (e.g. C++ objects have a constant vtable pointer). And in practice, you most likely want to implement dumb code generation first, and see whether there are important cases which are not optimized. The optimizers can do some surprisingly complex transformations.

See also http://llvm.org/docs/tutorial/LangImpl7.html ; using alloca appropriately is one thing which helps the optimizers substantially, although it isn't really an optimization itself.

168

answered Oct 05 '22 17:10

servn

Related questions
                            
                                How to generate LLVM SSA Format
                            
                                Difference b/w llvm-ld and llvm-link
                            
                                LLVM JIT tutorial code crashes with simple parameterized function. Why?
                            
                                Is there a way to show where LLVM is auto vectorising?
                            
                                Clang Pragma Comprehensive List
                            
                                Objective-C method swizzling performance
                            
                                What sret actually means?
                            
                                pip not installing numba/llvmlite properly within conda environment
                            
                                Last basic block of a function in LLVM
                            
                                Identify enclosing loop of a block in LLVM
                            
                                Pointer analysis in LLVM
                            
                                Which code in LLVM IR runs before "main()"?
                            
                                Execute LLVM IR code generated from Rust/Python source code
                            
                                Why does "empty" loop cause bus error when compiling C program with clang -O2 on macOS?
                            
                                It there an equivalent to size_t in llvm
                            
                                Mapping ANTLR parse rules to custom Java AST classes for code generation
                            
                                How to increment a Global Variable in a LLVM module?
                            
                                Source-to-source compilation with LLVM [closed]
                            
                                Converting GCC IR to LLVM IR
                            
                                How to insert a function in LLVM module

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With