What exactly PHI instruction does and how to use it in LLVM

People also ask

What is Phi instruction in LLVM?

The phi instruction is named after the φ function used in the theory of SSA. This functions magically chooses the right value, depending on the control flow. In LLVM you have to manually specify the name of the value and the previous basic block.

What is SSA in LLVM?

SSA stands for Static Single Assignment. It is a property of program representations (usually IR) designed for enabling various optimizations. Instead of 'normal' variables that can be assigned multiple times, SSA variables can only be assigned once.

What is IR in LLVM?

LLVM IR is a low-level intermediate representation used by the LLVM compiler framework. You can think of LLVM IR as a platform-independent assembly language with an infinite number of function local registers.

What is LLVM address space?

An address space is a fundamental part of the type of a pointer value and the type of operations that manipulate memory. LLVM affords a default address space (numbered zero) and places a number of assumptions on pointer values within that address space: The pointer must have a fixed integral value.

A phi node is an instruction used to select a value depending on the predecessor of the current block (Look here to see the full hierarchy - it's also used as a value, which is one of the classes which it inherits from).

Phi nodes are necessary due to the structure of the SSA (static single assignment) style of the LLVM code - for example, the following C++ function

void m(bool r, bool y){
    bool l = y || r ;
}

gets translated into the following IR: (created through clang -c -emit-llvm file.c -o out.bc - and then viewed through llvm-dis)

define void @_Z1mbb(i1 zeroext %r, i1 zeroext %y) nounwind {
entry:
  %r.addr = alloca i8, align 1
  %y.addr = alloca i8, align 1
  %l = alloca i8, align 1
  %frombool = zext i1 %r to i8
  store i8 %frombool, i8* %r.addr, align 1
  %frombool1 = zext i1 %y to i8
  store i8 %frombool1, i8* %y.addr, align 1
  %0 = load i8* %y.addr, align 1
  %tobool = trunc i8 %0 to i1
  br i1 %tobool, label %lor.end, label %lor.rhs

lor.rhs:                                          ; preds = %entry
  %1 = load i8* %r.addr, align 1
  %tobool2 = trunc i8 %1 to i1
  br label %lor.end

lor.end:                                          ; preds = %lor.rhs, %entry
  %2 = phi i1 [ true, %entry ], [ %tobool2, %lor.rhs ]
  %frombool3 = zext i1 %2 to i8
  store i8 %frombool3, i8* %l, align 1
  ret void
}

So what happens here? Unlike the C++ code, where the variable bool l could be either 0 or 1, in the LLVM IR it has to be defined once. So we check if %tobool is true, and then jump to lor.end or lor.rhs.

In lor.end we finally have the value of the || operator. If we arrived from the entry block - then it's just true. Otherwise, it is equal to the value of %tobool2 - and that's exactly what we get from the following IR line:

%2 = phi i1 [ true, %entry ], [ %tobool2, %lor.rhs ]

You don't need to use phi at all. Just create bunch of temporary variables. LLVM optimization passes will take care of optimizing temporary variables away and will use phi node for that automatically.

For example, if you want to do this:

x = 4;
if (something) x = x + 2;
print(x);

You can use phi node for that (in pseudocode):

assign 4 to x1
if (!something) branch to 4
calculate x2 from x1 by adding 2
assign x3 phi from x1 and x2
call print with x3

But you can do without phi node (in pseudocode):

allocate local variable on stack called x
load into temp x1 value 4
store x1 to x
if (!something) branch to 8
load x to temp x2
add x2 with 4 to temp x3
store x3 to x
load x to temp x4
call print with x4

By running optimization passes with llvm this second code will get optimized to first code.

Related questions
                            
                                LLVM vs. GCC for iOS development [closed]
                            
                                Is there some literal dictionary or array syntax in Objective-C?
                            
                                What does the clang -cc1 option do?
                            
                                Is it possible to debug a gcc-compiled program using lldb, or debug a clang-compiled program using gdb?
                            
                                any C/C++ refactoring tool based on libclang? (even simplest "toy example" ) [closed]
                            
                                lldb: Breakpoint on exceptions (equivalent of gdb's catch throw)
                            
                                Binding FFI and DSL
                            
                                Understanding the simplest LLVM IR
                            
                                Why is llvm considered unsuitable for implementing a JIT?
                            
                                How does C-- compare to LLVM?
                            
                                Is there a llvm java front end that converts java source to llvm's intermediate form?
                            
                                design suggestion: llvm multiple runtime contexts
                            
                                How to detect LLVM and its version through #define directives?
                            
                                LLVM Profile Error: Failed to write file "default.profraw": Permission denied
                            
                                What can make C++ RTTI undesirable to use?
                            
                                How to call methods or execute code in LLDB debugger?
                            
                                What are the differences between LLVM and java bytecode?
                            
                                View array in LLDB: equivalent of GDB's '@' operator in Xcode 4.1
                            
                                LLVM C++ IDE for Windows
                            
                                Confusing Template error

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What exactly PHI instruction does and how to use it in LLVM

Tags:

llvm

llvm-ir

People also ask

Recent Activity

Donate For Us