Why does enabling undefined behaviour sanitization interfere with optimizations?

Tags:

Consider the following code:

#include <string_view>

constexpr std::string_view f() { return "hello"; }

static constexpr std::string_view g() {
    auto x = f();
    return x.substr(1, 3);
}

int foo() { return g().length(); }

If I compile it with GCC 10.2, and flags --std=c++17 -O1, I get:

foo():
        mov     eax, 3
        ret

also, to my knowledge, this code does not suffer from any undefined behavior issues.

However - if I add the flag -fsanitize=undefined, the compilation result is:

.LC0:
        .string "hello"
foo():
        sub     rsp, 104
        mov     QWORD PTR [rsp+80], 5
        mov     QWORD PTR [rsp+16], 5
        mov     QWORD PTR [rsp+24], OFFSET FLAT:.LC0
        mov     QWORD PTR [rsp+8], 3
        mov     QWORD PTR [rsp+72], 4
        mov     eax, OFFSET FLAT:.LC0
        cmp     rax, -1
        jnb     .L4
.L2:
        mov     eax, 3
        add     rsp, 104
        ret
.L4:
        mov     edx, OFFSET FLAT:.LC0+1
        mov     rsi, rax
        mov     edi, OFFSET FLAT:.Lubsan_data154
        call    __ubsan_handle_pointer_overflow
        jmp     .L2
.LC1:
        .string "/opt/compiler-explorer/gcc-10.2.0/include/c++/10.2.0/string_view"
.Lubsan_data154:
        .quad   .LC1
        .long   287
        .long   49

See this on Compiler Explorer.

My question: Why should the sanitization interfere with the optimization? Especially since the code doesn't seem to have any UB hazards...

Notes:

I suspect a GCC bug, but maybe I have the wrong perception of what the UBsan does.
Same behavior if I set -O3.
With no optimization flags, the longer code is produced both with and without sanitization.
If you declare x to be a constexpr variable, the sanitization doesn't prevent the optimization.
Same behavior with C++17 and C++20.
With Clang, you get this discrepancy as well, but only with a higher optimization setting (e.g. -O3).

665

asked Oct 23 '20 15:10

einpoklum

1 Answers

Sanitizers add necessary instrumentation to detect violations at run-time. That instrumentation may prevent the function from being computed at compile-time as an optimization by introducing some opaque calls/side-effects that wouldn't be present there otherwise.

The inconsistent behavior you see is because g().length(); call is not done in constexpr context, so it's not required (well, "not expected" would be more accurate) to be computed at compile-time. GCC likely has some heuristics to compute constexpr functions with constexpr arguments in regular contexts that don't trigger once sanitizers get involved by either breaking the constexpr-ness of the function (due to added instrumentation) or one of the heuristics involved.

Adding constexpr to x makes f() call a constant expression (even if g() is not), so it's compiled at compile-time so it doesn't need to be instrumented, which is enough for other optimizations to trigger.

One can view that as a QoI issue, but in general it makes sense as

constexpr function evaluation can take arbitrarily long, so it's not always preferable to evaluate everything at compile time unless asked to
you can always "force" such evaluation (although the standard is somewhat permissive in this case), by using such functions in constant expressions. That'd also take care of any UB for you.

185

answered Oct 20 '22 03:10

Dan M.

Related questions
                            
                                What is C++ Technical Specification?
                            
                                How can I get more details about errors generated during protobuf parsing? (C++)
                            
                                How I'm supposed to use the sanitizer in clang?
                            
                                No speedup for vector sums with threading
                            
                                GoogleTest PrintTo not getting called for a class
                            
                                error: no type named 'vector' in namespace 'std'
                            
                                CUDA __device__ Unresolved extern function [duplicate]
                            
                                How to initialize a shared pointer in the initialization list of a constructor?
                            
                                const-reference qualified member function
                            
                                The correct way of returning std::unique_ptr to an object of polymorphic class
                            
                                Static link libstdc++ using clang
                            
                                Is `std::common_type` associative?
                            
                                One Definition Rule - Multiple definition of inline functions
                            
                                why `S x({})` invoke default constructor in GCC 7/C++1z mode only?
                            
                                The value of a const variable is or is not usable in a constant expression, depending on the variable type
                            
                                c++17 evaluation order with operator overloading functions
                            
                                Is inheritability of lambdas guaranteed by the standard?
                            
                                Overloading operator[] and NOT getting "lvalue required as left operand of assignment" error
                            
                                A compile time way to determine the least expensive argument type
                            
                                Is is necessary to use volatile when writing to hardware in C or C++?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does enabling undefined behaviour sanitization interfere with optimizations?

Tags:

c++

compiler-optimization

gcc

constexpr

ubsan

einpoklum

People also ask

1 Answers

Dan M.

Recent Activity

Donate For Us