
Can C++ compilers optimize calls to at()?

Since regular array accesses via the [] operator are unchecked, it's not fun to hit the headlines when your program has a remote code execution exploit or data leakage due to a buffer overflow.

Most standard array containers provide an at() method that gives bounds-checked access to the elements. This makes an out-of-bounds access well defined (it throws an exception) instead of undefined behavior.

This essentially eliminates buffer-overflow arbitrary-code-execution exploits, and there is also a clang-tidy check that warns you to use at() when the index is non-constant. So I changed it in quite a few places.

Most managed languages have checked arrays, and their compilers can eliminate the checks when they can prove the index is in range.

I know C++ compilers can do awesome optimizations. The question is: can C++ compilers eliminate calls to at() when they can see the access can't be out of bounds?

asked Oct 04 '19 by Calmarius



1 Answer

Here is a classic case that would be subject to bounds check elimination in managed languages: iterating up to the size.

#include <cstddef>
#include <vector>

int test(std::vector<int> &v)
{
    int sum = 0;
    for (std::size_t i = 0; i < v.size(); i++)
        sum += v.at(i);
    return sum;
}

This is not as trivial to optimize as the case where both the index and the size are constants (which could be solved by constant propagation); it requires more advanced reasoning about the relationships between values.

As seen on Godbolt, GCC (9.2), Clang (9.0.0) and even MSVC (v19.22) can handle such code reasonably. GCC and Clang autovectorize. MSVC just generates a basic loop:

$LL4@test:
    add     eax, DWORD PTR [r9+rdx*4]
    inc     rdx
    cmp     rdx, r8
    jb      SHORT $LL4@test

That is not bad, but given that MSVC does vectorize a similar loop that uses [] instead of .at(), I have to conclude: yes, there is a significant cost to using at() even in some basic cases where we might expect otherwise (especially since the range check itself is gone from the generated code, so the auto-vectorization step balked for seemingly no reason). If you target only GCC and Clang, there is less of an issue. In trickier cases GCC and Clang can also be "sufficiently confused", for example when the indexes are passed through a data structure (unlikely code, but the point is that range information can sometimes be lost).

answered Oct 24 '22 by harold