Here are two functions which I claim do exactly the same thing:
bool fast(int x) { return x & 4242; }
bool slow(int x) { return x && (x & 4242); }
Logically they do the same thing, and just to be 100% sure I wrote a test that ran all four billion possible inputs through both of them (a sketch of that check is included further down), and the results matched. But the assembly code is a different story:
fast:
        andl    $4242, %edi
        setne   %al
        ret

slow:
        xorl    %eax, %eax
        testl   %edi, %edi
        je      .L3
        andl    $4242, %edi
        setne   %al
.L3:
        rep ret
I was surprised that GCC could not make the leap of logic to eliminate the redundant test. I tried g++ 4.4.3 and 4.7.2 with -O2, -O3, and -Os, all of which generated the same code. The platform is Linux x86_64.
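(For anyone who wants to reproduce this: the assembly above can be obtained with commands along these lines, where test.cpp is a placeholder file containing the two functions; the exact output will vary by GCC version.)

g++ -O2 -S -o - test.cpp
g++ -O3 -S -o - test.cpp
g++ -Os -S -o - test.cpp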
Can someone explain why GCC isn't smart enough to generate the same code in both cases? I'd also like to know whether other compilers can do better.
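For reference, the exhaustive check mentioned above amounts to something like the following sketch (the exact code may have differed, but the idea is simply to walk every 32-bit pattern and compare the two functions):

#include <cstdint>
#include <cstdio>

bool fast(int x) { return x & 4242; }
bool slow(int x) { return x && (x & 4242); }

int main()
{
    std::uint32_t u = 0;
    do {
        int x = static_cast<int>(u);   // reinterpret each 32-bit pattern as int
        if (fast(x) != slow(x)) {
            std::printf("mismatch at %d\n", x);
            return 1;
        }
    } while (++u != 0);                // exits after wrapping past 0xFFFFFFFF
    std::puts("all four billion inputs match");
    return 0;
}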
Edit to add test harness:
#include <cstdlib>
#include <vector>
using namespace std;

int main(int argc, char* argv[])
{
    // make vector filled with numbers starting from argv[1]
    int seed = atoi(argv[1]);
    vector<int> v(100000);
    for (int j = 0; j < 100000; ++j)
        v[j] = j + seed;

    // count how many times the function returns true
    int result = 0;
    for (int j = 0; j < 100000; ++j)
        for (int i : v)
            result += slow(i);  // or fast(i), try both

    return result;
}
I tested the above with clang 5.1 on Mac OS with -O3. It took 2.9 seconds using fast() and 3.8 seconds using slow(). If I instead use a vector of all zeros, there is no significant difference in performance between the two functions.
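(The timings came from something like the following; bench.cpp here is a placeholder for the harness above plus the function under test, and for the all-zeros case the fill loop is changed to store 0 instead of j + seed.)

clang++ -std=c++11 -O3 -o bench bench.cpp
time ./bench 1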
Compiler-specific pragma: GCC provides #pragma GCC as a way to temporarily control compiler behavior. By using #pragma GCC optimize("O0"), the optimization level can be set to zero, which means GCC performs no optimization at all for the code that follows.
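A minimal sketch of how that looks in practice (the function name here is only illustrative):

#pragma GCC push_options
#pragma GCC optimize ("O0")     // code below is compiled with no optimization
bool slow_unoptimized(int x) { return x && (x & 4242); }
#pragma GCC pop_options         // restore the command-line optimization level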
GCC has a range of optimization levels, plus individual options to enable or disable particular optimizations. The overall optimization level is controlled by the command-line option -On, where n is the required optimization level; the default is -O0. At -O2, GCC performs nearly all supported optimizations that do not involve a space-speed tradeoff; compared to -O, this option increases both compilation time and the performance of the generated code. With -Os, the compiler optimizes to reduce the size of the binary instead of execution speed. If you do not specify an optimization option, GCC attempts to reduce compilation time and to make debugging always yield the results expected from reading the source code.
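By way of example, the levels mentioned above are selected on the command line roughly like this (test.cpp again being a placeholder source file):

g++ -O0 -c test.cpp    # default: fastest compilation, easiest debugging
g++ -O2 -c test.cpp    # optimize for speed without large size increases
g++ -Os -c test.cpp    # optimize for binary size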
Exactly why should it be able to optimize the code? You're assuming that any transformation that works will be done. That's not at all how optimizers work. They're not artificial intelligences; they work by replacing known patterns parametrically. E.g., Common Subexpression Elimination scans an expression for common subexpressions and hoists them, provided that doing so does not change side effects.
(BTW, CSE shows that optimizers are already quite aware of what code movement is allowed in the possible presence of side effects. They know that you have to be careful with &&: whether expr && expr can be CSE-optimized or not depends on the side effects of expr.)
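As a rough illustration of that point (the names here are made up for the example), a left operand with a side effect cannot simply be merged with a later occurrence of the same expression:

int counter = 0;
int next_value() { return ++counter; }   // observable side effect

bool g()
{
    // The two calls are not a common subexpression: && short-circuits,
    // so the second call may never happen, and merging the two calls
    // would change how many times counter is incremented.
    return next_value() && (next_value() & 4242);
}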
So, in summary: which pattern do you think applies here?