Compiler optimization of bitwise not operation

Tags:

I have a simple function testing if two arrays are each others inverse. They are seemingly identical, except for a tmp variable. One works the other doesn't. I can't for the life of me figure out why the compiler would optimize this out - if it indeed is an optimization problem (my compiler is IAR Workbench v4.30.1). Here's my code:

// this works as expected uint8 verifyInverseBuffer(uint8 *buf, uint8 *bufi, uint32 len) {   uint8 tmp;   for (uint32 i = 0; i < len; i++)   {     tmp = ~bufi[i];     if (buf[i] != tmp)     {       return 0;     }   }   return 1;   }  // this does NOT work as expected (I only removed the tmp!) uint8 verifyInverseBuffer(uint8 *buf, uint8 *bufi, uint32 len) {   for (uint32 i = 0; i < len; i++)   {     if (buf[i] != (~bufi[i]))     {       return 0;     }   }   return 1;   }

The first version of the code works, the second does not. Can anyone figure out why? Or come with some tests to probe what is wrong?

650

asked Sep 06 '19 13:09

SupAl

1 Answers

What you see happening is a result of the rules of integer promotions. Anytime a variable smaller than an int is used in an expression the value is promoted to type int.

Suppose bufi[i] contains the value 255. The hex representation of this is 0xFF. This value is then operand of the ~ operator. So the value will first be promoted to int which (assuming it is 32 bit) will have the value 0x000000FF, and applying ~ to this gives you 0xFFFFFF00. You then compare this value with buf[i] which is of type uint8_t. The value 0xFFFFFF00 is outside of this range so the comparison will always be false.

If you assign the result of the ~ back to a variable of type uint8_t, the value 0xFFFFFF00 is converted to 0x00. It is this converted value that is then compared against buf[i].

So the behavior you see is not the result of an optimization but the rules of the language. Using a temp variable as you are is one way to address this issue. You could also cast the result to uint8:

if(buf[i] != (uint8)(~bufi[i]))

Or mask out all but the lowest order byte:

if(buf[i] != (~bufi[i] & 0xff))

151

answered Sep 18 '22 08:09

dbush

Related questions
                            
                                Using shared_from_this in templated classes
                            
                                What does assert(0) mean?
                            
                                Why does const auto &p{nullptr} work while auto *p{nullptr} doesn't in C++17?
                            
                                Compile error in 'winbase.h'
                            
                                Why do arrays of different integer sizes have different performance?
                            
                                How does the compiler benefit from C++'s new final keyword?
                            
                                Parallel for loop in openmp
                            
                                Pointer-to-pointer dynamic two-dimensional array
                            
                                Assignment operator not available in derived class
                            
                                Operating System compile time
                            
                                Boost 1.46.1, Property Tree: How to iterate through ptree receiving sub ptrees?
                            
                                Class and std::async on class member in C++
                            
                                What happens if a constructor throws an exception?
                            
                                Can functions from the C standard library be used in C++?
                            
                                Get bytes from std::string in C++
                            
                                How to implement serialization in C++
                            
                                Constructor Overloading in C++
                            
                                Strange definition of FALSE and TRUE, why? [duplicate]
                            
                                Can I determine the number of channels in cv::Mat Opencv
                            
                                C++11 way to index tuple at runtime without using switch

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Compiler optimization of bitwise not operation

Tags:

c++

c

embedded

iar

SupAl

People also ask

1 Answers

dbush

Recent Activity

Donate For Us