I found that some answers recommend using lower = (some_var << 32) >> 32; but I tested it and found that the following is faster: lower = some_var & 0xffffffff; So which is better? Is the former safer in some cases, or faster once the compiler has optimized it?

Which way is better to get the lower 32 bits of a 64-bit integer?

3 Answers

Masking with & is better:

& is reliable for both signed and unsigned some_var, while right-shifting a negative number produces an implementation-defined result:

The value of E1 >> E2 is E1 right-shifted E2 bit positions. [...] If E1 has a signed type and a negative value, the resulting value is implementation-defined.
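A minimal sketch of the difference, assuming some_var may be a signed 64-bit value. The helper names low32_mask and low32_shift are illustrative and not from the question. The mask gives the low 32 bits for negative and non-negative inputs alike; the shift form is applied only to an unsigned operand here, since that is the case where both shifts are fully defined.

```cpp
#include <cstdint>

// Masking: well-defined for signed and unsigned input alike.
constexpr std::uint32_t low32_mask(std::int64_t v) {
    return static_cast<std::uint32_t>(v & 0xffffffff);
}

// Shift form, restricted to an unsigned operand: the left shift discards
// the high 32 bits and the right shift fills the vacated bits with zeros.
constexpr std::uint32_t low32_shift(std::uint64_t v) {
    return static_cast<std::uint32_t>((v << 32) >> 32);
}

static_assert(low32_mask(-1) == 0xffffffffu, "mask works on a negative value");
static_assert(low32_mask(0x1122334455667788) == 0x55667788u, "mask works on a positive value");
static_assert(low32_shift(0x1122334455667788u) == 0x55667788u, "unsigned shift agrees with the mask");
```

Because the checks are static_asserts, the block verifies itself at compile time.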

On every CPU I've ever known (Z80, 6502C, x86, 68000, UltraSPARC), bitwise AND is a single cheap instruction (one clock cycle on modern CPUs), so it's extremely unlikely to be slower or take more bytes of machine code than the bit-shifting approach you mention, though the compiler might optimise that to a bitwise AND anyway.

The one disadvantage of masking is that it's relatively easy to accidentally type 7 or 9 Fs, whereas a typo in 32 is obvious. There are other ways to generate the mask value, though, e.g. (1LL<<32)-1, or the hackish but somehow elegant uint32_t(-1).
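As a sketch, the different spellings of the mask can be checked against each other at compile time. The constant names below are illustrative, and std::numeric_limits is an extra option not mentioned in the answer.

```cpp
#include <cstdint>
#include <limits>

constexpr std::uint64_t mask_literal = 0xffffffff;          // eight Fs, easy to miscount
constexpr std::uint64_t mask_shift   = (1LL << 32) - 1;     // derived from the bit count
constexpr std::uint64_t mask_cast    = std::uint32_t(-1);   // -1 wraps to the largest uint32_t
constexpr std::uint64_t mask_limits  = std::numeric_limits<std::uint32_t>::max();

static_assert(mask_shift  == mask_literal, "shift-derived mask matches the literal");
static_assert(mask_cast   == mask_literal, "uint32_t(-1) matches the literal");
static_assert(mask_limits == mask_literal, "numeric_limits matches the literal");

// A literal with one F too few still compiles, but it is a different mask.
static_assert(0xfffffff != mask_limits, "seven Fs is not the 32-bit mask");
```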

Of course, if lower is a uint32_t and some_var is a uint64_t, you can just let the conversion happen implicitly, so the optimiser doesn't even need to realise that the bitwise AND can be removed before the assignment. That might give you a compiler warning, though, which you can silence with an explicit cast:

uint32_t lower = static_cast<uint32_t>(some_var);

The masking is mainly useful when assigning to another uint64_t, or when the mask isn't for all the 32 least significant bits.
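A minimal sketch of that more general case, where the result stays 64 bits wide and the mask width is a parameter. The function name low_bits is hypothetical.

```cpp
#include <cstdint>

// Keep the n least significant bits of v; the result is still a uint64_t.
// n >= 64 is handled separately because shifting a 64-bit value by 64 or
// more positions is undefined.
constexpr std::uint64_t low_bits(std::uint64_t v, unsigned n) {
    return n >= 64 ? v : v & ((std::uint64_t{1} << n) - 1);
}

static_assert(low_bits(0x1122334455667788u, 32) == 0x55667788u, "low 32 bits");
static_assert(low_bits(0x1122334455667788u, 20) == 0x67788u, "low 20 bits");
static_assert(low_bits(0x1122334455667788u, 64) == 0x1122334455667788u, "all 64 bits");
```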


answered Nov 06 '22 08:11

Tony Delroy

The masking with AND is better since it doesn't depend on the signedness of the value.

But the most efficient way to take the lower 32 bits is to assign the value to a 32-bit variable.

uint64_t u = 0x1122334455667788;
uint32_t n;

n = static_cast<uint32_t>(u);  // 0x55667788

The difference from a bitwise AND is that the CPU just takes the lower part without performing any logical operation.

If you have a 32-bit CPU, it simply ignores the upper half, which is stored in a second register or memory location.

If you have a 64-bit CPU, it has a single instruction that zero-extends a 32-bit value to a 64-bit value.
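A short sketch of why the cast is a safe replacement for the mask: converting to an unsigned 32-bit type keeps the value modulo 2^32, which is exactly the low 32 bits, and widening back to 64 bits zero-extends. The helper names low32_cast and widen are illustrative only.

```cpp
#include <cstdint>

// Truncating conversion: the result is v modulo 2^32.
constexpr std::uint32_t low32_cast(std::uint64_t v) {
    return static_cast<std::uint32_t>(v);
}

// Widening an unsigned 32-bit value fills the upper 32 bits with zeros.
constexpr std::uint64_t widen(std::uint32_t n) {
    return n;
}

static_assert(low32_cast(0x1122334455667788u) == 0x55667788u, "cast keeps the low half");
static_assert(low32_cast(0x1122334455667788u) == (0x1122334455667788u & 0xffffffffu),
              "cast and mask agree");
static_assert(widen(low32_cast(0xffffffffffffffffu)) == 0x00000000ffffffffu,
              "widening zero-extends");
```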

answered Nov 06 '22 08:11

harper

A good optimizer would generate the same code in both cases. To me, this is the most straightforward method: lower = some_var & 0xffffffff; The other form may generate unnecessary shifts.

Sometimes I use a union to overlap variables when I want to be absolutely sure the compiler doesn't mess things up.

For example:

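#include <stdint.h>

typedef union {
    int64_t QWORD;      /* the full 64-bit value */
    int32_t DWORD[2];   /* the same storage viewed as two 32-bit halves */
} overlapper64;

overlapper64 someVariable;

Then access it as:

someVariable.QWORD;  /* the whole 64-bit value */

int32_t myVar32 = someVariable.DWORD[0];  /* low half on little-endian platforms */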

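The order in which the halves overlap depends on the platform's byte order: DWORD[0] is the low half on little-endian machines and the high half on big-endian ones. Be sure to test it on your specific platform. In C, I use a bunch of platform-specific #ifdefs to select the order automatically. Note that in C++, unlike C, reading a union member other than the one last written is formally undefined behavior, although compilers such as GCC document it as a supported extension.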
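As an alternative sketch that avoids both the #ifdefs and the union, the same overlap can be done with memcpy, with the position of the low half probed at run time. This swaps in a different technique from the one in the answer. The function names are hypothetical, and the code assumes a plain little-endian or big-endian platform.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Returns the index (0 or 1) of the low half when a uint64_t is viewed as
// two uint32_t values in memory: 0 on little-endian, 1 on big-endian.
inline int low_half_index() {
    const std::uint64_t probe = 1;
    std::uint32_t halves[2];
    std::memcpy(halves, &probe, sizeof probe);
    return halves[0] == 1 ? 0 : 1;
}

// Copies the bytes of v into two 32-bit halves and picks the low one.
inline std::uint32_t low32_via_memcpy(std::uint64_t v) {
    std::uint32_t halves[2];
    std::memcpy(halves, &v, sizeof v);
    return halves[low_half_index()];
}

// Self-check: the byte-level view must agree with a plain truncating cast.
inline void low32_self_check() {
    const std::uint64_t v = 0x1122334455667788u;
    assert(low32_via_memcpy(v) == static_cast<std::uint32_t>(v));
}
```

In practice the plain cast or the mask is simpler; this form mainly shows what the union version relies on.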

answered Nov 06 '22 07:11

particle xlr8r


Tags: c++

MoorLi
