Is there a way to simulate integer bitwise operations for _m256 types on AVX?

Tags:

I have a boolean expression that I have managed to implement in SSE2. Now I would have liked to try implementing it in AVX exploiting an additional factor 2 in parallelism increase (from 128 bit SIMD type to 256). However, AVX does not support integer operation (which AVX2 does, but I am working on a Sandy Bridge processor so it is not an option currently). However, since there are AVX intrinsics for bitwise operations. I figured I could make a try by just converting my integer types to float types and see if it works.

First test was a success:

__m256 ones = _mm256_set_ps(1,1,1,1,1,1,1,1);
__m256 twos = _mm256_set_ps(2,2,2,2,2,2,2,2); 
__m256 result = _mm256_and_ps(ones, twos);

I'm guetting all 0's as I am supposed to. Simularly AND'ing the twos instead I get a result of 2. But when trying 11 XOR 4 accordingly:

__m256 elevens = _mm256_set_ps(11,11,11,11,11,11,11,11); 
__m256 fours = _mm256_set_ps(4,4,4,4,4,4,4,4); 
__m256 result2 = _mm256_xor_ps(elevens, fours);

The result is 6.46e-46 (i.e. close to 0) and not 15. Simularly doing 11 OR 4 gives me a value of 22 and not 15 as it should be. I don't understand why this is. Is it a bug or some configuration I am missing?

I was actually expecting my hypothesis of working with float as if they were integers to not work since the integer initialized to a float value might not actually be the precise value but a close approximation. But even then, I am surprised by the result I get.

Does anyone have a solution to this problem or must I upgrade my CPU to get AVX2 support enable this?

837

asked Dec 11 '13 19:12

Toby999

1 Answers

The first test worked by accident.

1 as a float is 0x3f800000, 2 is 0x40000000. In general, it wouldn't work that way.

But you can absolutely do it, you just have to make sure that you're working with the right bit-pattern. Don't convert your integers to floats - reinterpret-cast them. That corresponds to intrinsics such as _mm256_castsi256_ps, or storing your ints to memory and reading them as floats (that won't change them, in general only math operations care about what the floats mean, the rest work with the raw bit patterns, check the list of exceptions that an instruction can make to make sure).

answered Nov 15 '22 10:11

harold

Related questions
                            
                                How to deactivate input statement after some time?
                            
                                timer accuracy: c clock( ) vs. WinAPI's QPC or timeGetTime( )
                            
                                Type of a C++ string literal
                            
                                C++ unexpected behaviour (where are my temporaries!?)
                            
                                Combining two const char* together
                            
                                C++ Why use an implicit conversion from (std::string) to (void) type?
                            
                                Need to save a new file with QFileDialog
                            
                                Visual Studio 2010 c++ compiler issue
                            
                                How do I change the avr32-gcc C compiler for the C++ in Atmel Studio 6 without having to create a new project?
                            
                                How to access a C++ function which takes pointers as input argument from a C#
                            
                                Optimal way to convert an int into a char array
                            
                                std::move() as performance bottleneck?
                            
                                What is a "Microsoft C++ exception"?
                            
                                Convert Vec4i to Java openCV
                            
                                Why is my non-recursive sqrt function recursive?
                            
                                Value_type of a container template parameter
                            
                                C++ move semantics: why copy assignment operator=(&) is called instead of move assignment operator=(&&)?
                            
                                How can I prevent my program from closing when a open console window is closed?
                            
                                Understanding an error in a recursive function?
                            
                                Bulk-allocating objects with calling new operator once?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is there a way to simulate integer bitwise operations for _m256 types on AVX?

Tags:

c++

c

integer

avx

sse

Toby999

People also ask

1 Answers

harold

Recent Activity

Donate For Us