What is the difference between these 128-bit SIMD XOR operations?

Tags: simd, sse, intrinsics, sse2

Intel provides several SIMD intrinsics that all seem to perform a bitwise XOR on 128-bit data:

_mm_xor_pd(__m128d, __m128d)
_mm_xor_ps(__m128, __m128)
_mm_xor_si128(__m128i, __m128i)

Don't bitwise operations just operate on bit streams? Why are there three operations with different types but the same data size?


asked Mar 18 '15 13:03

jiandingzhe

2 Answers

_mm_xor_pd(__m128d, __m128d) operates on two 64-bit double-precision floats

https://msdn.microsoft.com/en-us/library/w87cdc33%28v=vs.90%29.aspx

_mm_xor_ps(__m128, __m128) operates on four 32-bit single-precision floats

https://msdn.microsoft.com/en-us/library/ss6k3wk8(v=vs.90).aspx

_mm_xor_si128(__m128i, __m128i) operates on one 128-bit value

https://msdn.microsoft.com/en-us/library/fzt08www%28v=vs.90%29.aspx

An XOR can be applied to any two bit patterns regardless of their format. Why three? It is a balance between supporting the common data types (float, double, and 128-bit integer) and not having too many instructions.

The balance concerns the amount of silicon used, as each set of operations may execute in a separate functional unit (integer, float, double). If they use different silicon, the different types of operation can all execute in parallel.
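The point that XOR does not care about the data format can be checked with a small sketch. It assumes an x86 compiler with SSE2 enabled, and the function name `xor_variants_agree` is made up for illustration. It feeds the same 128 bits through all three intrinsics, using the cast intrinsics only to reinterpret the register type, and compares the stored results byte for byte.

```c
#include <emmintrin.h>  /* SSE2: __m128, __m128d, __m128i and the XOR intrinsics */
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: returns 1 if the three XOR intrinsics produce the
   same 128 result bits for the same 128 input bits, 0 otherwise. */
int xor_variants_agree(void)
{
    /* Arbitrary bit patterns; some are not even valid "nice" floats. */
    const uint32_t a_bits[4] = {0xDEADBEEFu, 0x01234567u, 0x89ABCDEFu, 0xFFFF0000u};
    const uint32_t b_bits[4] = {0x0F0F0F0Fu, 0xFFFFFFFFu, 0x00000000u, 0x12345678u};

    __m128i ai = _mm_loadu_si128((const __m128i *)a_bits);
    __m128i bi = _mm_loadu_si128((const __m128i *)b_bits);

    /* The casts change only the C type, not the bits in the register. */
    __m128i ri = _mm_xor_si128(ai, bi);
    __m128  rs = _mm_xor_ps(_mm_castsi128_ps(ai), _mm_castsi128_ps(bi));
    __m128d rd = _mm_xor_pd(_mm_castsi128_pd(ai), _mm_castsi128_pd(bi));

    uint32_t out_i[4], out_s[4], out_d[4];
    _mm_storeu_si128((__m128i *)out_i, ri);
    _mm_storeu_si128((__m128i *)out_s, _mm_castps_si128(rs));
    _mm_storeu_si128((__m128i *)out_d, _mm_castpd_si128(rd));

    return memcmp(out_i, out_s, sizeof out_i) == 0 &&
           memcmp(out_i, out_d, sizeof out_i) == 0;
}
```

This only shows that the results are bit-identical. It says nothing about which execution unit handles each instruction, which depends on the specific CPU.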


answered Oct 29 '22 16:10

Tim Child

From a strict C point of view, they are all different because of the types.

They might also be hints to the CPU about which kind of data you intend to work with. At least, this is the best interpretation the experts have come up with. As they said, this needs to be checked on hardware, though.
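As an illustration of the type distinction, here is a minimal sketch, assuming SSE is available and using a hypothetical helper named `negate4`. Because `_mm_xor_ps` takes and returns `__m128`, float code can flip sign bits without any cast intrinsics. Doing the same with `_mm_xor_si128` would require casting to `__m128i` and back.

```c
#include <xmmintrin.h>  /* SSE: __m128 and _mm_xor_ps */

/* Hypothetical helper: negates four floats by XOR-ing each lane with a
   mask in which only the sign bit is set. */
void negate4(const float in[4], float out[4])
{
    /* -0.0f has the bit pattern 0x80000000: sign bit set, all else zero. */
    const __m128 sign_mask = _mm_set1_ps(-0.0f);
    __m128 v = _mm_loadu_ps(in);
    _mm_storeu_ps(out, _mm_xor_ps(v, sign_mask));
}
```

Everything stays in the `__m128` type from load to store, which is the convenience the separate float variant provides at the C level.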

answered Oct 29 '22 17:10

AntoineL
