
How to compare two vectors using SIMD and get a single boolean result?

I have two vectors of 4 integers each, and I'd like to use a SIMD instruction to compare them (that is, generate a result vector in which each entry is 0 or 1 according to the result of the comparison).

Then I'd like to compare the result vector to a vector of 4 zeros and do something only if they are equal.

Do you know which SIMD instructions I can use to do this?

N.M asked Jul 29 '11 15:07



1 Answer

To compare two SIMD vectors:

#include <stdint.h>
#include <emmintrin.h>  /* SSE2: integer operations on __m128i */

int32_t __attribute__ ((aligned(16))) vector1[4] = { 1, 2, 3, 4 };
int32_t __attribute__ ((aligned(16))) vector2[4] = { 1, 2, 2, 2 };
int32_t __attribute__ ((aligned(16))) result[4];

__m128i v1 = _mm_load_si128((__m128i *)vector1);  /* aligned 128-bit load */
__m128i v2 = _mm_load_si128((__m128i *)vector2);
__m128i vcmp = _mm_cmpeq_epi32(v1, v2);  /* per lane: all ones if equal, else 0 */
_mm_store_si128((__m128i *)result, vcmp);  /* aligned 128-bit store */

Notes:

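  • the data is assumed to be 32-bit integers
  • vector1, vector2 and result all need to be 16-byte aligned, because the aligned load and store intrinsics are used
  • each element of result will be -1 (all bits set) for equal and 0 for not equal ({ -1, -1, 0, 0 } for the code example above)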
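
As a rough sketch, the same comparison can be wrapped in a reusable function. The name compare_eq4 is hypothetical and not part of any library. This version assumes an x86 target with SSE2 and uses the unaligned intrinsics _mm_loadu_si128 and _mm_storeu_si128, so the arrays do not have to be 16-byte aligned:

```c
#include <stdint.h>
#include <emmintrin.h>  /* SSE2 */

/* Hypothetical helper: writes -1 to out[i] where a[i] == b[i], and 0 elsewhere.
   Uses unaligned load/store, so no alignment requirement on the pointers. */
static void compare_eq4(const int32_t *a, const int32_t *b, int32_t *out)
{
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);
    __m128i vcmp = _mm_cmpeq_epi32(va, vb);  /* lane-wise 32-bit equality */
    _mm_storeu_si128((__m128i *)out, vcmp);
}
```

Calling compare_eq4 on { 1, 2, 3, 4 } and { 1, 2, 2, 2 } fills out with { -1, -1, 0, 0 }, matching the note above.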

UPDATE

If you just want a single Boolean result for the case where all 4 elements match then you can do it like this:

#include <stdint.h>
#include <emmintrin.h>  /* SSE2: integer operations on __m128i */

int32_t __attribute__ ((aligned(16))) vector1[4] = { 1, 2, 3, 4 };
int32_t __attribute__ ((aligned(16))) vector2[4] = { 1, 2, 2, 2 };

__m128i v1 = _mm_load_si128((__m128i *)vector1);
__m128i v2 = _mm_load_si128((__m128i *)vector2);
__m128i vcmp = _mm_cmpeq_epi32(v1, v2);  /* per lane: all ones if equal, else 0 */
uint16_t mask = _mm_movemask_epi8(vcmp);  /* top bit of each of the 16 bytes */
int result = (mask == 0xffff);  /* 1 only if all 4 elements matched */
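
The question also asks about the opposite case, where the comparison result is all zeros (no element matches). A minimal sketch of both tests follows, assuming an x86 target with SSE2. The helper names eq_mask4, all_equal4 and none_equal4 are hypothetical, and unaligned loads are used so the inputs need no special alignment:

```c
#include <stdint.h>
#include <emmintrin.h>  /* SSE2 */

/* Hypothetical helper: 16-bit mask with one bit per byte of the
   lane-wise equality result (4 bits per 32-bit element). */
static int eq_mask4(const int32_t *a, const int32_t *b)
{
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);
    return _mm_movemask_epi8(_mm_cmpeq_epi32(va, vb));
}

/* 1 if all 4 elements are equal, else 0. */
static int all_equal4(const int32_t *a, const int32_t *b)
{
    return eq_mask4(a, b) == 0xffff;
}

/* 1 if no element is equal (comparison result is all zeros), else 0. */
static int none_equal4(const int32_t *a, const int32_t *b)
{
    return eq_mask4(a, b) == 0;
}
```

For { 1, 2, 3, 4 } and { 1, 2, 2, 2 } both functions return 0, because two of the four elements match.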

Paul R answered Sep 22 '22 13:09