
New posts in sse

Is it okay to mix legacy SSE encoded instructions and VEX encoded ones in the same code path?

assembly x86 sse avx intel

Fast counting the number of equal bytes between two arrays [duplicate]

c++ c sse simd sse2

Where is VPERMB in AVX2?

assembly x86 intel sse avx2

Vectorizing Modular Arithmetic

c assembly x86-64 sse intrinsics

Load constant floats into SSE registers

assembly sse

Is it possible to vectorize myNum += a[b[i]] * c[i]; on x86_64?

What's the difference between __popcnt() and _mm_popcnt_u32()?

x86 sse intrinsics sse4

AVX/SSE version of xorshift128+

c performance sse avx

SSE and C++ containers

128-bit values - From XMM registers to General Purpose

assembly x86 sse

Bitwise xor of two 256-bit integers

sse simd avx

A better 8x8 bytes matrix transpose with SSE?

c matrix optimization sse simd

Why don't GCC and Clang use cvtss2sd [memory]?

Get sum of values stored in __m256d with SSE/AVX

c++ optimization sse avx avx2

SIMD programming languages

How to load a pixel struct into an SSE register?

c pixel x86-64 sse intrinsics intel

Testing equality between two __m128i variables

c x86 sse simd

How can I check if my installed numpy is compiled with SSE/SSE2 instruction set?

python numpy sse

How to properly use prefetch instructions?

Complex Mul and Div using SSE instructions

x86 sse simd complex-numbers