Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in simd

Most efficient way to store 4 dot products into a contiguous array in C using SSE intrinsics

Fast counting the number of equal bytes between two arrays [duplicate]

c++ c sse simd sse2

Is it possible to use SIMD instructions in Rust?

rust simd avx avx2

Is it possible to vectorize myNum += a[b[i]] * c[i]; on x86_64?

what's the difference between _mm256_lddqu_si256 and _mm256_loadu_si256

Bitwise xor of two 256-bit integers

sse simd avx

A better 8x8 bytes matrix transpose with SSE?

c matrix optimization sse simd

Fastest Implementation of Exponential Function Using AVX

x86 simd avx exponential avx2

SIMD programming languages

How do I reorder vector data using ARM Neon intrinsics?

arm simd neon intrinsics

Testing equality between two __m128i variables

c x86 sse simd

Complex Mul and Div using sse Instructions

x86 sse simd complex-numbers

SSE: convert short integer to float

x86 sse simd

Why do I get this compile error with GCC 5 and cilk-plus?

c gcc simd cilk-plus

How to get GCC to use more than two SIMD registers when using intrinsics?

gcc assembly x86 sse simd

byte array permute SSE optimization

c++ gcc x86-64 sse simd

NEON vs Intel SSE - equivalence of certain operations

c++ c sse simd neon

indexing into an array with SSE

c sse simd

8 bit shift operation in AVX2 with shifting in zeros

c sse simd avx avx2

Why are some Haswell AVX latencies advertised by Intel as 3x slower than Sandy Bridge?