Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in simd

How to cast SIMD int vectors to float in GCC?

c gcc vectorization simd

Writing a portable SSE/AVX version of std::copysign

c++ x86-64 sse simd avx

How to convert byte array of image pixels data to grayscale using vector SSE operation

How to reverse an __m128 type variable?

c++ c x86 sse simd

SSE intrinsic over int16[8] to extract the sign of each element

c x86 sse simd sign

Count leading zeros in __m256i word

c x86 simd intrinsics avx

How to perform uint32/float conversion with SSE?

c x86 sse simd

Why do processors with only AVX out-perform AVX2 processors for many SIMD algorithms?

c# c++ simd avx avx2

Which one is better, gcc or armcc for NEON optimizations?

embedded arm simd neon cortex-a8

Fast interleave 2 double arrays into an array of structs with 2 float and 1 int (loop invariant) member, with SIMD double->float conversion?

c++ x86 simd intrinsics avx

Using SIMD/AVX/SSE for tree traversal

SSE2 intrinsics - comparing unsigned integers

c++ x86 sse simd intrinsics

Best way to shuffle 64-bit portions of two __m128i's

intel sse simd intrinsics

Alignment of multi-dimensional array for omp simd

fortran openmp simd

How to make the most of SIMD in OpenCL?

opencl gpgpu simd spmd

Why can't gcc or clang properly @encode SIMD vector types?

Optimizing horizontal boolean reduction in ARM NEON

arm simd neon

Fastest way to perform AVX inner product operations with mixed (float, double) input vectors

c++ vectorization simd avx sse2

efficient way to convert scatter indices into gather indices?

Loading data for GCC's vector extensions