Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in avx

Reverse a AVX register containing doubles using a single AVX intrinsic

c sse vectorization simd avx

set individual bit in AVX register (__m256i), need "random access" operator

Dot Product of Vectors with SIMD

c x86 simd avx

6 element double precision vector matrix vector multiply in AVX

c++ c optimization avx

How to force GCC to pass 128bits/256bits struct as function param in xmm/ymm register?

Why _mm256_load_pd compiled to MOVUPD instead of MOVAPD?

c visual-studio alignment avx

Reading constants into an SSE/AVX registers in inline assembly

valarray on aligned memory for SSE / AVX

c++ sse avx valarray

AVX inside a VirtualBox VM?

ubuntu x86 virtualbox avx

Checking If A Vector Contains Any Element Greater Than Zero

c vector avx

Vectorization - Speed up expected for SSE, AVX and AVX2

c vectorization sse avx avx512

How to extract 8 integers from a 256 vector using intel intrinsics?

c x86 simd intrinsics avx

Is it possible to popcount __m256i and store result in 8 32-bit words instead of the 4 64-bit using Wojciech Mula algorithm's?

c++ intel sse avx avx2

FMA3 in GCC: how to enable

c++ gcc intel avx fma

How to efficiently perform double/int64 conversions with SSE/AVX?

c++ floating-point sse simd avx