Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in avx

Penalty for switching from SSE to AVX?

c++ sse avx sse2

Getting wrong results with using AVX instructions and -O3 compiling option

c compiler-optimization avx

Testing whether AVX register contains some equal integer numbers

c++ x86 simd avx avx2

Why is this code using VMULPD to write registers that will be overwritten by VFMADD? Isn't that useless?

assembly avx fma

Why _umul128 works slower than scalar code for mul128x64x2 function?

How to optimise my AVX Code

Does Hyperthreading have trouble with AVX?

eigen vectorization with arrays

sse eigen avx eigen3

Can't use AVX intrinsic ,because my function compiled without support for 'xsave'

xcode macos avx

SSE/AVX: Choose from two __m256 float vectors based on per-element min and max absolute value

sse intrinsics avx avx512

developing for new instruction sets

x86 sse avx

How to perform element-wise left shift with __m128i?

c sse avx

AVX2: BitScanReverse or CountLeadingZeros on 8 bit elements in AVX register

c++ simd intrinsics avx avx2

Intel C Compiler uses unaligned SIMD moves with aligned memory

inlining failed in call to always_inline '__m256d _mm256_broadcast_sd(const double*)'

c++ gcc x86 intrinsics avx

Comparing 2 vectors in AVX/AVX2 (c)

c simd avx avx2

Writing a vector sum function with SIMD (System.Numerics) and making it faster than a for loop

c# arrays performance simd avx

How does _mm256_shuffle_ps work?

c x86 simd intrinsics avx

__m256i version of _mm_test_all_zeros

simd intrinsics avx avx2

Union with __m256 and array of two __m128