Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in avx

How to disable AVX in Visual Studio 2017 15.5?

visual-studio-2017 avx

How to compute sincos fast on a x64 CPU?

C preprocessing fails to stop immediately after an #error

sse/avx equivalent for neon vuzp

sse simd neon avx

How to optimise this 8-bit positional popcount using assembly?

go assembly x86 simd avx

implict SIMD (SSE/AVX) broadcasts with GCC

gcc sse simd avx

Wrapper for `__m256` Producing Segmentation Fault with Constructor - Windows 64 + MinGW + AVX Issues

c++ g++ mingw-w64 avx windows64

How to speed up calculation of integral image?

best way to shuffle across AVX lanes?

c++ x86 sse simd avx

GEMM kernel implemented using AVX2 is faster than AVX2/FMA on a Zen 2 CPU

For for an SSE vector that has all the same components, generate on the fly or precompute?

c++ sse simd avx

How to write c++ code that the compiler can efficiently compile to SSE or AVX?

Tensorflow AVX Support

Find the first instance of a character using simd

x86 sse simd avx avx2

In assembly, how to add integers without destroying either operand?

How many clock cycles does cost AVX/SSE exponentiation on modern x86_64 CPU?

c++ x86 x86-64 sse avx

Forcing AVX intrinsics to use SSE instructions instead

Slow vpermpd instruction being generated; why?

SSE and AVX intrinsics mixture

c++ performance sse simd avx

Why is permute needed in parallel SIMD/SSE/AVX ?

permutation sse simd avx