Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in sse

How to avoid SSE pipeline flush?

c++ sse

Troubleshooting auto vectorize reason '1200'

Automatically generate xmm register name in a gas macro?

assembly x86 gnu-assembler sse

Best way to compute max mask of sse var

SSE strangeness with Functions

d sse inline-assembly

SSE normalization slower than simple approximation?

Cannot access memory as SSE type on x86 but works fine on x64

c++ x86 sse

Getting GCC to generate a PTEST instruction when using vector extensions

c gcc vectorization sse avx2

_mm_max_ss has different behavior between clang and gcc

c++ gcc x86 clang sse

SSE program takes a lot longer on AMD than on Intel

optimization sse

Equivalent of SSE unpacklo_ps/unpackhi_ps in AVX (for doubles)

c sse avx

Channel/lane shuffling for SSE and AVX?

c++ c vectorization sse avx

Comparing two vector<bool> with SSE

c++ x86 sse simd

How to compute sincos fast on a x64 CPU?

Extract the low bit of each bool byte in a __m128i? bool array to packed bitmap

Fast SSE low precision exponential using double precision operations

How to check overflow for multiplication of 16 bit integers in SSE?

Optimal implementation of iterative Kahan summation

How to detect sse availability in CMake

build cross-platform cmake sse

What's the best way to load 2 unaligned 64-bit values into an sse register with SSSE3?

sse simd intrinsics