Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in sse

Usage of _mm_shuffle_epi8 intrinsic

SSE Compare Packed Unsigned Bytes

x86 comparison unsigned sse

What's the difference between GCC builtin vectorization types and C arrays?

gcc assembly sse vectorization

How to avoid SSE pipeline flush?

c++ sse

Troubleshooting auto vectorize reason '1200'

Automatically generate xmm register name in a gas macro?

assembly x86 gnu-assembler sse

Best way to compute max mask of sse var

SSE strangeness with Functions

d sse inline-assembly

SSE normalization slower than simple approximation?

Cannot access memory as SSE type on x86 but works fine on x64

c++ x86 sse

Getting GCC to generate a PTEST instruction when using vector extensions

c gcc vectorization sse avx2

_mm_max_ss has different behavior between clang and gcc

c++ gcc x86 clang sse

SSE program takes a lot longer on AMD than on Intel

optimization sse

Equivalent of SSE unpacklo_ps/unpackhi_ps in AVX (for doubles)

c sse avx

Channel/lane shuffling for SSE and AVX?

c++ c vectorization sse avx

Comparing two vector<bool> with SSE

c++ x86 sse simd

How to compute sincos fast on a x64 CPU?

Extract the low bit of each bool byte in a __m128i? bool array to packed bitmap

Fast SSE low precision exponential using double precision operations