New posts in sse

g++ SSE intrinsics dilemma - value from intrinsic "saturates"

g++ sse intrinsics

Mapped memory and SSE

Alignment and performance

c++ c linux sse libc

Is there a way to force PMULHRSW to treat 0x8000 as 1.0 instead of -1.0?

Why does gcc add this movss instruction only with _mm_set_ss?

STL unordered_map crashes with __m128 values

stl sse unordered-map

gcc 4.x not supporting x87 FPU math?

linux g++ sse libstdc++ x87

implement _mm256_permutevar8x32_ps using AVX instructions

c++ sse simd avx

implications of using _mm_shuffle_ps on integer vector

sse avx

For XMM/YMM FP operation on Intel Haswell, can FMA be used in place of ADD?

sse avx throughput flops fma

What is the difference between these 128bit SIMD xor operations

simd sse intrinsics sse2

Determine cause of segfault when using -O3?

c++ gdb sse gcc4.9

access violation _mm_store_si128 SSE Intrinsics

intel c++ x86 simd sse intrinsics

AVX scalar operations are much faster

intel c memory x86 sse avx

Most efficient way to convert vector of uint32 to vector of float?

SSE2 instruction to typecast an integer register to short register and vice-versa

x86 sse simd sse2

Is there a way to utilize all XMM registers?

Implement a near real-time CPU capability like glAlphaFunc(GL_GREATER) with RGB source and RGBA overlay

c++ opengl assembly sse rgba

Setting last or first n bits in SSE register

c++ x86 sse simd intrinsics

AVX 256-bit code performing slightly worse than equivalent 128-bit SSSE3 code

c++ performance sse avx2