Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in sse

The best way to shift a __m128i?

Sep 28, 2022

c bitwise-operators sse bit-shift sse2

What is packed and unpacked and extended packed data

Feb 10, 2022

cpu-architecture sse simd avx avx2

g++ SSE intrinsics dilemma - value from intrinsic "saturates"

Jan 21, 2021

g++ sse intrinsics

Mapped memory and SSE

Aug 30, 2022

intel assembly sse memory-mapping

Alignment and performance

Sep 14, 2018

c++ c linux sse libc

Is there a way to force PMULHRSW to treat 0x8000 as 1.0 instead of -1.0?

Dec 10, 2019

image-processing assembly sse fixed-point

Why does gcc add this movss instruction only with _mm_set_ss?

Aug 15, 2022

c optimization sse compiler-optimization

STL unordered_map crashes with __m128 values

Nov 10, 2019

stl sse unordered-map

gcc 4.x not supporting x87 FPU math?

Feb 11, 2021

linux g++ sse libstdc++ x87

implement _mm256_permutevar8x32_ps using AVX instructions

May 29, 2022

c++ sse simd avx

implications of using _mm_shuffle_ps on integer vector

May 13, 2020

sse avx

For XMM/YMM FP operation on Intel Haswell, can FMA be used in place of ADD?

Jul 12, 2018

sse avx throughput flops fma

What is the difference between these 128bit SIMD xor operations

Aug 30, 2022

simd sse intrinsics sse2

Determine cause of segfault when using -O3?

Mar 20, 2022

c++ gdb sse gcc4.9

access violation _mm_store_si128 SSE Intrinsics

Feb 28, 2019

intel c++ x86 simd sse intrinsics

AVX scalar operations are much faster

Aug 06, 2022

intel c memory x86 sse avx

Most efficient way to convert vector of uint32 to vector of float?

Feb 27, 2022

intel assembly floating-point x86 sse

SSE2 instruction to typecast an integer register to short register and vice-versa

Jul 14, 2022

x86 sse simd sse2

Is there a way to utilize all XMM registers?

Oct 26, 2022

c++ c compiler-construction sse

AVX 256-bit code performing slightly worse than equivalent 128-bit SSSE3 code

Jun 06, 2022

c++ performance sse avx2

« Newer Entries Older Entries »