Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in simd

Find index of maximum element in x86 SIMD vector

Sep 17, 2022

c++ x86 sse simd avx intel

practical BigNum AVX/SSE possible?

Jun 06, 2020

sse biginteger simd avx extended-precision

Why doesn't gcc resolve _mm256_loadu_pd as single vmovupd?

Apr 05, 2022

gcc assembly vectorization simd avx

SIMD latency throughput

Nov 04, 2022

c++ performance x86 sse simd

Speed up float 5x5 matrix * vector multiplication with SSE

Mar 25, 2022

c++ vectorization matrix-multiplication sse simd

Flipping sign on packed SSE floats

Jun 18, 2018

c performance optimization sse simd

Beating or meeting OS X memset (and memset_pattern4)

Jan 18, 2022

c performance optimization assembly simd

Constexpr and SSE intrinsics

Dec 27, 2021

c++ sse simd constexpr intrinsics

Get index of first element that is not zero in a __m256 variable

Nov 09, 2022

c++ c sse simd avx

What is the diffrence between SPMD and SIMD?

Oct 22, 2022

terminology parallel-processing simd

Does rewriting memcpy/memcmp/... with SIMD instructions make sense?

Sep 21, 2022

performance sse simd

SIMD instructions for floating point equality comparison (with NaN == NaN)

Dec 24, 2019

assembly floating-point x86 x86-64 simd intel

Sum reduction of unsigned bytes without overflow, using SSE2 on Intel

Apr 12, 2018

x86 sse simd sse2 sse3

Fast vectorized rsqrt and reciprocal with SSE/AVX depending on precision

Oct 24, 2021

performance sse simd avx

Using __m256d registers

Oct 18, 2022

c++ x86 intel simd avx

Load address calculation when using AVX2 gather instructions

Feb 15, 2022

x86 sse simd avx2

Branch and predicated instructions

Oct 27, 2022

cuda simd

SIMD the following code

Sep 21, 2022

c x86 sse simd

Why does the FMA _mm256_fmadd_pd() intrinsic have 3 asm mnemonics, "vfmadd132pd", "231" and "213"?

Jan 08, 2022

assembly x86 simd instruction-set fma

Can I use the AVX FMA units to do bit-exact 52 bit integer multiplications?

Jun 21, 2022

floating-point x86 simd avx2 fma

« Newer Entries Older Entries »