New posts in simd
AVX 3.6x slower than IA32 in simple benchmark involving <cmath> operations - why so? (VS2013)
Mar 31, 2026
c++
visual-studio
sse
simd
avx
Bus error on neon implementation of summary SAD (Sum of Absolute Difference)
Mar 30, 2026
arm
simd
neon
What is the availability of 'vector long long'?
Mar 29, 2026
64-bit
simd
intrinsics
powerpc
altivec
Why is 4x4 Matrix Multiplication in Eigen More Than Twice as Fast as 3x3?
Mar 27, 2026
c++
assembly
eigen
matrix-multiplication
simd
How to implement vectorize "exp" and "log" base-2 functions using AVX-512
Mar 27, 2026
assembly
simd
logarithm
avx512
exp
Does SIMD require a multi-core CPU?
Mar 28, 2026
cpu
cpu-architecture
simd
Writing a piece of C code such that compiler uses SSE4.1 instruction for generating assembly Code
Mar 22, 2026
c
optimization
gcc
sse
simd
xtensor and xsimd: improve performance on reduction
Mar 22, 2026
python
c++
numpy
simd
xtensor
Emulating shifts on 64 bytes with AVX-512
Mar 22, 2026
simd
avx512
Euclidean distance using intrinsic instruction
Mar 19, 2026
c++
sse
simd
euclidean-distance
Broadcast one arbitrary element of __m128 vector
Mar 18, 2026
c++
x86
sse
simd
sse2
Seeded Random Uniform float generator using SIMD? [duplicate]
Mar 18, 2026
c++
random
floating-point
simd
avx
SSE2 8x8 byte-matrix transpose code twice as slow on Haswell+ than on Ivy Bridge
Mar 17, 2026
performance
assembly
x86
sse
simd
Loop is not vectorized when variable extent is used
Mar 16, 2026
c++
vectorization
sse
compiler-optimization
simd
SIMD transpose when row size is greater than vector width
Mar 15, 2026
matrix
transpose
simd
avx
avx2
Does using SIMD have an initialisation cost
Mar 10, 2026
x86-64
simd
arm64
Sign of the maximum absolute value in an __m128, SSE4
Mar 09, 2026
c++
sse
simd