Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in avx
Using sse and avx intrinsics to add a set of packed singles into one value
Dec 16, 2025
c++
c++11
sse
avx
Optimal uint8_t bitmap into a 8 x 32bit SIMD "bool" vector
Dec 13, 2025
c++11
simd
avx
avx2
Websocket data unmasking / multi byte xor
Dec 08, 2025
c
x86
sse
simd
avx
Does VS2010 SP1 support only part of the AVX instruction set?
Dec 07, 2025
c++
visual-studio-2010
sse
avx
fma
Difference between _mm256_xor_si256() and _mm256_xor_ps()
Dec 07, 2025
intrinsics
avx
avx2
C++ AVX2 Instrinsic function Non-Standard Size
Dec 05, 2025
c++
simd
intrinsics
avx
avx2
Different semantic of comparison intrinsic instructions in avx512?
Dec 05, 2025
c++
sse
intrinsics
avx
avx512
Integer dot product using SSE/AVX?
Dec 03, 2025
c++
vectorization
sse
simd
avx
Unpack 12-bit data quickly (where the nibbles aren't contiguous; how to shuffle nibbles?)
Nov 30, 2025
c#
c++
avx
avx2
pixelformat
Intel vector instruction to zero-extend 8 4-bit values packed in a 32-bit int to a __m256i?
Nov 25, 2025
sse
avx
avx2
How to implement 16 and 32 bit integer insert and extract operations with AVX-512?
Nov 22, 2025
intrinsics
avx
avx512
how abundant is hardware support for FMA instruction set
Nov 20, 2025
x86
hardware
sse
simd
avx
AVX equivalent for _mm_movelh_ps
Nov 19, 2025
c++
sse
intrinsics
avx
Add saturate 32-bit signed ints intrinsics?
Nov 17, 2025
x86
sse
intrinsics
avx
saturation-arithmetic
Mixing SSE with AVX128 for shorter instructions?
Nov 06, 2025
assembly
x86
sse
avx
micro-optimization
Is there a more efficient way to broadcast 4 contiguous doubles into 4 YMM registers?
Nov 05, 2025
gcc
intel
simd
intrinsics
avx
Best way to mask a single bit in AVX2?
Oct 29, 2025
c
x86
simd
avx
avx2
Simple AVX512 dot-product loop only 10.6x faster, expected 16x
Oct 29, 2025
c++
performance
avx
dot-product
avx512
AVX2: U8 absolute difference
Oct 29, 2025
sse
simd
neon
avx
avx2
How can I do efficiently bitwise majority voting on 3, 5, 7, 9 inputs with SSE/SSE2/AVX/...?
Oct 27, 2025
assembly
sse
avx
neon
avx512
Older Entries »