Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in avx
Intel vector instruction to zero-extend 8 4-bit values packed in a 32-bit int to a __m256i?
Nov 25, 2025
sse
avx
avx2
How to implement 16 and 32 bit integer insert and extract operations with AVX-512?
Nov 22, 2025
intrinsics
avx
avx512
how abundant is hardware support for FMA instruction set
Nov 20, 2025
x86
hardware
sse
simd
avx
AVX equivalent for _mm_movelh_ps
Nov 19, 2025
c++
sse
intrinsics
avx
Add saturate 32-bit signed ints intrinsics?
Nov 17, 2025
x86
sse
intrinsics
avx
saturation-arithmetic
Mixing SSE with AVX128 for shorter instructions?
Nov 06, 2025
assembly
x86
sse
avx
micro-optimization
Is there a more efficient way to broadcast 4 contiguous doubles into 4 YMM registers?
Nov 05, 2025
gcc
intel
simd
intrinsics
avx
Best way to mask a single bit in AVX2?
Oct 29, 2025
c
x86
simd
avx
avx2
Simple AVX512 dot-product loop only 10.6x faster, expected 16x
Oct 29, 2025
c++
performance
avx
dot-product
avx512
AVX2: U8 absolute difference
Oct 29, 2025
sse
simd
neon
avx
avx2
How can I do efficiently bitwise majority voting on 3, 5, 7, 9 inputs with SSE/SSE2/AVX/...?
Oct 27, 2025
assembly
sse
avx
neon
avx512
avx three operands for sqrt?
Oct 28, 2025
assembly
x86
simd
instructions
avx
Convention for displaying vector registers
Oct 29, 2025
x86
sse
simd
avx
How to further optimize matrix multiplication in llm.c project?
Oct 27, 2025
c
optimization
matrix-multiplication
avx
neon
SIMD: Bit-pack signed integers
Oct 27, 2025
sse
simd
avx
avx2
avx512
Logical shift between YMM registers
Oct 27, 2025
assembly
x86-64
avx
avx2
avx512
Code alignment in one object file is affecting the performance of a function in another object file
Oct 27, 2025
c
assembly
x86
nasm
avx
Rust target-cpu=native gets slower SIMD execution
Oct 25, 2025
rust
simd
intrinsics
avx
Accumulating Doubles Into Bins via intrinsics
Oct 23, 2025
c++
simd
avx
avx2
AVX2: Is there a way to implement _mm256_mul_epi8 function for a constant power of 2?
Oct 23, 2025
c++
simd
intrinsics
avx
avx2
Older Entries »