Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in avx
When using a mask register with AVX-512 load and stores, is a fault raised for invalid accesses to masked out elements?
Mar 31, 2022
x86
avx
avx512
Is vxorps-zeroing on AMD Jaguar/Bulldozer/Zen faster with xmm registers than ymm?
Jan 06, 2022
assembly
x86
avx
micro-optimization
amd-processor
what's the difference between _mm256_lddqu_si256 and _mm256_loadu_si256
Feb 04, 2022
x86
simd
intrinsics
avx
micro-optimization
Using AVX with GCC - avxintrin.h missing
Mar 08, 2022
c++
gcc
avx
AVX/SSE version of xorshift128+
Apr 12, 2022
c
performance
sse
avx
L1 memory bandwidth: 50% drop in efficiency using addresses which differ by 4096+64 bytes
May 02, 2022
c
caching
memory
x86
avx
is there an inverse instruction to the movemask instruction in intel avx2?
Dec 05, 2021
x86
intrinsics
avx
avx2
icc
Bitwise xor of two 256-bit integers
Nov 17, 2022
sse
simd
avx
Fastest Implementation of Exponential Function Using AVX
Sep 14, 2019
x86
simd
avx
exponential
avx2
Get sum of values stored in __m256d with SSE/AVX
Feb 09, 2022
c++
optimization
sse
avx
avx2
Why is GCC's AVX slower while LLVM's faster?
Mar 28, 2022
gcc
assembly
llvm
julia
avx
What's the fastest way to perform an arbitrary 128/256/512 bit permutation using SIMD instructions?
Mar 07, 2021
c++
assembly
sse
avx
avx2
8 bit shift operation in AVX2 with shifting in zeros
Jan 20, 2018
c
sse
simd
avx
avx2
Disabling AVX2 in CPU for testing purposes
Sep 16, 2022
testing
x86
avx
instruction-set
avx2
Does the Linux kernel have its own SSE/AVX context?
Oct 30, 2021
linux-kernel
sse
kernel-module
avx
Fastest way to expand bits in a field to all (overlapping + adjacent) set bits in a mask?
Nov 19, 2022
c
assembly
x86
sse
avx
What's the difference between vextracti128 and vextractf128?
Aug 24, 2022
x86
simd
avx
avx2
Horizontal minimum and maximum using SSE
May 20, 2019
c++
max
sse
minimum
avx
Using SIMD on amd64, when is it better to use more instructions vs. loading from memory?
Oct 16, 2022
x86-64
sse
simd
avx
microbenchmark
Half-precision floating-point arithmetic on Intel chips
Jan 20, 2020
x86
intel
avx
floating-point-conversion
half-precision-float
« Newer Entries
Older Entries »