Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in avx
Simd matmul program gives different numerical results
Apr 23, 2021
c
floating-point
vectorization
simd
avx
Intel AVX : Why is there no 256-bits version of dot product for double precision floating point variables? [closed]
Oct 19, 2021
c++
performance
simd
avx
Checking if SSE is supported at runtime [duplicate]
Feb 24, 2022
c++
c
sse
simd
avx
SIMD string to unsigned int parsing in C# performance improvement
Mar 18, 2022
c#
sse
simd
avx
system.numerics
are static / static local SSE / AVX variables blocking a xmm / ymm register?
Aug 17, 2022
c++
sse
avx
vectorized sum in Fortran
Mar 11, 2022
fortran
sse
gfortran
simd
avx
Ensuring that Eigen uses AVX vectorization for a certain operation
Sep 05, 2022
c++
vectorization
eigen
simd
avx
How are AVX registers handled by the common calling conventions?
Aug 16, 2022
windows
linux
calling-convention
avx
Potential bug in Visual Studio C compiler or in Intel Intrinsics' AVX2 "_mm256_set_epi64x" function
Oct 10, 2020
c++
visual-studio
intrinsics
avx
compiler-bug
Copying 64 bytes of memory with NT stores to one full cache line vs. 2 consecutive partial cache lines
Apr 05, 2022
c
performance
assembly
x86
avx
Why two bitwise or AVX instructions? [duplicate]
Jan 20, 2020
x86
bit-manipulation
avx
instructions
bitwise-or
Can I generate AVX vectorized code using LLVM jit?
Feb 15, 2020
x86
llvm
jit
avx
find nan in array of doubles using simd
Jun 13, 2022
c
nan
sse
simd
avx
How to store lower or higher values from AVX/AVX2(YMM) register to memory like the SSE movlps/movhps does?
Feb 21, 2017
x86
sse
simd
avx
avx2
Small branches in modern CPUs
May 23, 2022
performance
x86-64
cpu-architecture
avx
branch-prediction
SIMD minmag and maxmag
Feb 12, 2022
assembly
floating-point
x86
sse
avx
The indices of non-zero bytes of an SSE/AVX register
Feb 06, 2022
c++
c
sse
simd
avx
perf report shows this function "__memset_avx2_unaligned_erms" has overhead. does this mean memory is unaligned?
Oct 17, 2020
c++
profiling
avx
perf
avx2
Is using AVX2 can implement a faster processing of LZCNT on a word array?
Oct 05, 2020
x86
simd
avx
micro-optimization
avx2
« Newer Entries
Older Entries »