Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in avx
Setting __m256i to the value of two __m128i values
Mar 30, 2019
c
sse
simd
avx
Loading 8 chars from memory into an __m256 variable as packed single precision floats
Jun 17, 2021
c++
sse
simd
avx
avx2
Unknown type name __m256 - Intel intrinsics for AVX not recognized?
Mar 13, 2022
c++
c
intel
intrinsics
avx
Shuffling by mask with Intel AVX
Mar 08, 2022
c++
sse
simd
intrinsics
avx
How to probe the availability of Intel® Advanced Vector Extensions?
Sep 15, 2022
delphi
delphi-2007
avx
basm
Are there SIMD(SSE / AVX) instructions in the x86-compatible accelerators Intel Xeon Phi?
Nov 02, 2022
intel
sse
simd
avx
intel-mic
Is there an efficient way to get the first non-zero element in an SIMD register using SIMD intrinsics?
Oct 23, 2022
x86
bit-manipulation
simd
intrinsics
avx
Using a variable to index a simd vector with _mm256_extract_epi32() intrinsic
Feb 26, 2022
simd
intrinsics
avx
avx2
Saturated substraction - AVX or SSE4.2
Sep 26, 2022
c
gcc
optimization
sse
avx
Writing a portable SSE/AVX version of std::copysign
Dec 15, 2021
c++
x86-64
sse
simd
avx
Count leading zeros in __m256i word
Sep 15, 2022
c
x86
simd
intrinsics
avx
Why do processors with only AVX out-perform AVX2 processors for many SIMD algorithms?
Sep 17, 2019
c#
c++
simd
avx
avx2
Fast interleave 2 double arrays into an array of structs with 2 float and 1 int (loop invariant) member, with SIMD double->float conversion?
Oct 04, 2022
c++
x86
simd
intrinsics
avx
Using SIMD/AVX/SSE for tree traversal
Apr 12, 2019
performance
assembly
simd
micro-optimization
avx
Fastest way to perform AVX inner product operations with mixed (float, double) input vectors
Nov 10, 2019
c++
vectorization
simd
avx
sse2
Using ymm registers as a "memory-like" storage location
Dec 07, 2020
assembly
x86
sse
avx
Matrix-vector-multiplication in AVX not proportionately faster than in SSE
Dec 07, 2021
c++
vectorization
sse
matrix-multiplication
avx
How to concatenate two vector efficiently using AVX2? (a lane-crossing version of VPALIGNR)
Mar 08, 2022
c
simd
intrinsics
avx
avx2
AVX 256-bit equivalent for _mm_load1_ps
Mar 14, 2018
simd
intrinsics
avx
« Newer Entries
Older Entries »