New posts in simd
What's the best way to load 2 unaligned 64-bit values into an sse register with SSSE3?
May 03, 2022
sse
simd
intrinsics
Add all elements in a lane
Oct 18, 2018
c
arm
simd
neon
Vector SIMD types in Swift
Nov 08, 2022
vector
types
swift
simd
Horizontal add with __m512 (AVX512)
Nov 30, 2018
simd
intrinsics
avx512
What happened to microsoft.bcl.simd?
Aug 31, 2022
c#
vector
sse
simd
Divide 8-bit integers by 4 (or shift) using SSE
Feb 04, 2022
c++
x86
sse
simd
intrinsics
How can I use SVML instructions? [duplicate]
Jun 19, 2021
c++
x86
sse
simd
sse/avx equivalent for neon vuzp
Mar 20, 2022
sse
simd
neon
avx
Will gfortran or ifort compilers wisely use SIMD instructions when summing the product of two arrays?
Feb 25, 2021
fortran
gfortran
simd
intel-fortran
fma
What is meant by "fixing up" floats?
Jun 04, 2022
simd
intrinsics
avx512
OpenMP SIMD on Power8
Dec 08, 2020
openmp
vectorization
simd
powerpc
Scaling byte pixel values (y=ax+b) with SSE2 (as floats)?
May 28, 2022
c++
visual-studio
x86
simd
sse2
When should I use DO CONCURRENT and when OpenMP?
Oct 16, 2022
concurrency
fortran
openmp
simd
fortran2008
How to efficiently perform int8/int64 conversion with SSE?
Jun 09, 2022
c++
x86
sse
simd
intrinsics
Meaning of suffix "x" in intrinsics like "_mm256_set1_epi64x"
Sep 15, 2021
x86
vectorization
x86-64
simd
intrinsics
How to optimise this 8-bit positional popcount using assembly?
May 16, 2022
go
assembly
x86
simd
avx
No speedup when summing uint16 vs uint64 arrays with NumPy?
Sep 05, 2022
python
numpy
performance
compiler-optimization
simd
SSE SIMD Optimization For Loop
Nov 09, 2022
visual-c++
sse
simd
OpenCL distribution
Nov 12, 2022
installation
cross-platform
distribution
opencl
simd
neon float multiplication is slower than expected
Aug 10, 2022
c++
gcc
arm
simd
neon