Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in simd
Load address calculation when using AVX2 gather instructions
Feb 15, 2022
x86
sse
simd
avx2
Branch and predicated instructions
Oct 27, 2022
cuda
simd
SIMD the following code
Sep 21, 2022
c
x86
sse
simd
Why does the FMA _mm256_fmadd_pd() intrinsic have 3 asm mnemonics, "vfmadd132pd", "231" and "213"?
Jan 08, 2022
assembly
x86
simd
instruction-set
fma
Can I use the AVX FMA units to do bit-exact 52 bit integer multiplications?
Jun 21, 2022
floating-point
x86
simd
avx2
fma
How can I disable vectorization while using GCC?
Sep 20, 2022
gcc
vectorization
sse
simd
auto-vectorization
Fastest way to compute distance squared
Oct 16, 2022
c
optimization
simd
How to transpose a 16x16 matrix using SIMD instructions?
Sep 20, 2022
assembly
matrix
intel
simd
avx512
How to quickly count bits into separate bins in a series of ints on Sandy Bridge? [duplicate]
Oct 31, 2022
c++
assembly
x86
simd
avx
Fast 24-bit array -> 32-bit array conversion?
Mar 04, 2022
c
bitmap
bit-manipulation
sse
simd
Count each bit-position separately over many 64-bit bitmasks, with AVX but not AVX2
Oct 31, 2022
c
optimization
x86
x86-64
simd
GCC C vector extension: How to check if result of ANY element-wise comparison is true, and which?
May 01, 2021
c
gcc
comparison
vectorization
simd
How can I try out SIMD instructions in Chrome?
Mar 09, 2022
javascript
google-chrome
32bit-64bit
simd
RyuJIT not making full use of SIMD intrinsics
Nov 01, 2022
c#
sse
simd
avx
ryujit
AVX2: Computing dot product of 512 float arrays
Apr 07, 2022
c++
simd
avx2
dot-product
fma
Shift a __m128i of n bits
Oct 31, 2019
c
x86
sse
simd
sse2
Why does does SSE set (_mm_set_ps) reverse the order of arguments
Mar 02, 2022
c++
c
simd
sse
intrinsics
Taking advantage of SSE and other CPU extensions
Feb 04, 2017
c++
gcc
cross-platform
visual-c++
simd
Number of Compute Units corresponding to the number of work groups
Sep 26, 2022
opencl
nvidia
simd
How to use the multiply and accumulate intrinsics in ARM Cortex-a8?
May 13, 2019
c
arm
simd
intrinsics
neon
« Newer Entries
Older Entries »