New posts in sse
C code to auto-vectorize floating point minimum
Nov 14, 2022
c
gcc
vectorization
sse
simd
Why is prefetch speedup not greater in this example?
Nov 12, 2022
visual-studio-2012
intel
sse
Unpacking 8 to 16-bit using SIMD: AVX2 version mixes up the order
Nov 08, 2022
c++
simd
sse
avx2
valarray on aligned memory for SSE / AVX
Nov 05, 2022
c++
sse
avx
valarray
gdb: SSE register output format
Nov 03, 2022
debugging
assembly
gdb
sse
cpu-registers
Floating point range reduction
Nov 03, 2022
c#
mono
sse
simd
ieee-754
How do I extract 32 x 4-bit integer from 16 x 8-bit __m128i value
Oct 25, 2022
x86
bit-manipulation
sse
simd
Strange /fp Floating Point Model flag behavior
Oct 23, 2022
c
visual-studio-2010
visual-studio-2012
floating-point
sse
SIMD Implementation of std::nth_element
Oct 23, 2022
c++
performance
sse
simd
stl-algorithm
VC++ SSE code generation - is this a compiler bug?
Oct 21, 2022
visual-c++
assembly
x86
sse
visual-studio-debugging
determinant calculation with SIMD
Oct 22, 2022
sse
simd
neon
determinants
_mm_sad_epu8 faster than _mm_sad_pu8
Oct 21, 2022
c
sse
intrinsics
Check if DLL uses SSE instructions
Oct 21, 2022
visual-c++
assembly
dll
x86
sse
MOVAPS accesses unaligned address
Oct 21, 2022
c++
visual-studio-2013
sse
memory-alignment
disassembly
Vectorization - Speed up expected for SSE, AVX and AVX2
Oct 19, 2022
c
vectorization
sse
avx
avx512
Work around lack of Yz machine constraint under Clang?
Oct 20, 2022
c++
clang
sse
inline-assembly
sha
Is it possible to popcount __m256i and store the result in 8 32-bit words instead of 4 64-bit words using Wojciech Mula's algorithm?
Oct 18, 2022
c++
intel
sse
avx
avx2
MSYS2 GCC zeros out doubles on floating point operations with SSE disabled
Oct 19, 2022
gcc
x86-64
sse
calling-convention
msys2
What's the proper way to use different versions of SSE intrinsics in GCC?
Sep 20, 2022
c
gcc
sse
intrinsics
SSE vector wrapper type performance compared to bare __m128
Dec 07, 2020
c++
assembly
optimization
x86
sse