Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in avx

GCC emits vastly different code using "-march=native" on similar architectures

Sep 10, 2022

c gcc assembly sse avx

How to quickly count bits into separate bins in a series of ints on Sandy Bridge? [duplicate]

Oct 31, 2022

c++ assembly x86 simd avx

Scatter intrinsics in AVX

Feb 20, 2022

intrinsics avx avx2

Vectorizing with unaligned buffers: using VMASKMOVPS: generating a mask from a misalignment count? Or not using that insn at all

Feb 17, 2022

gcc assembly x86 sse avx

RyuJIT not making full use of SIMD intrinsics

Nov 01, 2022

c# sse simd avx ryujit

Unaligned load versus unaligned store

Dec 01, 2020

c++ performance x86 memory-alignment avx

When the compiler reorders AVX instructions on Sandy, does it affect performance?

Mar 05, 2022

c performance optimization intrinsics avx

Is it worth bothering to align AVX-256 memory stores?

Mar 16, 2022

performance assembly x86-64 memory-alignment avx intel

Why do SSE instructions preserve the upper 128-bit of the YMM registers?

Sep 17, 2022

performance x86 avx

Is NOT missing from SSE, AVX?

Feb 02, 2022

x86 bit-manipulation simd sse avx

How to solve the 32-byte-alignment issue for AVX load/store operations?

Jun 26, 2022

c++ sse simd memory-alignment avx

Transpose an 8x8 float using AVX/AVX2

Feb 24, 2022

simd avx avx2

How to find the horizontal maximum in a 256-bit AVX vector

Mar 28, 2014

x86 simd avx vector-processing avx2

AVX VMOVDQA slower than two SSE MOVDQA?

Mar 25, 2022

assembly sse bignum arbitrary-precision avx

How to sum __m256 horizontally?

Sep 03, 2021

sse vectorization intrinsics avx

Loop unrolling to achieve maximum throughput with Ivy Bridge and Haswell

Aug 30, 2022

c++ x86 intel sse avx

Does ICC satisfy C99 specs for multiplication of complex numbers?

Oct 24, 2022

c assembly complex-numbers avx icc

How to rotate an SSE/AVX vector

Oct 11, 2018

c x86 sse intrinsics avx

Disable AVX-optimized functions in glibc (LD_HWCAP_MASK, /etc/ld.so.nohwcap) for valgrind & gdb record

Mar 03, 2022

linux linker gdb glibc avx

Choice between aligned vs. unaligned x86 SIMD instructions

Aug 11, 2022

x86 sse simd avx avx512

« Newer Entries Older Entries »