Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in avx

Efficiently gather individual bytes, separated by a byte-stride of 4

Feb 27, 2022

c intrinsics avx

Need for fast data demuxing in C# by using multi-threading, AVX, GPU or whatever

Apr 20, 2022

c# multithreading algorithm performance avx

Preventing GCC from automatically using AVX and FMA instructions when compiled with -mavx and -mfma

Jun 21, 2020

c++ gcc vectorization avx fma

Large (0,1) matrix multiplication using bitwise AND and popcount instead of actual int or float multiplies?

Mar 28, 2021

c++ sse matrix-multiplication avx bitset

How to align stack at 32 byte boundary in GCC?

Oct 23, 2022

gcc stack sse avx

How to force gcc to use all SSE (or AVX) registers?

Nov 10, 2022

gcc 64-bit sse register-allocation avx

Horizontal XOR in AVX

Apr 25, 2022

c++ assembly x86 simd avx

Do 128bit cross lane operations in AVX512 give better performance?

Mar 23, 2019

performance x86 intel avx avx512

Parallel programming using Haswell architecture [closed]

Apr 12, 2015

sse cpu-architecture avx avx2

Does vzeroall zero registers ymm16 to ymm31?

Nov 20, 2022

assembly x86 intel avx avx512

Is L2 HW prefetcher really helpful?

Apr 25, 2022

c performance assembly x86-64 avx

AVX log intrinsics (_mm256_log_ps) missing in g++-4.8?

Nov 10, 2022

c++ g++ intrinsics avx

How to efficiently combine comparisons in SSE?

May 06, 2021

c optimization assembly sse avx

Fastest way to unpack 32 bits to a 32 byte SIMD vector

Jan 01, 2017

x86 simd avx bitmask avx2

Do all CPUs which support AVX2 also support SSE4.2 and AVX?

Nov 13, 2022

sse simd avx avx2

SSE runs slow after using AVX

Apr 26, 2021

c++ gcc x86 avx sse2

Does Clang have something like #pragma GCC target?

Sep 27, 2022

clang intrinsics avx pragma

What is the most efficient way to clear a single or a few ZMM registers on Knights Landing?

Feb 22, 2022

assembly avx xeon-phi avx512 knights-landing

Packing and de-interleaving two __m256 registers

Apr 18, 2022

c++ x86 simd avx avx2

« Newer Entries Older Entries »