New posts in avx

Setting m256i to the value of two m128i values

Mar 30, 2019

c sse simd avx

Loading 8 chars from memory into an __m256 variable as packed single precision floats

Jun 17, 2021

c++ sse simd avx avx2

Unknown type name __m256 - Intel intrinsics for AVX not recognized?

Mar 13, 2022

c++ c intel intrinsics avx

Shuffling by mask with Intel AVX

Mar 08, 2022

c++ sse simd intrinsics avx

How to probe the availability of Intel® Advanced Vector Extensions?

Sep 15, 2022

delphi delphi-2007 avx basm

Are there SIMD (SSE / AVX) instructions in the x86-compatible accelerators Intel Xeon Phi?

Nov 02, 2022

intel sse simd avx intel-mic

Is there an efficient way to get the first non-zero element in an SIMD register using SIMD intrinsics?

Oct 23, 2022

x86 bit-manipulation simd intrinsics avx

Using a variable to index a simd vector with _mm256_extract_epi32() intrinsic

Feb 26, 2022

simd intrinsics avx avx2

Saturated subtraction - AVX or SSE4.2

Sep 26, 2022

c gcc optimization sse avx

Writing a portable SSE/AVX version of std::copysign

Dec 15, 2021

c++ x86-64 sse simd avx

Count leading zeros in __m256i word

Sep 15, 2022

c x86 simd intrinsics avx

Why do processors with only AVX out-perform AVX2 processors for many SIMD algorithms?

Sep 17, 2019

c# c++ simd avx avx2

Fast interleave 2 double arrays into an array of structs with 2 float and 1 int (loop invariant) member, with SIMD double->float conversion?

Oct 04, 2022

c++ x86 simd intrinsics avx

Using SIMD/AVX/SSE for tree traversal

Apr 12, 2019

performance assembly simd micro-optimization avx

Fastest way to perform AVX inner product operations with mixed (float, double) input vectors

Nov 10, 2019

c++ vectorization simd avx sse2

Using ymm registers as a "memory-like" storage location

Dec 07, 2020

assembly x86 sse avx

Matrix-vector-multiplication in AVX not proportionately faster than in SSE

Dec 07, 2021

c++ vectorization sse matrix-multiplication avx

How to concatenate two vectors efficiently using AVX2? (a lane-crossing version of VPALIGNR)

Mar 08, 2022

c simd intrinsics avx avx2

AVX 256-bit equivalent for _mm_load1_ps

Mar 14, 2018

simd intrinsics avx