Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in simd

Shuffling by mask with Intel AVX

Mar 08, 2022

c++ sse simd intrinsics avx

Control flow divergence in SIMT and SIMD

May 11, 2022

cuda sse simd

Are there SIMD(SSE / AVX) instructions in the x86-compatible accelerators Intel Xeon Phi?

Nov 02, 2022

intel sse simd avx intel-mic

Faster lookup tables using AVX2

May 07, 2022

algorithm performance optimization sse simd

Does using mix of pxor and xorps affect performance?

Aug 26, 2021

assembly x86 sse simd

Is there an efficient way to get the first non-zero element in an SIMD register using SIMD intrinsics?

Oct 23, 2022

x86 bit-manipulation simd intrinsics avx

Using a variable to index a simd vector with _mm256_extract_epi32() intrinsic

Feb 26, 2022

simd intrinsics avx avx2

Is casting to simd-type undefined behaviour in C++? [duplicate]

May 13, 2022

c++ sse undefined-behavior simd intrinsics

What's the most efficient way to load and extract 32 bit integer values from a 128 bit SSE vector?

Nov 29, 2019

c gcc sse simd

ARM and NEON can work in parallel?

Oct 30, 2018

arm inline-assembly simd neon cortex-a8

How to cast SIMD int vectors to float in GCC?

Jul 19, 2018

c gcc vectorization simd

Writing a portable SSE/AVX version of std::copysign

Dec 15, 2021

c++ x86-64 sse simd avx

How to convert byte array of image pixels data to grayscale using vector SSE operation

Sep 11, 2022

c# image-processing vectorization sse simd

How to reverse an __m128 type variable?

Apr 22, 2022

c++ c x86 sse simd

SSE intrinsic over int16[8] to extract the sign of each element

Mar 21, 2022

c x86 sse simd sign

Count leading zeros in __m256i word

Sep 15, 2022

c x86 simd intrinsics avx

How to perform uint32/float conversion with SSE?

Jan 12, 2022

c x86 sse simd

Why do processors with only AVX out-perform AVX2 processors for many SIMD algorithms?

Sep 17, 2019

c# c++ simd avx avx2

Which one is better, gcc or armcc for NEON optimizations?

Jun 18, 2021

embedded arm simd neon cortex-a8

Fast interleave 2 double arrays into an array of structs with 2 float and 1 int (loop invariant) member, with SIMD double->float conversion?

Oct 04, 2022

c++ x86 simd intrinsics avx

« Newer Entries Older Entries »