Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in sse

Why does _mm_stream_ps produce L1/LL cache misses?

Dec 11, 2017

c performance caching gcc sse

Where does the SSE instructions outperform normal instructions

Oct 27, 2017

c x86-64 sse

What is the difference between MOVDQA and MOVNTDQA, and VMOVDQA and VMOVNTDQ for WB/WC marked region?

Jun 09, 2020

assembly x86 sse simd avx

Visual Studio 2017: _mm_load_ps often compiled to movups

Jun 29, 2021

c++ assembly sse intrinsics visual-studio-2017

How do you move 128-bit values between XMM registers?

Feb 17, 2020

assembly simd sse

Use both SSE2 intrinsics and gcc inline assembler

Jan 17, 2022

gcc sse inline-assembly intrinsics

SSE3 intrinsics: How to find the maximum of a large array of floats

Jul 18, 2019

c++ sse intrinsics

Setting m256i to the value of two m128i values

Mar 30, 2019

c sse simd avx

Loading 8 chars from memory into an __m256 variable as packed single precision floats

Jun 17, 2021

c++ sse simd avx avx2

Shuffling by mask with Intel AVX

Mar 08, 2022

c++ sse simd intrinsics avx

Control flow divergence in SIMT and SIMD

May 11, 2022

cuda sse simd

Are there SIMD(SSE / AVX) instructions in the x86-compatible accelerators Intel Xeon Phi?

Nov 02, 2022

intel sse simd avx intel-mic

Faster lookup tables using AVX2

May 07, 2022

algorithm performance optimization sse simd

Does using mix of pxor and xorps affect performance?

Aug 26, 2021

assembly x86 sse simd

What is the minimum supported SSE flag that can be enabled on macOS?

Mar 26, 2022

c++ macos optimization compiler-optimization sse

Is casting to simd-type undefined behaviour in C++? [duplicate]

May 13, 2022

c++ sse undefined-behavior simd intrinsics

GCC - How to realign stack?

Nov 21, 2019

c gcc stack pthreads sse

What's the most efficient way to load and extract 32 bit integer values from a 128 bit SSE vector?

Nov 29, 2019

c gcc sse simd

Saturated substraction - AVX or SSE4.2

Sep 26, 2022

c gcc optimization sse avx

Writing a portable SSE/AVX version of std::copysign

Dec 15, 2021

c++ x86-64 sse simd avx

« Newer Entries Older Entries »