Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in simd

The indices of non-zero bytes of an SSE/AVX register

Feb 06, 2022

c++ c sse simd avx

Accessing arbitrary 16-bit elements packed in a 128-bit register

Jun 14, 2022

assembly sse simd micro-optimization intrinsics

SIMD XOR operation is not as effective as Integer XOR?

Jul 26, 2021

c++ performance parallel-processing simd seeding

Auto vectorization not working

Oct 22, 2022

c++ optimization vectorization sse simd

How does this function compute the absolute value of a float through a NOT and AND operation?

Nov 03, 2022

c++ bit-manipulation sse simd absolute-value

SSE instruction to sum 32 bit integers to 64 bit

Sep 10, 2022

sse simd

Is using AVX2 can implement a faster processing of LZCNT on a word array?

Oct 05, 2020

x86 simd avx micro-optimization avx2

How to make premultiplied alpha function faster using SIMD instructions?

Jun 04, 2022

c++ x86 sse simd avx

SIMD (AVX) compare

Nov 07, 2022

c gcc sse simd

Minimum of 4 SP values in __m128

May 31, 2022

c sse simd

Compiling SSE intrinsics in GCC gives an error

Oct 31, 2018

gcc x86 intel sse simd

Why use SIMD if we have GPGPU? [closed]

Nov 02, 2022

cuda gpgpu simd computer-architecture cpu-architecture

AVX2, How to Efficiently Load Four Integers to Even Indices of a 256 Bit Register and Copy to Odd Indices?

Oct 07, 2018

x86 sse simd avx avx2

Why are SIMD instructions not used in kernel?

Sep 10, 2022

linux-kernel operating-system linux-device-driver simd ispc

How to convert 32-bit float to 8-bit signed char? (4:1 packing of int32 to int8 __m256i)

Jan 24, 2022

c x86 simd intrinsics avx2

Summing 3 lanes in a NEON float32x4_t

Dec 01, 2020

ios arm simd neon intrinsics

What is the difference between MOVDQA and MOVNTDQA, and VMOVDQA and VMOVNTDQ for WB/WC marked region?

Jun 09, 2020

assembly x86 sse simd avx

AVX2 VPSHUFB emulation in AVX

Oct 01, 2018

x86 simd intrinsics avx

_mm_alignr_epi8 (PALIGNR) equivalent in AVX2

Sep 01, 2020

x86 simd intrinsics avx avx2

How do you move 128-bit values between XMM registers?

Feb 17, 2020

assembly simd sse

« Newer Entries Older Entries »