New posts in simd

SIMD vs Vector architectures

Nov 16, 2022

simd computer-architecture

Fastest way to unpack 32 bits to a 32 byte SIMD vector

Jan 01, 2017

x86 simd avx bitmask avx2

Do all CPUs which support AVX2 also support SSE4.2 and AVX?

Nov 13, 2022

sse simd avx avx2

Storing two x86 32 bit registers into 128 bit xmm register

Mar 28, 2022

assembly x86 simd sse

What are the 128-bit to 512-bit registers used for?

Mar 17, 2022

assembly x86-64 sse simd cpu-registers

Most efficient way to check if all __m128i components are 0 [using <= SSE4.1 intrinsics]

Nov 29, 2021

c++ integer sse simd intrinsics

AVX2 slower than SSE on Haswell

May 18, 2017

c++ x86 sse simd avx2

How to convert a binary integer number to a hex string?

Nov 06, 2022

assembly x86 hex simd avx512

How to work with a 128-bit C variable and 128-bit xmm asm?

Mar 30, 2022

c sse simd

SSE micro-optimization instruction order

Feb 20, 2022

c++ optimization sse simd micro-optimization

approximating log10[x^k0 + k1]

Oct 16, 2021

optimization math sse simd approximation

Vectorize a function in clang

Oct 29, 2022

c++ vector simd clang++

gcc, simd intrinsics and fast-math concepts

Oct 14, 2022

gcc simd intrinsics fast-math

Packing and de-interleaving two __m256 registers

Apr 18, 2022

c++ x86 simd avx avx2

Why both? vperm2f128 (avx) vs vperm2i128 (avx2)

Nov 15, 2022

intel simd avx avx2

Is there a good double-precision small matrix SIMD library for x86?

Oct 01, 2018

c++ sse simd matrix-multiplication

Most efficient way to store 4 dot products into a contiguous array in C using SSE intrinsics

Nov 17, 2021

c sse simd intrinsics dot-product

Fast counting the number of equal bytes between two arrays [duplicate]

Oct 04, 2019

c++ c sse simd sse2

Is it possible to use SIMD instructions in Rust?

Feb 07, 2022

rust simd avx avx2

Is it possible to vectorize myNum += a[b[i]] * c[i]; on x86_64?

Apr 16, 2022

x86 x86-64 sse simd vectorization
