Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in simd

approximating log10[x^k0 + k1]

Vectorize a function in clang

c++ vector simd clang++

gcc, simd intrinsics and fast-math concepts

gcc simd intrinsics fast-math

Packing and de-interleaving two __m256 registers

c++ x86 simd avx avx2

Why both? vperm2f128 (avx) vs vperm2i128 (avx2)

intel simd avx avx2

Is there a good double-precision small matrix SIMD library for x86?

Most efficient way to store 4 dot products into a contiguous array in C using SSE intrinsics

Fast counting the number of equal bytes between two arrays [duplicate]

c++ c sse simd sse2

Is it possible to use SIMD instructions in Rust?

rust simd avx avx2

Is it possible to vectorize myNum += a[b[i]] * c[i]; on x86_64?

what's the difference between _mm256_lddqu_si256 and _mm256_loadu_si256

Bitwise xor of two 256-bit integers

sse simd avx

A better 8x8 bytes matrix transpose with SSE?

c matrix optimization sse simd

Fastest Implementation of Exponential Function Using AVX

x86 simd avx exponential avx2

SIMD programming languages

How do I reorder vector data using ARM Neon intrinsics?

arm simd neon intrinsics

Testing equality between two __m128i variables

c x86 sse simd

Complex Mul and Div using sse Instructions

x86 sse simd complex-numbers

SSE: convert short integer to float

x86 sse simd

Why do I get this compile error with GCC 5 and cilk-plus?

c gcc simd cilk-plus