Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in simd

How to write c++ code that the compiler can efficiently compile to SSE or AVX?

Jan 17, 2020

visual-c++ sse simd avx auto-vectorization

Find the first instance of a character using simd

Mar 13, 2020

x86 sse simd avx avx2

AVX2 instructions latency and throughput

Mar 22, 2022

performance x86 x86-64 simd avx2

Intel IACA analyzer alters assembly?

Sep 15, 2022

assembly simd avx2 iaca

Bitwise-AND Slower with SIMD than Scalar

Oct 19, 2022

performance gcc bit-manipulation simd scalar

What is the fastest way to do a SIMD gather without AVX(2)?

Aug 22, 2018

x86 sse simd sse4

difference between load1 and broadcast intrinsics

Aug 11, 2022

x86 sse simd intrinsics intel

SSE and AVX intrinsics mixture

Apr 13, 2022

c++ performance sse simd avx

How does endianness work with SIMD registers?

Mar 09, 2022

x86 sse endianness simd

Implementation of bit rotate operators using SIMD in CUDA

Oct 27, 2021

c++ cuda bitwise-operators simd

Multithreaded & SIMD vectorized Mandelbrot in R using Rcpp & OpenMP

Mar 11, 2022

multithreading openmp rcpp simd mandelbrot

BMI for generating masks with AVX512

Aug 26, 2022

x86 simd avx512 bmi

transpose for 8 registers of 16-bit elements on SSE2/SSSE3

Oct 11, 2019

assembly matrix x86 sse simd

Why is permute needed in parallel SIMD/SSE/AVX ?

Jun 16, 2018

permutation sse simd avx

Is this function a good candidate for SIMD on Intel?

Jun 14, 2022

c++ c optimization simd

Extract set bytes position from SIMD vector

Mar 05, 2022

c++ sse simd intrinsics

_mm256_slli_si256: error "last argument must be an 8-bit intermediate"

Oct 20, 2019

c gcc simd avx avx2

Why doesn't Intel design its SIMD ISAs in a more compatible or universal way?

May 30, 2022

intel simd avx avx2 avx512

What are these extra disassembly instructions when using SIMD intrinsics?

Mar 03, 2018

c# .net simd ryujit

Fastest way to horizontally sum SSE unsigned byte vector

Dec 26, 2021

c++ x86 sse simd

« Newer Entries Older Entries »