New posts in avx2

Is there any data on the latency of an AVX2 gather instruction?

Oct 05, 2022

performance x86 latency micro-optimization avx2

What is packed and unpacked and extended packed data

Feb 10, 2022

cpu-architecture sse simd avx avx2

AVX2 code slower than without AVX2

Nov 19, 2021

intel c++ performance x86 avx2

Error: suffix or operands invalid for `vbroadcastss'

Dec 11, 2019

python compiler-errors avx avx2

How can I convert a vector of float to short int using avx instructions?

Oct 18, 2022

c++ c gcc avx avx2

Using values from `__m256i` to access an array efficiently - SIMD [closed]

May 26, 2022

c++ arrays simd avx2

What is the inverse of "_mm256_cvtepi16_epi32"

Feb 14, 2022

x86 g++ intrinsics avx avx2

Why does Tensorflow warn about AVX2 while I am using MKL?

Jun 06, 2022

tensorflow keras anaconda intel-mkl avx2

Optimize extraction of 64 bit value from AVX2 register

Oct 31, 2015

c sse avx avx2

Get an arbitrary float from a simd register at runtime?

Sep 15, 2022

x86 sse simd avx avx2

How do I broadcast the lowest word of a __m256i?

Jul 25, 2021

intrinsics avx2

c++ AVX512 intrinsic equivalent of _mm256_broadcast_ss()?

Dec 14, 2021

c++ intel intrinsics avx2 avx512

AVX alternative of AVX2's vector shift?

Mar 21, 2022

c++ bitwise-operators bit-shift avx avx2

How to increment a vector in AVX/AVX2

Oct 31, 2022

assembly x86 simd intrinsics avx2

AVX2 float compare and get 0.0 or 1.0 instead of all-0 or all-one bits

Oct 14, 2021

c++ sse simd avx avx2

avx2 register bits reverse

Apr 19, 2022

c++ x86 simd avx2

How to vectorise int8 multiplication in C (AVX2)

Aug 02, 2022

c x86 simd intrinsics avx2

Emulating shifts on 32 bytes with AVX

Sep 24, 2020

c++ simd intrinsics sse2 avx2

Fastest way to multiply an array of int64_t?

Nov 27, 2016

c vectorization multiplication avx avx2

AVX 256-bit code performing slightly worse than equivalent 128-bit SSSE3 code

Jun 06, 2022

c++ performance sse avx2
