Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in x86

fastest way to write a bitstream on modern x86 hardware

What is a Partial Flag Stall?

When using a mask register with AVX-512 load and stores, is a fault raised for invalid accesses to masked out elements?

x86 avx avx512

x86 Program Counter abstracted from microarchitecture?

Is it possible to vectorize myNum += a[b[i]] * c[i]; on x86_64?

About setjmp/longjmp

c linux x86 setjmp

Porting compiler from x86 Assembly to LLVM

What's the difference between __popcnt() and _mm_popcnt_u32()?

x86 sse intrinsics sse4

What's inside the stack?

c x86 stack low-level

What is the overhead of using Intel Last Branch Record?

Do the x86 virtualization instruction sets (VT-x, AMD-V) have alternate uses?

virtualization x86

Is vxorps-zeroing on AMD Jaguar/Bulldozer/Zen faster with xmm registers than ymm?

what's the difference between _mm256_lddqu_si256 and _mm256_loadu_si256

Two very similar functions involving sin() exhibit vastly different performance -- why?

L1 memory bandwidth: 50% drop in efficiency using addresses which differ by 4096+64 bytes

c caching memory x86 avx

is there an inverse instruction to the movemask instruction in intel avx2?

x86 intrinsics avx avx2 icc

Is integer overflow undefined in inline x86 assembly?

c gcc x86 undefined-behavior

128-bit values - From XMM registers to General Purpose

assembly x86 sse

Why does DOS set the SP register to 0xFFFE after loading a .COM file?

Concrete example of incorrect behavior of an early-clobber affecting a memory operand's addressing mode in GCC inline asm?