New posts in micro-optimization

Adding a byte from memory to the AX register

How much faster are SSE4.2 string instructions than SSE2 for memcmp?

How to get gcc to generate decent code that checks if a buffer is full of NUL bytes?

c gcc x86 micro-optimization

Efficient division operator for 3D vectors

Can accessing uninitialized values result in a performance hit?

Is there any point in creating a second column optimized for FULLTEXT searches?

Does the order of class members affect access speed?

Is CMOVcc considered a branching instruction?

Copy bit of one register to another register (x86-64 asm)

Should I inline operators == and != when operator!=() uses the negation of operator==()?

c++ micro-optimization

Mixing SSE with AVX128 for shorter instructions?

Why doesn't the C++ standard library utilize likely/unlikely attributes?

During thread contention, how can I speed up this ConcurrentQueue implementation, which uses ReaderWriterLockSlim over a regular Queue<T>?

Understanding `_mm_prefetch`

X86: How to set lower half of xmm0 to 0, without affecting the upper half?

Bottleneck when using indexed addressing modes

Loading an xmm from GP regs

68000 Assembly – Build a String from Characters *not* Present in Another & Return Its Length (stack-passed params)

Access of struct member faster if located <128 bytes from start?