
New posts in micro-optimization

Controller performance in CodeIgniter

Compiler generating a mov back and forth on eax

Missing latency in instruction tables

What is the fastest way to swap the bytes of an unaligned 64 bit value in memory?

Why is clang's `-O3` alloca 2x faster than g++

Is there any data on the latency of an AVX2 gather instruction?

Why don't cases in switch statements have their own scope? [duplicate]

Which is generally faster to test for zero in x86 ASM: "TEST EAX, EAX" versus "TEST AL, AL"?

Does cmpxchg write destination cache line on failure? If not, is it better than xchg for spinlock?

Does adding local variables to methods make them slower?

What nonlocal effects can change the performance of a basic block?

Why jnz requires 2 cycles to complete in an inner loop

Efficient modulo-255 computation

Efficient computation of the average of three unsigned integers (without overflow)