Shortest Intel x86-64 opcode for rax=1?

2 Answers

Since there is a byte immediate encoding for push and a one-byte pop for registers, this can be done in three bytes: 6a 01 58, or push $1 / pop %rax.

187

answered Oct 16 '22 12:10

gsg

With any known pre-conditions, there are some tricks that are more efficient (in terms of speed) than the push imm8/pop rax 3-byte solution.

For speed mov eax, 1 has many advantages, because it doesn't have any input dependencies and it's only one instruction. Out-of-order execution can get started on it (and anything that depends on it) without waiting for other stuff. (See Agner Fog's guides and the x86 tag wiki).

Obviously many of these take advantage of the fact that writing a 32-bit register zeros the upper half, to avoid the unnecessary REX prefix of the OP's code. (Also note that xor rax,rax is not special-cased as a zeroing idiom on Silvermont. It only recognizes xor-zeroing of 32-bit registers, like eax or r10d, not rax or r10.)

If you have a small known constant in any register to start with, you can use

lea   eax, [rcx+1]    ; 3 bytes: opcode + ModRM + disp8

disp8 can encode displacements from -128 to +127.

If you have an odd number in eax, and eax, 1 is also 3 bytes.

In 32-bit code, inc eax only takes one byte, but those inc/dec opcodes were repurposed as REX prefixes for AMD64. So xor eax,eax / inc eax is 4 bytes in x86-64 code, but only 3 in 32-bit code. Still, if saving 1 byte over a mov eax,1 is sufficient, and LEA or AND won't work, this is more efficient than push/pop.

answered Oct 16 '22 11:10

Peter Cordes

Related questions
                            
                                What does this assembly code do? (TEST,XOR,JNZ)
                            
                                GCC INLINE ASSEMBLY Won't Let Me Overwrite $esp
                            
                                There must be a really fast way to calculate this bitwise expression?
                            
                                What is data type and how is it implemented?
                            
                                How do I correctly use the mod operator in MIPS?
                            
                                Literals VS Immediate Operands
                            
                                Can't clear entire screen in 16-bit real mode Assembly
                            
                                Adding two vector in assembly x86_64 with AVX2 plus technical clarifications
                            
                                In NASM labels next to each other in memory are printing both strings instead of first one
                            
                                Comma, colon, decorator or end of line expected after operand
                            
                                Unsigned int to unsigned long long well defined?
                            
                                Why do 32-bit applications work on 64-bit x86 CPUs?
                            
                                Error 13: Invalid or unsupported executable while booting simple kernel in grub with string literal
                            
                                Regarding cmp / jg, jle, etc in AT&T syntax assembly
                            
                                Why does ARM distinguish between SDIV and UDIV but not with ADD, SUB and MUL?
                            
                                How does assembler compute segment and offset for symbol addresses?
                            
                                Counting character frequencies in an array of characters - x86 Assembly
                            
                                NASM compiling x86_64 ASM label addresses off by 256 bytes in Mach-O when using multiple db declarations?
                            
                                masm error A2075: jump destination too far : by 30 bytes
                            
                                Can atomic instructions straddle cache lines?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Shortest Intel x86-64 opcode for rax=1?

Tags:

assembly

x86-64

intel

micro-optimization

code-size

kubuzetto

People also ask

2 Answers

gsg

Peter Cordes

Recent Activity

Donate For Us