I'm looking through an Intel Instruction Set manual, and it looks like there are 2 different forms of ADC
that would match/encode ADC EAX, ECX
as follows:
ADC r/m32, r32 (11 /r , which encodes to 11C8)
or
ADC r32, r/m32 (13 /r, which encodes to 13C1)
My question is (given I did the math correctly), are 11C8
and 13C1
equivalent? What are the factors that an assembler would consider in selecting one encoding over another? The question is from a perspective of implementing an assembler, so the question is in general, not about this particular hypothetical instruction.
If it's a lengthy answer, please point me in a right direction as my attempts at googling it failed.
This is redundancy of instruction encoding. Any architecture that use multiple parameters in the instruction has this.
Think of a RISC architecture that have add rx, ry, rz
that assigns the sum of ry and rz into rx then you can encode add rx, ry, rz
or add rx, rz, ry
, they'll all be equivalent.
In x86 we (normally) have only 2 parameters for each instruction but you can select the direction between them since you can store to or read from memory. If you don't use memory then you can choose the direction between the 2 registers, so there are 2 encoding ways
You can use this to identify some compilers/assemblers. For some assemblers you can choose which encoding to use. In GAS you can use .s
suffix to force it to emit the alternate encoding
10 de adcb %bl,%dh
12 f3 adcb.s %bl,%dh
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With