Suppose I have the following declared: <pre class="prettyprint"><code>section .bss buffer resb 1 </code></pre> And these instructions follow in <code>section .text</code>: <pre class="prettyprint"><code>mov al, 5 ; mov-immediate mov [buffer], al ; store mov bl, [buffer] ; load mov cl, buffer ; mov-immediate? </code></pre> Am I correct in understanding that bl will contain the value 5, and cl will contain the memory address of the variable <code>buffer</code>? I am confused about the differences between <ul> <li>moving an immediate into a register,</li> <li>moving a register into an immediate (what goes in, the data or the address?) and</li> <li>moving an immediate into a register without the brackets <ul> <li>For example, <code>mov cl, buffer</code> vs <code>mov cl, [buffer]</code> </li> </ul> </li> </ul> <hr> UPDATE: After reading the responses, I suppose the following summary is accurate: <ul> <li> <code>mov edi, array</code> puts the memory address of the zeroth array index in <code>edi</code>. i.e. the label address.</li> <li> <code>mov byte [edi], 3</code> puts the VALUE 3 into the zeroth index of the array</li> <li>after <code>add edi, 3</code>, <code>edi</code> now contains the memory address of the 3rd index of the array</li> <li> <code>mov al, [array]</code> loads the DATA at the zeroth index into <code>al</code>.</li> <li> <code>mov al, [array+3]</code> loads the DATA at the third index into <code>al</code>.</li> <li> <code>mov [al], [array]</code> is invalid because x86 can't encode 2 explicit memory operands, and because <code>al</code> is only 8 bits and can't be used even in a 16-bit addressing mode. Referencing the contents of a memory location. (x86 addressing modes) </li> <li> <code>mov array, 3</code> is invalid, because you can't say "Hey, I don't like the offset at which <code>array</code> is stored, so I'll call it 3". An immediate can only be a source operand.</li> <li> <code>mov byte [array], 3</code> puts the value 3 into the zeroth index (first byte) of the array. The <code>byte</code> specifier is needed to avoid ambiguity between byte/word/dword for instructions with memory, immediate operands. That would be an assemble-time error (ambiguous operand size) otherwise.</li> </ul> Please mention if any of these is false. (editor's note: I fixed syntax errors / ambiguities so the valid ones actually are valid NASM syntax. And linked other Q&As for details)

The square brackets essentially work like a dereference operator (e.g., like <code>*</code> in C). So, something like <pre class="prettyprint"><code>mov REG, x </code></pre> moves the value of <code>x</code> into <code>REG</code>, whereas <pre class="prettyprint"><code>mov REG, [x] </code></pre> moves the value of the memory location where <code>x</code> points to into <code>REG</code>. Note that if <code>x</code> is a label, its value is the address of that label. As for you're question: <blockquote> Am I correct in understanding that bl will contain the value 5, and cl will contain the memory address of the variable buffer? </blockquote> Yes, you are correct. But beware that, since <code>CL</code> is only 8 bits wide, it will only contain the least significant byte of the address of <code>buffer</code>.

Indeed, your thought is correct.That is, bl will contain 5 and cl the memory address of buffer(in fact the label buffer is a memory address itself). <hr> Now, let me explain the differences between the operations you mentioned: <ul> <li>moving an immediate into a register can be done using <code>mov reg,imm</code>.What may be confusing is that labels e.g buffer are immediate values themselves that contain an address.</li> <li>You cannot really move a register into an immediate, since immediate values are constants, like <code>2</code> or <code>FF1Ah</code>.What you can do is move a register to the place where the constant points to.You can do it like <code>mov [const], reg</code> .</li> <li>You can also use indirect addressing like <code>mov reg2,[reg1]</code> provided reg1 points to a valid location, and it will transfer the value pointed by reg1 to reg2.</li> </ul> <hr> So, <code>mov cl, buffer</code> will move the address of buffer to cl(which may or may not give the correct address, since cl is only one byte long) , whereas <code>mov cl, [buffer]</code> will get the actual value. <h3>Summary</h3> <ul> <li>When you use [a], then you refer to the value at the place where a points to.For example, if a is <code>F5B1</code>, then [a] refers to the address F5B1 in RAM.</li> <li>Labels are addresses,i.e values like <code>F5B1</code>.</li> <li>Values stored in registers do not have to be referenced to as [reg] because registers do not have addresses.In fact, registers can be thought of as immediate values.</li> </ul>

You are getting the idea. However, there are a few details worth bearing in mind: <ol> <li>Addresses can and usually are greater than what 8 bits can hold (<code>cl</code> is 8-bit, <code>cx</code> is 16-bit, <code>ecx</code> is 32-bit, <code>rcx</code> is 64-bit). So, <code>cl</code> is likely going to be unequal to the address of the variable <code>buffer</code>. It'll only have the least significant 8 bits of the address.</li> <li>If there are interrupt routines or threads that can preempt the above code and/or access <code>buffer</code>, the value in <code>bl</code> may differ from 5. Broken interrupt routines may actually affect any register when they fail to preserve register values.</li> </ol>

Basic use of immediates vs. square brackets in YASM/NASM x86 assembly

Tags:

x86

assembly

memory-address

nasm

yasm

Suppose I have the following declared:

section .bss
buffer    resb     1

And these instructions follow in section .text:

mov    al, 5                    ; mov-immediate
mov    [buffer], al             ; store
mov    bl, [buffer]             ; load
mov    cl, buffer               ; mov-immediate?

Am I correct in understanding that bl will contain the value 5, and cl will contain the memory address of the variable buffer?

I am confused about the differences between

moving an immediate into a register,
moving a register into an immediate (what goes in, the data or the address?) and
moving an immediate into a register without the brackets
- For example, mov cl, buffer vs mov cl, [buffer]

UPDATE: After reading the responses, I suppose the following summary is accurate:

mov edi, array puts the memory address of the zeroth array index in edi. i.e. the label address.
mov byte [edi], 3 puts the VALUE 3 into the zeroth index of the array
after add edi, 3, edi now contains the memory address of the 3rd index of the array
mov al, [array] loads the DATA at the zeroth index into al.
mov al, [array+3] loads the DATA at the third index into al.
mov [al], [array] is invalid because x86 can't encode 2 explicit memory operands, and because al is only 8 bits and can't be used even in a 16-bit addressing mode. Referencing the contents of a memory location. (x86 addressing modes)
mov array, 3 is invalid, because you can't say "Hey, I don't like the offset at which array is stored, so I'll call it 3". An immediate can only be a source operand.
mov byte [array], 3 puts the value 3 into the zeroth index (first byte) of the array. The byte specifier is needed to avoid ambiguity between byte/word/dword for instructions with memory, immediate operands. That would be an assemble-time error (ambiguous operand size) otherwise.

Please mention if any of these is false. (editor's note: I fixed syntax errors / ambiguities so the valid ones actually are valid NASM syntax. And linked other Q&As for details)

400

asked Apr 28 '12 10:04

InvalidBrainException

3 Answers

The square brackets essentially work like a dereference operator (e.g., like * in C).

So, something like

mov REG, x

moves the value of x into REG, whereas

mov REG, [x]

moves the value of the memory location where x points to into REG. Note that if x is a label, its value is the address of that label.

As for you're question:

Am I correct in understanding that bl will contain the value 5, and cl will contain the memory address of the variable buffer?

Yes, you are correct. But beware that, since CL is only 8 bits wide, it will only contain the least significant byte of the address of buffer.

answered Oct 18 '22 22:10

mtvec

Indeed, your thought is correct.That is, bl will contain 5 and cl the memory address of buffer(in fact the label buffer is a memory address itself).

Now, let me explain the differences between the operations you mentioned:

moving an immediate into a register can be done using mov reg,imm.What may be confusing is that labels e.g buffer are immediate values themselves that contain an address.
You cannot really move a register into an immediate, since immediate values are constants, like 2 or FF1Ah.What you can do is move a register to the place where the constant points to.You can do it like mov [const], reg .
You can also use indirect addressing like mov reg2,[reg1] provided reg1 points to a valid location, and it will transfer the value pointed by reg1 to reg2.

So, mov cl, buffer will move the address of buffer to cl(which may or may not give the correct address, since cl is only one byte long) , whereas mov cl, [buffer] will get the actual value.

Summary

When you use [a], then you refer to the value at the place where a points to.For example, if a is F5B1, then [a] refers to the address F5B1 in RAM.
Labels are addresses,i.e values like F5B1.
Values stored in registers do not have to be referenced to as [reg] because registers do not have addresses.In fact, registers can be thought of as immediate values.

answered Oct 18 '22 22:10

byrondrossos

You are getting the idea. However, there are a few details worth bearing in mind:

Addresses can and usually are greater than what 8 bits can hold (cl is 8-bit, cx is 16-bit, ecx is 32-bit, rcx is 64-bit). So, cl is likely going to be unequal to the address of the variable buffer. It'll only have the least significant 8 bits of the address.
If there are interrupt routines or threads that can preempt the above code and/or access buffer, the value in bl may differ from 5. Broken interrupt routines may actually affect any register when they fail to preserve register values.

answered Oct 18 '22 22:10

Alexey Frunze

Related questions
                            
                                'Correct' unsigned integer comparison
                            
                                Some x86 ASM Reference/Tutorials? [closed]
                            
                                Useless test instruction?
                            
                                Adding leading underscores to assembly symbols with GCC on Win32?
                            
                                Why does division by 3 require a rightshift (and other oddities) on x86?
                            
                                What's the relationship between assembly language and machine language?
                            
                                Why isn't pass struct by reference a common optimization?
                            
                                Alloca implementation
                            
                                Reading program counter directly
                            
                                Why does g++ pull computations into a hot loop?
                            
                                Why are AND instructions generated?
                            
                                What does bx lr do in ARM assembly language?
                            
                                Base pointer and stack pointer
                            
                                Why does Visual Studio use xchg ax,ax
                            
                                What is the 0x10 in the "leal 0x10(%ebx), %eax" x86 assembly instruction?
                            
                                How to: pow(real, real) in x86
                            
                                x86_64 ASM - maximum bytes for an instruction?
                            
                                What functions does gcc add to the linux ELF?
                            
                                Big differences in GCC code generation when compiling as C++ vs C
                            
                                x86 LOCK question on multi-core CPUs

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Basic use of immediates vs. square brackets in YASM/NASM x86 assembly

Tags:

x86

assembly

memory-address

nasm

yasm

InvalidBrainException

People also ask

3 Answers

mtvec

Summary

byrondrossos

Alexey Frunze

Recent Activity

Donate For Us