What does the "lock" instruction mean in x86 assembly?

Tags: c++, x86, assembly, qt

I saw some x86 assembly in Qt's source:

    q_atomic_increment:
        movl 4(%esp), %ecx
        lock
        incl (%ecx)
        mov $0,%eax
        setne %al
        ret

        .align 4,0x90
        .type q_atomic_increment,@function
        .size q_atomic_increment,.-q_atomic_increment

1. From Googling, I learned that the lock instruction causes the CPU to lock the bus, but I don't know when the CPU releases the bus.
2. Looking at the code above as a whole, I don't understand how it implements the add.


asked Jan 17 '12 07:01

gemfield

2 Answers

1. LOCK is not an instruction itself: it is an instruction prefix, which applies to the following instruction. That instruction must be something that does a read-modify-write on memory (INC, XCHG, CMPXCHG etc.); in this case it is the incl (%ecx) instruction, which increments the long word at the address held in the ecx register.

   The LOCK prefix ensures that the CPU has exclusive ownership of the appropriate cache line for the duration of the operation. It also acts as a full memory barrier, so earlier and later loads and stores are not reordered across the locked instruction. This may be achieved by asserting a bus lock, but the CPU will avoid this where possible. If the bus is locked, it is locked only for the duration of the locked instruction.

2. This code copies the address of the variable to be incremented off the stack into the ecx register, then it does lock incl (%ecx) to atomically increment that variable by 1. The next two instructions set the eax register (which holds the return value from the function) to 0 if the new value of the variable is 0, and 1 otherwise. mov $0,%eax is used to clear eax because mov does not modify the flags, so setne still sees the zero flag produced by incl. The operation is an increment, not an add (hence the name).
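To make the return-value convention concrete, here is a minimal C++ sketch of the same behaviour using std::atomic. The function names are hypothetical and this is not Qt's actual implementation. On x86, compilers typically translate fetch_add into a LOCK-prefixed instruction, but that is a code-generation detail and not something the C++ standard promises.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical C++ counterpart of the assembly routine (illustrative name).
// Atomically adds 1 to *counter and returns 0 if the new value is zero,
// 1 otherwise, mirroring `lock incl (%ecx)` followed by `setne %al`.
int atomic_increment_sketch(std::atomic<int>* counter) {
    // fetch_add returns the value held *before* the addition, and the
    // new value is zero exactly when the old value was -1.
    return counter->fetch_add(1) != -1 ? 1 : 0;
}

// Small self-check of the return-value convention.
void check_atomic_increment_sketch() {
    std::atomic<int> counter{-1};
    assert(atomic_increment_sketch(&counter) == 0);  // -1 -> 0
    assert(atomic_increment_sketch(&counter) == 1);  //  0 -> 1
}
```

A "zero or non-zero" result like this is convenient for reference counting, where the caller only needs to know whether the count has reached zero.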


answered Sep 23 '22 01:09

Anthony Williams

What you may be failing to understand is that incrementing a value in memory is not a single step inside the CPU: the old value must be read first, then incremented, then written back.

The LOCK prefix forces the multiple micro-operations that are actually occurring to appear to operate atomically.

If you had 2 threads each trying to increment the same variable, and they both read the same original value at the same time, then they both increment it to the same new value, and they both write out that same value.

Instead of having the variable incremented twice, which is the typical expectation, you end up incrementing the variable once.

The LOCK prefix prevents this from happening.
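The lost update described above can be sketched in C++. This is an illustration under assumptions, not code from the question: all function names are hypothetical. The first function replays the bad interleaving step by step in a single thread, so it stays deterministic and avoids a real data race. The second uses real threads with std::atomic, whose fetch_add is an indivisible read-modify-write.

```cpp
#include <atomic>
#include <cassert>
#include <thread>
#include <vector>

// Deterministic replay of the bad interleaving: two "threads" (A and B)
// each perform read, add, write as separate steps on a plain int.
int replay_lost_update() {
    int shared = 0;
    int a = shared;   // A reads 0
    int b = shared;   // B reads 0 before A has written back
    shared = a + 1;   // A writes 1
    shared = b + 1;   // B also writes 1, overwriting A's update
    return shared;    // one increment has been lost
}

// Real threads incrementing one std::atomic<int>. Each fetch_add is an
// indivisible read-modify-write, so no increment is lost.
int count_with_atomic(int num_threads, int increments_per_thread) {
    std::atomic<int> counter{0};
    std::vector<std::thread> workers;
    for (int t = 0; t < num_threads; ++t) {
        workers.emplace_back([&counter, increments_per_thread] {
            for (int i = 0; i < increments_per_thread; ++i) {
                counter.fetch_add(1);
            }
        });
    }
    for (auto& w : workers) {
        w.join();
    }
    return counter.load();
}

// Self-check: the replayed interleaving loses an update, the atomic does not.
void check_counts() {
    assert(replay_lost_update() == 1);
    assert(count_with_atomic(2, 100000) == 200000);
}
```

Incrementing a plain int from two real threads without synchronisation would be a data race, which is undefined behaviour in C++. That is why the lost update is replayed by hand here instead of being provoked with threads.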

answered Sep 22 '22 01:09

Dan
