How do memory fences work?

I need to understand memory fences in multicore machines. Say I have this code:

Core 1

mov [_x], 1; mov r1, [_y]

Core 2

mov [_y], 1; mov r2, [_x]

The unexpected result without memory fences is that both r1 and r2 can be 0 after execution. In my opinion, to counter that problem, we should put a memory fence in both code sequences, as putting one in only one of them would still not solve the problem. Something like the following:

Core 1

mov [_x], 1; memory_fence; mov r1, [_y]

Core 2

mov [_y], 1; memory_fence; mov r2, [_x]

Is my understanding correct, or am I still missing something? Assume the architecture is x86. Also, can someone tell me how to put memory fences in C++ code?

asked Sep 02 '11 06:09

MetallicPriest

1 Answer

Fences serialize the operations that they fence (loads and stores): no later operation of that kind may start until the fence has executed, and the fence will not execute until all preceding operations have completed. Quoting Intel makes this a little more precise (taken from the MFENCE instruction entry, page 3-628, Vol. 2A, Intel Instruction reference):

This serializing operation guarantees that every load and store instruction that precedes the MFENCE instruction in program order becomes globally visible before any load or store instruction that follows the MFENCE instruction.

A load instruction is considered to become globally visible when the value to be loaded into its destination register is determined.

Using fences in C++ is tricky, as it has traditionally been platform and compiler dependent. C++11 adds a portable option, std::atomic_thread_fence in <atomic>. For x86 using MSVC or ICC, you can use the _mm_lfence, _mm_sfence and _mm_mfence intrinsics for load, store and load + store fencing respectively (note that _mm_lfence and _mm_mfence are SSE2 instructions, while _mm_sfence is SSE).
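As a rough sketch of the portable route, the two-core sequence from the question could be written with C++11 atomics roughly as below. The names x, y and run_once are made up for illustration (x and y stand in for _x and _y), and std::thread stands in for the two cores. With a sequentially consistent fence in both threads, the C++ memory model rules out r1 == 0 and r2 == 0 together; with the fence in only one thread, or in neither, that outcome is allowed.

```cpp
#include <atomic>
#include <thread>
#include <utility>

// Hypothetical stand-ins for _x and _y from the question.
std::atomic<int> x{0};
std::atomic<int> y{0};

// Runs the two-thread sequence once and returns {r1, r2}.
std::pair<int, int> run_once() {
    x.store(0, std::memory_order_relaxed);
    y.store(0, std::memory_order_relaxed);
    int r1 = -1;
    int r2 = -1;

    std::thread core1([&] {
        x.store(1, std::memory_order_relaxed);                // mov [_x], 1
        std::atomic_thread_fence(std::memory_order_seq_cst);  // memory_fence
        r1 = y.load(std::memory_order_relaxed);               // mov r1, [_y]
    });
    std::thread core2([&] {
        y.store(1, std::memory_order_relaxed);                // mov [_y], 1
        std::atomic_thread_fence(std::memory_order_seq_cst);  // memory_fence
        r2 = x.load(std::memory_order_relaxed);               // mov r2, [_x]
    });

    // join() makes the writes to r1 and r2 visible to the caller.
    core1.join();
    core2.join();
    return {r1, r2};
}
```

Calling run_once() in a loop and checking the returned pair is a simple way to experiment. On x86, compilers typically lower the seq_cst fence to MFENCE or an equivalent locked instruction, but that is a compiler detail rather than something the standard promises.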

Note: this assumes an Intel perspective, that is, one using an x86 (32 or 64 bit) or IA64 processor.

answered Oct 14 '22 07:10

Necrolis
