I compiled the code below with the VC++ 2010 compiler: <pre class="prettyprint"><code>__declspec(dllexport) unsigned int __cdecl __mm_getcsr(void) { return _mm_getcsr(); } </code></pre> and the generated code was: <pre class="prettyprint"><code>push ECX stmxcsr [ESP] mov EAX, [ESP] pop ECX retn </code></pre> Why is there a <code>push ECX</code>/<code>pop ECX</code> instruction pair?

The compiler is making room on the stack to store the MXCSR. It could have equally well done this: <pre class="prettyprint"><code>sub esp,4 stmxcsr [ESP] mov EAX, [ESP] add esp,4 retn </code></pre> But "push ecx" is probably shorter or faster.

The push here is used to allocate 4 bytes of temporary space. <code>[ESP]</code> would normally point to the pushed return address, which we cannot overwrite. <code>ECX</code> will be overwritten here, however, <code>ECX</code> is a probably a volatile register in the ABI you're targeting, so functions don't have to preserve <code>ECX</code>. The reason a push/pop is used here is a space (and possibly speed) optimization.

Why is the compiler generating a push/pop instruction pair?

Tags:

c

x86

assembly

sse

I compiled the code below with the VC++ 2010 compiler:

__declspec(dllexport)
unsigned int __cdecl __mm_getcsr(void) { return _mm_getcsr(); }

and the generated code was:

push ECX
    stmxcsr [ESP]
    mov EAX, [ESP]
pop ECX
retn

Why is there a push ECX/pop ECX instruction pair?

933

asked Jan 14 '12 15:01

user541686

2 Answers

The compiler is making room on the stack to store the MXCSR. It could have equally well done this:

sub esp,4
stmxcsr [ESP]
mov EAX, [ESP]
add esp,4
retn

But "push ecx" is probably shorter or faster.

answered Nov 15 '22 06:11

Robᵩ

The push here is used to allocate 4 bytes of temporary space. [ESP] would normally point to the pushed return address, which we cannot overwrite.

ECX will be overwritten here, however, ECX is a probably a volatile register in the ABI you're targeting, so functions don't have to preserve ECX.

The reason a push/pop is used here is a space (and possibly speed) optimization.

answered Nov 15 '22 07:11

Maister

Related questions
                            
                                Unions that contain a"type" member
                            
                                Pass many pieces of data from Python to C program
                            
                                a and &a differs for an array passed as a function parameter in C
                            
                                Does Android not really have wchar_t?
                            
                                Calculating scroll inertia/momentum?
                            
                                Realloc Vs Linked List Scanning
                            
                                Create char from char* that includes escape character
                            
                                What is program break? Where does it start from,0x00?
                            
                                What is the historical context for long and int often being the same size?
                            
                                Calling Lua function
                            
                                Mathematica and C/C++: Exchanging Data
                            
                                What guarantees about low order bits does malloc provide?
                            
                                What corner cases must we consider when parsing $PATH on Linux?
                            
                                Linux automatically restarting application on crash - Daemons
                            
                                How to recover from I2C bus collision BCLIF?
                            
                                What is OR EQUAL
                            
                                Is there Path Edit Control in Win32?
                            
                                In assembler, why does the use of registers differ between addition and subtraction?
                            
                                Frequency Modulation Synthesis Algorithm
                            
                                Odd behavior when converting C strings to/from doubles

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With