I'm working on optimization techniques performed by the .NET Native compiler. I've created a sample loop: <pre class="prettyprint"><code> for (int i = 0; i < 100; i++) { Function(); } </code></pre> And I've compiled it with Native. Then I disassembled the result <code>.dll</code> file with machine code inside in IDA. As the result, I have: <img src="https://i.stack.imgur.com/3L0Nr.png" alt="IDA output"> (I've removed a few unnecessary lines, so don't worry that address lines are inconsistent) I understand that <code>add esi, 0FFFFFFFFh</code> means really <code>subtract one from esi and alter Zero Flag if needed</code>, so we can jump to the beginning if zero hasn't been reached yet. What I don't understand is why did the compiler reverse the loop? I came to the conclusion that <pre class="prettyprint"><code>LOOP: add esi, 0FFFFFFFFh jnz LOOP </code></pre> is just faster than for example <pre class="prettyprint"><code>LOOP: inc esi cmp esi, 064h jl LOOP </code></pre> But is it really because of that and is the speed difference really significant?

Your conclusion is correct: inverted cycle will target <code>0</code> (cycle will ends when register value reach <code>0</code>), so that <code>Add</code> will set zero flag used in conditional branch. This way you don't need dedicated <code>Cmp</code> which leads to: 1) size optimization 2) it's also faster (conclusion from compiler programmers decision and another answer). That's pretty common assembler trick to write loop targeting <code>0</code>. I am surprised you understand assembler, but don't know (asking) about it.

Why does .NET Native compile loop in reverse order?

Tags:

c#

x86

assembly

micro-optimization

.net-native

I'm working on optimization techniques performed by the .NET Native compiler. I've created a sample loop:

        for (int i = 0; i < 100; i++)
        {
            Function();
        }

And I've compiled it with Native. Then I disassembled the result .dll file with machine code inside in IDA. As the result, I have:

IDA output

(I've removed a few unnecessary lines, so don't worry that address lines are inconsistent)

I understand that add esi, 0FFFFFFFFh means really subtract one from esi and alter Zero Flag if needed, so we can jump to the beginning if zero hasn't been reached yet.

What I don't understand is why did the compiler reverse the loop?

I came to the conclusion that

LOOP:
add esi, 0FFFFFFFFh
jnz LOOP

is just faster than for example

LOOP:
inc esi
cmp esi, 064h
jl LOOP

But is it really because of that and is the speed difference really significant?

669

asked Apr 05 '16 15:04

Kamil T

Video Answer

2 Answers

inc might be slower than add because of the partial flag update. Moreover add affects the zero flag so you don't need to use another cmp instruction. Just jump directly.

This is one famous type of loop optimization

reversal: Loop reversal reverses the order in which values are assigned to the index variable. This is a subtle optimization which can help eliminate dependencies and thus enable other optimizations. Also, certain architectures utilize looping constructs at Assembly language level that count in a single direction only (e.g. decrement-jump-if-not-zero (DJNZ)).

Is it faster to count down than it is to count up?
GCC Loop optimization

You can see the result for other compilers here.

168

answered Oct 17 '22 11:10

phuclv

Your conclusion is correct: inverted cycle will target 0 (cycle will ends when register value reach 0), so that Add will set zero flag used in conditional branch.

This way you don't need dedicated Cmp which leads to: 1) size optimization 2) it's also faster (conclusion from compiler programmers decision and another answer).

That's pretty common assembler trick to write loop targeting 0. I am surprised you understand assembler, but don't know (asking) about it.

answered Oct 17 '22 11:10

Sinatr

Related questions
                            
                                How to forward to another object when using .NET Moq?
                            
                                Contains / in array in Azure Search (Preview)
                            
                                Changing the default text color of a Picker control in Xamarin Forms for Windows Phone 8.1
                            
                                AssemblyInfo and custom attributes
                            
                                RazorEngine 3.7.7 - Error when compiling a cached template
                            
                                Why does Parallel.For execute the WinForms message pump, and how to prevent it?
                            
                                Longpress in UWP
                            
                                Sitecore's Field.HasValue returning false, even when there is a Value?
                            
                                How do a save Unity3d Mesh to file?
                            
                                How to set maximum length of separated word of string property C# EF
                            
                                Malformed Reference Element
                            
                                The 2D array won't transponse c#
                            
                                Newtonsoft update JObject from JSON path?
                            
                                How do I know if a class is a wrapper for an unmanaged resource
                            
                                How can I pin a default live tile programmatically?
                            
                                HttpContentExtensions.ReadAsAsync Error
                            
                                Best way in ASP.NET of configuring rows in a database to delete after a certain time
                            
                                c# blocking code in async method
                            
                                Version dependent Json deserialization
                            
                                502 bad gateway - POST from C# but works fine in fiddler

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With