x > -1 vs x >= 0, is there a performance difference

Tags: java, c++, operators, micro-optimization, premature-optimization

I have heard a teacher drop this once, and it has been bugging me ever since. Let's say we want to check if the integer x is bigger than or equal to 0. There are two ways to check this:

    if (x > -1) {
        // do stuff
    }

and

    if (x >= 0) {
        // do stuff
    }

According to this teacher, > would be slightly faster than >=. In this case it was Java, but according to him this also applied to C, C++ and other languages. Is there any truth to this statement?

304

asked Jan 25 '13 11:01

Cheiron

2 Answers

There's no difference in any real-world sense.

Let's take a look at some code generated by various compilers for various targets.

I'm assuming a signed int operation (which seems to be the intent of the OP).
I've limited my survey to C and to compilers that I have readily at hand (admittedly a pretty small sample: GCC, MSVC and IAR).
Basic optimizations are enabled (-O2 for GCC, /Ox for MSVC, -Oh for IAR).
The following module is used:

    void my_puts(char const* s);

    void cmp_gt(int x)
    {
        if (x > -1) {
            my_puts("non-negative");
        }
        else {
            my_puts("negative");
        }
    }

    void cmp_gte(int x)
    {
        if (x >= 0) {
            my_puts("non-negative");
        }
        else {
            my_puts("negative");
        }
    }

And here's what each of them produced for the comparison operations:

MSVC 11 targeting ARM:

    // if (x > -1) {...
    00000        |cmp_gt| PROC
      00000 f1b0 3fff    cmp         r0,#0xFFFFFFFF
      00004 dd05         ble         |$LN2@cmp_gt|

    // if (x >= 0) {...
      00024      |cmp_gte| PROC
      00024 2800         cmp         r0,#0
      00026 db05         blt         |$LN2@cmp_gte|

MSVC 11 targeting x64:

    // if (x > -1) {...
    cmp_gt  PROC
      00000 83 f9 ff     cmp     ecx, -1
      00003 48 8d 0d 00 00                  // speculative load of argument to my_puts()
        00 00        lea     rcx, OFFSET FLAT:$SG1359
      0000a 7f 07        jg      SHORT $LN5@cmp_gt

    // if (x >= 0) {...
    cmp_gte PROC
      00000 85 c9        test    ecx, ecx
      00002 48 8d 0d 00 00                  // speculative load of argument to my_puts()
        00 00        lea     rcx, OFFSET FLAT:$SG1367
      00009 79 07        jns     SHORT $LN5@cmp_gte

MSVC 11 targeting x86:

    // if (x > -1) {...
    _cmp_gt PROC
      00000 83 7c 24 04 ff   cmp     DWORD PTR _x$[esp-4], -1
      00005 7e 0d            jle     SHORT $LN2@cmp_gt

    // if (x >= 0) {...
    _cmp_gte PROC
      00000 83 7c 24 04 00   cmp     DWORD PTR _x$[esp-4], 0
      00005 7c 0d            jl      SHORT $LN2@cmp_gte

GCC 4.6.1 targeting x64:

    // if (x > -1) {...
    cmp_gt:
        .seh_endprologue
        test    ecx, ecx
        js      .L2

    // if (x >= 0) {...
    cmp_gte:
        .seh_endprologue
        test    ecx, ecx
        js      .L5

GCC 4.6.1 targeting x86:

    // if (x > -1) {...
    _cmp_gt:
        mov     eax, DWORD PTR [esp+4]
        test    eax, eax
        js      L2

    // if (x >= 0) {...
    _cmp_gte:
        mov     edx, DWORD PTR [esp+4]
        test    edx, edx
        js      L5

GCC 4.4.1 targeting ARM:

    // if (x > -1) {...
    cmp_gt:
        .fnstart
    .LFB0:
        cmp r0, #0
        blt .L8

    // if (x >= 0) {...
    cmp_gte:
        .fnstart
    .LFB1:
        cmp r0, #0
        blt .L2

IAR 5.20 targeting an ARM Cortex-M3:

    // if (x > -1) {...
    cmp_gt:
    80B5 PUSH     {R7,LR}
    .... LDR.N    R1,??DataTable1  ;; `?<Constant "non-negative">`
    0028 CMP      R0,#+0
    01D4 BMI.N    ??cmp_gt_0

    // if (x >= 0) {...
    cmp_gte:
    80B5 PUSH     {R7,LR}
    .... LDR.N    R1,??DataTable1  ;; `?<Constant "non-negative">`
    0028 CMP      R0,#+0
    01D4 BMI.N    ??cmp_gte_0

If you're still with me, here are the differences of any note between evaluating (x > -1) and (x >= 0) that show up:

MSVC targeting ARM uses cmp r0,#0xFFFFFFFF for (x > -1) vs cmp r0,#0 for (x >= 0). The first instruction's encoding is two bytes longer (4 bytes vs 2). I suppose that may introduce some additional time, so we'll call this an advantage for (x >= 0).
MSVC targeting x64 uses cmp ecx, -1 for (x > -1) vs test ecx, ecx for (x >= 0). The first instruction's encoding is one byte longer (3 bytes vs 2). I suppose that may introduce some additional time, so we'll call this an advantage for (x >= 0). (MSVC targeting x86 emits a cmp of the same length for both forms, differing only in the immediate and the branch condition.)

Note that GCC and IAR generated identical machine code for the two kinds of comparison (with the possible exception of which register was used). So according to this survey, it appears that (x >= 0) has an ever-so-slight chance of being 'faster'. But whatever advantage the minimally shorter instruction encoding might have (and I stress might have) will certainly be completely overshadowed by other factors.
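GCC and IAR can emit the same code for both forms because, for a signed int, (x > -1) and (x >= 0) are true for exactly the same values, so the compiler is free to pick whichever encoding it prefers. A minimal sketch that checks this on a few boundary values follows; the helper names are made up for illustration and are not part of the module above.

```c
#include <limits.h>
#include <stddef.h>

/* Hypothetical helpers, named for this sketch only. */
static int is_nonneg_gt(int x)  { return x > -1; }
static int is_nonneg_gte(int x) { return x >= 0; }

/* Returns how many of the sample values make the two forms disagree. */
static int count_mismatches(void)
{
    const int samples[] = {
        INT_MIN, INT_MIN + 1, -2, -1, 0, 1, 2, INT_MAX - 1, INT_MAX
    };
    int mismatches = 0;

    for (size_t i = 0; i < sizeof samples / sizeof samples[0]; ++i) {
        if (is_nonneg_gt(samples[i]) != is_nonneg_gte(samples[i])) {
            ++mismatches;
        }
    }
    return mismatches;
}
```

Calling count_mismatches() from a test or from main should return 0, since no int lies strictly between -1 and 0.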

I'd be surprised if you found anything different for the JIT-compiled output of Java or C#. I doubt you'd find any difference of note even for a very small target like an 8-bit AVR.

In short, don't worry about this micro-optimization. I think my write-up here has already spent more time than will be spent by any difference in the performance of these expressions accumulated across all the CPUs executing them in my lifetime. If you have the capability to measure the difference in performance, please apply your efforts to something more important like studying the behavior of sub-atomic particles or something.

151

answered Sep 18 '22 14:09

Michael Burr

It is very much dependent on the underlying architecture, but any difference will be minuscule.

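If anything, I'd expect (x >= 0) to be slightly faster, as comparison with 0 comes for free on some instruction sets (such as ARM), where flag-setting arithmetic instructions already record whether their result is negative or zero.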

Of course, any sensible compiler will choose the best implementation regardless of which variant is in your source.
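One way to see why a test against zero tends to be cheap: for a signed int, "non-negative" depends only on the sign bit, which is what the test/js and cmp #0/bmi sequences in the listings above branch on. The sketch below spells that out in C; sign_bit_clear is a hypothetical name, and the code assumes that int has no padding bits.

```c
#include <limits.h>

/* Hypothetical helper: returns 1 when x is non-negative, looking only at
   the most significant bit. Assumes int has no padding bits, so that
   sizeof(int) * CHAR_BIT is its full width. */
static int sign_bit_clear(int x)
{
    const unsigned shift = (unsigned)(sizeof(int) * CHAR_BIT - 1);

    /* Converting to unsigned is well defined for negative values and
       leaves the top bit set exactly when x < 0. */
    return ((unsigned)x >> shift) == 0u;
}
```

This is for illustration only. In real code, writing (x >= 0) and leaving instruction selection to the compiler is the better choice.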

answered Sep 20 '22 14:09

Graham Borland
