I wonder if it's a good idea to keep using int (which is 32 bits on both x86 and x86_64) in 64-bit programs for variables that have nothing special about them and do not really need to span up to 2^64, like iteration counters, or if it's better to use size_t, which matches the word size of the CPU.
For sure if you keep using int you save half of the memory, and that could matter for the CPU cache, but I don't know whether on a 64-bit machine every 32-bit number has to be extended to 64 bits before any use.

EDIT: I've run some tests with a program of mine (see the self-answer; I still keep janneb's as accepted because it is good). It turns out that there is a significant performance improvement.
A 32-bit system can access 2^32 different memory addresses, i.e. ideally 4 GB of physical memory (with extensions it can address more than 4 GB of RAM). A 64-bit system can access 2^64 different memory addresses, i.e. roughly 18 quintillion bytes of RAM.

For this reason, running a 32-bit application on a 64-bit operating system is not the optimal choice in terms of performance. 32-bit drivers may also be incompatible with a 64-bit operating system; users need to switch to 64-bit drivers when upgrading.
Because the machine is byte-addressable, not bit-addressable; therefore adding 4 to a byte pointer advances it by 4 bytes, i.e. 32 bits.
The difference in performance between 32-bit and 64-bit versions of an application depends greatly on its type and the data types it processes. But in general you may expect a 2-20% performance gain from mere recompilation of a program; this is explained by architectural changes in 64-bit processors [1].
For array indices and pointer arithmetic, types which are of the same size as a pointer (typically, size_t and ptrdiff_t) can be better, as they avoid the need to zero or sign extend the register. Consider
float onei(float *a, int n)
{
return a[n];
}
float oneu(float *a, unsigned n)
{
return a[n];
}
float onep(float *a, ptrdiff_t n)
{
return a[n];
}
float ones(float *a, size_t n)
{
return a[n];
}
With GCC 4.4 -O2 on x86_64 the following asm is generated:
.p2align 4,,15
.globl onei
.type onei, @function
onei:
.LFB3:
.cfi_startproc
movslq %esi,%rsi
movss (%rdi,%rsi,4), %xmm0
ret
.cfi_endproc
.LFE3:
.size onei, .-onei
.p2align 4,,15
.globl oneu
.type oneu, @function
oneu:
.LFB4:
.cfi_startproc
mov %esi, %esi
movss (%rdi,%rsi,4), %xmm0
ret
.cfi_endproc
.LFE4:
.size oneu, .-oneu
.p2align 4,,15
.globl onep
.type onep, @function
onep:
.LFB5:
.cfi_startproc
movss (%rdi,%rsi,4), %xmm0
ret
.cfi_endproc
.LFE5:
.size onep, .-onep
.p2align 4,,15
.globl ones
.type ones, @function
ones:
.LFB6:
.cfi_startproc
movss (%rdi,%rsi,4), %xmm0
ret
.cfi_endproc
.LFE6:
.size ones, .-ones
As can be seen, the versions with the int and unsigned int index (onei and oneu) require an extra instruction (movslq/mov) to sign/zero-extend the register.
As was mentioned in a comment, the downside is that encoding an instruction that uses a 64-bit register takes more space than the 32-bit counterpart, bloating the code size. Secondly, ptrdiff_t/size_t variables need more memory than the equivalent int; if you have arrays of them, that can affect performance much more than the relatively small benefit of avoiding the zero/sign extension. If unsure, profile!
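To show where this matters in practice (a hypothetical summation loop, not code from the answer), an index type that matches the pointer size lets the compiler use the index register directly in the addressing mode:

```c
#include <stddef.h>

/* size_t index: on x86_64 the index register can be used in the
   addressing mode without a per-access sign/zero extension. */
float sum_sizet(const float *a, size_t n)
{
    float s = 0.0f;
    for (size_t i = 0; i < n; ++i)
        s += a[i];
    return s;
}

/* int index: the compiler may need to sign-extend i (or prove the
   loop cannot overflow) before forming each address. */
float sum_int(const float *a, int n)
{
    float s = 0.0f;
    for (int i = 0; i < n; ++i)
        s += a[i];
    return s;
}
```

In a simple loop like this, optimizing compilers can usually hoist the extension out of the loop, so the difference tends to appear in less regular indexing patterns; as the answer says, profile before committing either way.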
In terms of cache, it will save space; the cache handles blocks of data regardless of whether the CPU requested a single address or a complete chunk equal to the cache block size.

So if you are asking whether 32-bit numbers take 64 bits of space inside caches on 64-bit machines, the answer is no; they still take 32 bits for themselves. So in general it will save you some space, especially if you are using large, frequently accessed arrays.
In my personal opinion, a simple int looks simpler than size_t, and most editors will not recognize the size_t type, so syntax highlighting will also be better if you use int. ;)
I am coding a little hard-spheres model. The source can be found on GitHub.

I tried to keep using size_t for variables that are used as array indices, and int where I do other operations not related to word size. The performance improvement was significant: execution time dropped from ~27 to ~24, roughly an 11% improvement.