The x86-64 instruction set adds more registers and other improvements to help streamline executable code. However, in many applications the increased pointer size is a burden: the extra, unused bytes in every pointer clog up the cache and can even push a working set out of RAM. GCC, for example, builds with the -m32 flag, and I assume this is the reason.
It's possible to load a 32-bit value and treat it as a pointer. This doesn't require extra instructions: just load or compute the 32 bits and load from the resulting address. The trick won't be portable, though, as platforms have different memory maps; on Mac OS X, for example, the entire low 4 GiB of address space is reserved. Still, for one program I wrote, hackishly adding 0x100000000L to the 32-bit "addresses" before use improved performance greatly over either true 64-bit addresses or compiling with -m32.
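For illustration, here is a minimal sketch of that hack. kBase, compact() and expand() are names invented for the example, and the scheme only works if the platform really maps all of the program's data into [0x100000000, 0x1FFFFFFFF]:

```cpp
#include <cstdint>

// Sketch of the hack: store pointers as 32-bit values and rebase them
// before dereferencing. Valid only while every allocation lands in the
// 4 GiB window starting at kBase (an assumption, not a guarantee).
constexpr std::uintptr_t kBase = 0x100000000UL;

template <typename T>
T* expand(std::uint32_t c) {
    return reinterpret_cast<T*>(kBase + c);  // one add, then an ordinary load
}

template <typename T>
std::uint32_t compact(T* p) {
    // Drop the high bits, which are known to equal kBase's.
    return static_cast<std::uint32_t>(reinterpret_cast<std::uintptr_t>(p) - kBase);
}
```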
Is there any fundamental impediment to a 32-bit x86-64 platform? I suppose supporting such a chimera would add complexity to any operating system, and anyone wanting that last 20% should just Make it Work™, but it still seems that this would be the best fit for a variety of computationally intensive programs.
The term "x86" came into being because the names of several successors to Intel's 8086 processor end in "86", including the 80186, 80286, 80386 and 80486 processors.
Today, the term x86 is used generically to refer to any 32-bit processor compatible with the x86 instruction set. x86 microprocessors power almost every kind of computer, from laptops, desktops and servers to supercomputers.
In technical terms, x86 and x64 refer to a processor family and the instruction set its members share; neither name says anything about data sizes in particular. The term x86 covers any instruction set derived from that of the Intel 8086 processor.
The x86 moniker comes from the 32-bit instruction set: all x86 processors (the leading "80" is usually dropped from their names) run the same 32-bit instruction set and are hence all compatible. x86 has therefore become a de facto name for that set (and hence for 32-bit). AMD's original 64-bit extension of the x86 set was called AMD64.
There is an ABI called "x32" for Linux in development. It's a mix between x86_64 and ia32, similar to what you describe: a 32-bit address space combined with the full 64-bit register set. It needs a custom kernel, binutils and gcc. Some SPEC runs indicate a performance improvement of about 30% in some benchmarks. See further information at https://sites.google.com/site/x32abi/
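If you have an x32-enabled toolchain (GCC exposes the ABI through its -mx32 flag), a quick sanity check is to print the type sizes; this is a minimal sketch assuming such a toolchain is installed:

```cpp
// Build with an x32-capable toolchain, e.g.: g++ -mx32 sizes.cpp
// Under x32, pointers and long are 4 bytes while the code still uses
// the 64-bit register set; under plain x86_64 both print as 8.
#include <iostream>

int main() {
    std::cout << "sizeof(void*) = " << sizeof(void*) << '\n'
              << "sizeof(long)  = " << sizeof(long)  << '\n';
}
```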
As Mysticial commented above, ICC has the -auto-ilp32 / /Qauto-ilp32 option to use 32-bit pointers in 64-bit mode:
Instructs the compiler to analyze the program to determine if there are 64-bit pointers that can be safely shrunk into 32-bit pointers and if there are 64-bit longs (on Linux* systems) that can be safely shrunk into 32-bit longs.
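A sketch of the kind of code that analysis targets; the compile lines simply mirror the two option spellings above, and whether the pointers actually get shrunk depends on ICC's own whole-program analysis:

```cpp
// Linux:   icc -auto-ilp32 -O2 sum.cpp
// Windows: icl /Qauto-ilp32 /O2 sum.cpp
// If the compiler can prove 'data' and 'p' never need more than 32
// bits of address, it may shrink them; the source itself is unchanged.
#include <cstdlib>

long sum(std::size_t n) {
    int* data = static_cast<int*>(std::malloc(n * sizeof(int)));
    if (!data) return 0;
    for (std::size_t i = 0; i < n; ++i) data[i] = static_cast<int>(i);
    long total = 0;
    for (int* p = data; p != data + n; ++p) total += *p;
    std::free(data);
    return total;
}
```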
On Windows there's no x32 ABI like on Linux, but you can still use 32-bit pointers by disabling the /LARGEADDRESSAWARE flag (linking with /LARGEADDRESSAWARE:NO), which is enabled for 64-bit binaries by default:
By default, 64-bit Microsoft Windows-based applications have a user-mode address space of several terabytes. For precise values, see Memory Limits for Windows and Windows Server Releases. However, applications can specify that the system should allocate all memory for the application below 2 gigabytes. This feature is beneficial for 64-bit applications if the following conditions are true:
- A 2 GB address space is sufficient.
- The code has many pointer truncation warnings.
- Pointers and integers are freely mixed.
- The code has polymorphism using 32-bit data types.
All pointers are still 64-bit pointers, but the system ensures that every memory allocation occurs below the 2 GB limit, so that if the application truncates a pointer, no significant data is lost. Pointers can be truncated to 32-bit values, then extended to 64-bit values by either sign extension or zero extension.
Virtual Address Space
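As a sketch of what that guarantee buys, assuming the binary was linked with /LARGEADDRESSAWARE:NO so every allocation sits below 2 GB:

```cpp
#include <cstdint>

// Below 2 GB the upper 33 bits of a pointer are zero, so truncating to
// 32 bits loses nothing, and sign extension and zero extension of the
// truncated value agree; either one restores the pointer exactly.
std::uint32_t truncate_ptr(void* p) {
    return static_cast<std::uint32_t>(reinterpret_cast<std::uintptr_t>(p));
}

void* restore_zero(std::uint32_t v) {
    return reinterpret_cast<void*>(static_cast<std::uint64_t>(v));  // zero-extend
}

void* restore_sign(std::int32_t v) {
    return reinterpret_cast<void*>(static_cast<std::int64_t>(v));   // sign-extend
}
```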
Of course there's no direct compiler support, so you'll need to deal with the pointers manually every time you store one to memory or dereference it. The simplest solution is to write a class wrapping a 32-bit pointer to handle that.
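A minimal sketch of such a wrapper (Ptr32 is an assumed name, and it relies on the below-2-GB guarantee above):

```cpp
#include <cstdint>

// Stores a pointer in 4 bytes; conversion back is a plain zero
// extension. Only safe while every pointed-to object lives below 2 GB.
template <typename T>
class Ptr32 {
public:
    Ptr32(T* p = nullptr)
        : bits_(static_cast<std::uint32_t>(reinterpret_cast<std::uintptr_t>(p))) {}

    T* get() const { return reinterpret_cast<T*>(static_cast<std::uintptr_t>(bits_)); }
    T& operator*() const { return *get(); }
    T* operator->() const { return get(); }

private:
    std::uint32_t bits_;  // the low 32 bits are the whole pointer here
};

static_assert(sizeof(Ptr32<int>) == 4, "half the size of a raw pointer");
```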
Google's V8 engine takes a different approach, compressing pointers to 32 bits to save memory as well as improve performance. See the comparison of memory usage and performance here
See also How does the compressed pointer implementation in V8 differ from JVM's compressed Oops?
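The general shape of that technique is base-relative compression: a 32-bit offset from a known heap base is stored instead of a full pointer. The sketch below shows only the idea, under the assumption that all heap objects live inside one 4 GiB region; it is not V8's actual implementation:

```cpp
#include <cassert>
#include <cstdint>

// Assumed: the heap is reserved as a single 4 GiB region ("cage")
// starting at heap_base, so any offset into it fits in 32 bits.
std::uintptr_t heap_base = 0;  // set once, when the region is reserved

std::uint32_t compress(void* p) {
    std::uintptr_t off = reinterpret_cast<std::uintptr_t>(p) - heap_base;
    assert(off < (1ULL << 32) && "object escaped the 4 GiB cage");
    return static_cast<std::uint32_t>(off);
}

void* decompress(std::uint32_t off) {
    return reinterpret_cast<void*>(heap_base + off);  // one add per access
}
```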