To GC or Not To GC

Tags: c++, performance, heap-memory, garbage-collection, d

I've recently seen two really nice and educational language talks:

This first one, by Herb Sutter, presents all the nice and cool features of C++0x, why C++'s future seems brighter than ever, and how M$ is said to be a good guy in this game. The talk revolves around efficiency and how minimizing heap activity very often improves performance.

This other one, by Andrei Alexandrescu, motivates a transition from C/C++ to his new game-changer D. Most of D's stuff seems really well motivated and designed. One thing, however, surprised me: D pushes for garbage collection, and all classes are created solely by reference. Even more confusing, the book The D Programming Language Ref Manual, specifically in the section about Resource Management, states the following, quote:

Garbage collection eliminates the tedious, error prone memory allocation tracking code necessary in C and C++. This not only means much faster development time and lower maintenance costs, but the resulting program frequently runs faster!

This conflicts with Sutter's constant talk about minimizing heap activity. I strongly respect both Sutter's and Alexandrescu's insights, so I feel a bit confused about these two key questions:

1. Doesn't creating class instances solely by reference result in a lot of unnecessary heap activity?
2. In which cases can we use Garbage Collection without sacrificing run-time performance?

336

asked Sep 27 '11 20:09

Nordlöw

2 Answers

To directly answer your two questions:

1. Yes, creating class instances by reference does result in a lot of heap activity, but:

a. In D, you have struct as well as class. A struct has value semantics and can do everything a class can, except polymorphism.

b. Polymorphism and value semantics have never worked well together due to the slicing problem.

c. In D, if you really need to allocate a class instance on the stack in some performance-critical code and don't care about the loss of safety, you can do so without unreasonable hassle via the scoped function.

2. GC can be comparable to or faster than manual memory management if:

a. You still allocate on the stack where possible (as you typically do in D) instead of relying on the heap for everything (as you often do in other GC'd languages).

b. You have a top-of-the-line garbage collector (D's current GC implementation is admittedly somewhat naive, though it has seen some major optimizations in the past few releases, so it's not as bad as it was).

c. You're allocating mostly small objects. If you allocate mostly large arrays and performance ends up being a problem, you may want to switch a few of these to the C heap (you have access to C's malloc and free in D) or, if an array has a scoped lifetime, to some other allocator like RegionAllocator. (RegionAllocator is currently being discussed and refined for eventual inclusion in D's standard library.)

d. You don't care that much about space efficiency. If you make the GC run too frequently to keep the memory footprint ultra-low, performance will suffer.
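For anyone who hasn't run into the slicing problem from point 1b, here is a minimal C++ sketch of it. The types Shape and Circle and the two helper functions are hypothetical names made up for this illustration; they are not from either talk or from D's library.

```cpp
#include <cassert>
#include <string>

// Hypothetical polymorphic base type, for illustration only.
struct Shape {
    virtual ~Shape() = default;
    virtual std::string name() const { return "Shape"; }
};

// Hypothetical derived type with extra state and an override.
struct Circle : Shape {
    double radius = 1.0;
    std::string name() const override { return "Circle"; }
};

// Value semantics: the argument is copied into a plain Shape, so the
// Circle part (its data and its override) is sliced away.
std::string name_by_value(Shape s) { return s.name(); }

// Reference semantics: no copy is made, so dynamic dispatch still
// reaches Circle::name.
std::string name_by_reference(const Shape& s) { return s.name(); }
```

Passing a Circle to name_by_value yields "Shape", while name_by_reference yields "Circle". Making classes reference-only, as D does, rules out the first case by construction.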

200

answered Oct 05 '22 18:10

dsimcha

The reason creating an object on the heap is slower than creating it on the stack is that the memory allocation methods need to deal with things like heap fragmentation. Allocating memory on the stack is as simple as incrementing the stack pointer (a constant-time operation).

Yet, with a compacting garbage collector, you don't have to worry about heap fragmentation, so heap allocations can be as fast as stack allocations. The Garbage Collection page for the D Programming Language explains this in more detail.
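To make the "as fast as stack allocation" point concrete, here is a rough C++ sketch of bump-pointer allocation, which is the kind of allocation an unfragmented (compacted) heap permits. BumpArena and its members are hypothetical names invented for this sketch. It is not a garbage collector and does not describe how any particular collector, D's included, is implemented; it only shows that handing out memory from a contiguous free region is an offset adjustment plus a bounds check.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical bump-pointer arena, for illustration only.
class BumpArena {
public:
    explicit BumpArena(std::size_t capacity) : buffer_(capacity), offset_(0) {}

    // Returns `size` bytes aligned to `align` (assumed to be a power of two),
    // or nullptr when the arena does not have enough room left.
    void* allocate(std::size_t size,
                   std::size_t align = alignof(std::max_align_t)) {
        const auto base = reinterpret_cast<std::uintptr_t>(buffer_.data());
        const std::uintptr_t current = base + offset_;
        // Round the current address up to the next multiple of `align`.
        const std::uintptr_t aligned =
            (current + align - 1) & ~(static_cast<std::uintptr_t>(align) - 1);
        const std::size_t start = static_cast<std::size_t>(aligned - base);
        if (start > buffer_.size() || size > buffer_.size() - start) {
            return nullptr;  // out of space
        }
        offset_ = start + size;  // the "bump": just advance an offset
        return buffer_.data() + start;
    }

    // Frees everything at once, similar in spirit to popping a stack frame.
    void reset() { offset_ = 0; }

    std::size_t used() const { return offset_; }

private:
    std::vector<unsigned char> buffer_;
    std::size_t offset_;
};
```

Two consecutive 16-byte requests come back 16 bytes apart, with no free-list search in between. What a real collector adds on top of this is the expensive part: finding live objects and moving them so the free region stays contiguous.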

The assertion that GC'd languages run faster probably assumes that many programs allocate memory on the heap much more often than on the stack. If heap allocation can be faster in a GC'd language, then it follows that a huge part of most programs (heap allocation) has just been optimized.

answered Oct 05 '22 17:10

Jack Edmonds
