According to several sources, including Wikipedia, the two most commonly used ways of implementing a binary tree are:

1. Individually allocated nodes that link to their children (and possibly their parent) through pointers.
2. A single array, where the children of the node stored at index i are found at fixed offsets (typically indices 2i+1 and 2i+2).
The second one is obviously superior in terms of memory usage and locality of reference. However, it can lead to problems if you want to allow insertions and removals from the tree in such a manner that may leave the tree unbalanced. This is because the memory usage of this design is an exponential function of the tree depth.
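For concreteness, this is roughly what the two layouts look like (a sketch; the type names are only illustrative):

    #include <cstddef>
    #include <vector>

    // Layout 1: individually allocated nodes linked by pointers.
    struct PtrNode {
        int      value;
        PtrNode* left  = nullptr;
        PtrNode* right = nullptr;
    };

    // Layout 2: an implicit tree in one array. The node stored at index i
    // has its children at indices 2*i + 1 and 2*i + 2, so an unbalanced
    // tree of depth d needs on the order of 2^(d+1) slots.
    struct ArrayTree {
        std::vector<int> slots;
        static std::size_t left (std::size_t i) { return 2 * i + 1; }
        static std::size_t right(std::size_t i) { return 2 * i + 2; }
    };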
Suppose that you want to support such insertions and removals. How can you implement the tree so that traversal makes good use of CPU caches?
I was thinking about making an object pool for the nodes and allocating them in an array. That way the nodes would be close together in memory and should therefore have good locality of reference.
But if the size of the node is the same as the size of the cache line, does this make any sense?
If you have an L1 line size of 64 bytes and you access the first element of a std::vector<std::uint8_t>(64), you will probably have the entire contents of the vector in your L1 cache, which means you can access any element very quickly. But what if the size of an element is the same as the cache line size? Since the line size is unlikely to differ much between the L1, L2, and L3 caches, there seems to be no way in which locality of reference can help here. Am I wrong? What else can be done?
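To make the pool idea concrete, here is a minimal sketch of what I have in mind (the names and layout are only illustrative). Using 32-bit indices instead of pointers also keeps each node well under a 64-byte line, so several nodes share a cache line:

    #include <cstdint>
    #include <vector>

    // A pooled node: 32-bit indices instead of 64-bit pointers keep the node
    // small (12 bytes), so about five nodes fit into one 64-byte cache line.
    struct PoolNode {
        std::uint32_t key;
        std::uint32_t left  = 0;   // index into the pool; 0 means "no child"
        std::uint32_t right = 0;   // (slot 0 is reserved as a sentinel)
    };

    struct NodePool {
        std::vector<PoolNode> nodes = std::vector<PoolNode>(1);  // slot 0 is the "null" sentinel

        std::uint32_t allocate(std::uint32_t key) {
            nodes.push_back(PoolNode{key});   // nodes stay contiguous in memory
            return static_cast<std::uint32_t>(nodes.size() - 1);
        }
    };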
All in all, the process is more cache-friendly than searching in a regular std::set (where elements are totally scattered throughout memory according to the allocator's whims) and resulting lookup times are consequently much better.
Some more general algorithms, such as Cooley–Tukey FFT, are optimally cache-oblivious under certain choices of parameters. As these algorithms are only optimal in an asymptotic sense (ignoring constant factors), further machine-specific tuning may be required to obtain nearly optimal performance in an absolute sense.
Dynamic binary trees are constructed in much the same way as linked lists, except with tree nodes in place of list links: each node has two links to "next" items (its children) and, optionally, one link back to the previous item (its parent).
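In code, such a node might look like this (just a sketch, with an optional parent link for the "previous item"):

    struct Node {
        int   value;
        Node* left   = nullptr;   // link to the next item on the left
        Node* right  = nullptr;   // link to the next item on the right
        Node* parent = nullptr;   // optional link back to the previous item
    };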
Cache-friendly data structures fit within a cache line and are aligned in memory so that they make optimal use of cache lines. A common example is a two-dimensional matrix whose row dimension is chosen so that a row fits within a cache-line-sized block, giving optimal performance when scanning along rows.
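As an illustrative sketch (assuming 64-byte cache lines; the constant and type names are made up), a node can be aligned so it never straddles two lines, and a matrix row can be sized so it occupies exactly one line:

    #include <array>
    #include <cstddef>
    #include <cstdint>

    constexpr std::size_t kCacheLine = 64;   // assumed L1 line size

    // A node aligned (and padded) to the assumed cache-line size, so touching
    // any of its fields loads exactly one line.
    struct alignas(kCacheLine) AlignedNode {
        std::uint64_t key;
        AlignedNode*  left;
        AlignedNode*  right;
    };

    // A row of 16 four-byte elements occupies exactly one 64-byte line, so
    // scanning along a row touches a single cache line per row.
    using Row    = std::array<std::int32_t, kCacheLine / sizeof(std::int32_t)>;
    using Matrix = std::array<Row, 16>;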
Unless you are doing research on how to improve binary trees for cache access patterns, this feels like an XY problem: what is the problem you are actually trying to solve? Why do you think a binary tree is the best data structure for it? What is the expected working-set size?
If you are looking for generic associative storage, there are several cache-friendly (other keywords: "cache-efficient", "cache-oblivious") data structures, such as Judy arrays, for which an extensive explanation is available as a PDF.
If your working set is small enough and you only need an ordered set of items, a simple sorted array may be enough, which can bring another performance benefit: better branch prediction.
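For instance, a sorted std::vector searched with std::lower_bound keeps everything contiguous (a sketch):

    #include <algorithm>
    #include <vector>

    // Ordered-array "set": contiguous storage, binary search over it.
    bool contains(const std::vector<int>& sorted, int key) {
        auto it = std::lower_bound(sorted.begin(), sorted.end(), key);
        return it != sorted.end() && *it == key;
    }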
In the end, the only way to find out what is best for your use case is to try the different approaches and measure them.
Use a block allocator.
You have one, or maybe a handful of, contiguous memory "pools" from which you dole out blocks of a fixed size. The free blocks are kept on a linked list threaded through the pool. Allocation is then simply:

    answer = head;
    head = head->next;
    return answer;
and freeing is simply:

    tofree->next = head;
    head = tofree;
If you allow more than one pool, you of course need code to determine which pool a block belongs to, which adds a bit of complexity, but not much. It's essentially a simple memory allocation system. Since all members of a pool are close together in memory, you get good cache locality for small trees. For large trees you'll have to be a bit cleverer.
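A minimal sketch of such an allocator (a single fixed-size pool; the class and member names are my own, and it assumes the block size is at least pointer-sized and suitably aligned):

    #include <cstddef>
    #include <vector>

    // Fixed-size block allocator: one contiguous pool, with free blocks
    // threaded onto an intrusive singly linked free list.
    class BlockPool {
    public:
        BlockPool(std::size_t block_size, std::size_t block_count)
            : storage_(block_size * block_count) {
            // Thread every block onto the free list.
            for (std::size_t i = 0; i < block_count; ++i)
                release(storage_.data() + i * block_size);
        }

        void* acquire() {                 // allocation: pop the head of the free list
            if (!head_) return nullptr;   // pool exhausted; a real allocator might grow here
            FreeBlock* answer = head_;
            head_ = head_->next;
            return answer;
        }

        void release(void* block) {       // freeing: push the block back onto the free list
            auto* tofree = static_cast<FreeBlock*>(block);
            tofree->next = head_;
            head_ = tofree;
        }

    private:
        struct FreeBlock { FreeBlock* next; };
        std::vector<unsigned char> storage_;  // the contiguous pool
        FreeBlock* head_ = nullptr;           // head of the free list
    };

Tree nodes acquired from the pool all live inside one contiguous buffer, which is essentially the object pool described in the question.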