I am creating a very fast multi-threaded discrete event simulation framework. The core of the framework uses atomics and lockless programming techniques to achieve very fast execution across many threads. This requires me to align some variables to cache lines and pad the remaining cache line space so that I don't have cache line contention. Here is how I do it: <pre class="prettyprint"><code>// compute cache line padding size constexpr u64 CLPAD(u64 _objSize) { return ((_objSize / CACHELINE_SIZE) * CACHELINE_SIZE) + (((_objSize % CACHELINE_SIZE) > 0) * CACHELINE_SIZE) - _objSize; } alignas(CACHELINE_SIZE) MyObject myObj; char padding[CLPAD(sizeof(myObj))]; </code></pre> This works great for me, but I stumbled upon an issue today when I was using this methodology for a new object type. The CLPAD() function returns the amount of chars needed to pad the input type up to the next cache line. However, if I put in a type that is exactly sized a multiple of number of cache lines, the CLPAD returns 0. If you attempt to create a zero sized array, you get this warning/error: <pre class="prettyprint"><code>ISO C++ forbids zero-size array 'padding' </code></pre> I know I could modify CLPAD() to return CACHELINE_SIZE in this case, but then I'm burning a cache line worth of space for no reason. How can I make the declaration of 'padding' disappear if CLPAD returns 0?

Taking a page from <code>std::aligned_storage<></code>, I've come up with the following: <pre class="prettyprint"><code>template<class T, bool = false> struct padded { using type = struct { alignas(CACHELINE_SIZE)T myObj; char padding[CLPAD(sizeof(T))]; }; }; template<class T> struct padded<T, true> { using type = struct { alignas(CACHELINE_SIZE)T myObj; }; }; template<class T> using padded_t = typename padded<T, (sizeof(T) % CACHELINE_SIZE == 0)>::type; </code></pre> Usage: <pre class="prettyprint"><code>struct alignas(32) my_type_1 { char c[32]; }; // char c[32] to silence MSVC warning struct my_type_2 { char c[CACHELINE_SIZE * 2]; }; // ditto int main() { padded_t<my_type_1> pt0; padded_t<my_type_2> pt1; sizeof(pt0); // 128 alignof(pt0); // 128 sizeof(pt1); // 256 alignof(pt1); // 128 } </code></pre> You can provide a function to access <code>myObj</code> however you wish.

Cache line padding for variables that are a multiple of cache line size

Tags:

I am creating a very fast multi-threaded discrete event simulation framework. The core of the framework uses atomics and lockless programming techniques to achieve very fast execution across many threads. This requires me to align some variables to cache lines and pad the remaining cache line space so that I don't have cache line contention. Here is how I do it:

// compute cache line padding size
constexpr u64 CLPAD(u64 _objSize) {
  return ((_objSize / CACHELINE_SIZE) * CACHELINE_SIZE) +
      (((_objSize % CACHELINE_SIZE) > 0) * CACHELINE_SIZE) -
      _objSize;
}

alignas(CACHELINE_SIZE) MyObject myObj;
char padding[CLPAD(sizeof(myObj))];

This works great for me, but I stumbled upon an issue today when I was using this methodology for a new object type. The CLPAD() function returns the amount of chars needed to pad the input type up to the next cache line. However, if I put in a type that is exactly sized a multiple of number of cache lines, the CLPAD returns 0. If you attempt to create a zero sized array, you get this warning/error:

ISO C++ forbids zero-size array 'padding'

I know I could modify CLPAD() to return CACHELINE_SIZE in this case, but then I'm burning a cache line worth of space for no reason.

How can I make the declaration of 'padding' disappear if CLPAD returns 0?

223

asked May 19 '17 06:05

nic

1 Answers

Taking a page from std::aligned_storage<>, I've come up with the following:

template<class T, bool = false>
struct padded
{
    using type = struct
    {
        alignas(CACHELINE_SIZE)T myObj;
        char padding[CLPAD(sizeof(T))];
    };
};

template<class T>
struct padded<T, true>
{
    using type = struct
    {
        alignas(CACHELINE_SIZE)T myObj;
    };
};

template<class T>
using padded_t = typename padded<T, (sizeof(T) % CACHELINE_SIZE == 0)>::type;

Usage:

struct alignas(32) my_type_1 { char c[32]; }; // char c[32] to silence MSVC warning
struct my_type_2 { char c[CACHELINE_SIZE * 2]; }; // ditto

int main()
{
    padded_t<my_type_1> pt0;
    padded_t<my_type_2> pt1;

    sizeof(pt0);    // 128
    alignof(pt0);   // 128

    sizeof(pt1);    // 256
    alignof(pt1);   // 128
}

You can provide a function to access myObj however you wish.

answered Sep 25 '22 10:09

user2296177

Related questions
                            
                                What does the "target" property in tsconfig.json actually represent?
                            
                                FileProvider authority in android library project
                            
                                "Cannot read property _location of null" when using React Apollo in a Jest test case
                            
                                Angular 4: JSONP injected script did not invoke callback
                            
                                How to get the cursor position in a C program using termcap, without writing a character?
                            
                                data model for notification in social network?
                            
                                Exception using @RolesAllowed in SpringBoot app
                            
                                Where does SignalR belong in a DDD architecture?
                            
                                Name of font by powershell
                            
                                marker.setIcon throws java.lang.IllegalArgumentException: Unmanaged descriptor
                            
                                How to extend a volume programmatically
                            
                                Golang middleware with just the standard library

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Cache line padding for variables that are a multiple of cache line size

Tags:

nic

People also ask

1 Answers

user2296177

Recent Activity

Donate For Us