This question is about how to design a program so it will be easy to make certain modifications. I have a class which holds some (non-trivial) data and has several member functions that change this data. Sometimes I need to compute some property of this data. But it is slow to recompute it from scratch on every change. It is much faster to compute a small update to these properties instead. I have several such properties which I need to be able to easily add or remove to/from my class (or turn on/off) to carry out some numerical experiments. The class is only modified by myself and is used for numerical simulations (scientific code). <h3>Concrete example</h3> Let's say I have a class that holds a number <code>x</code>. But I also need <code>2^x</code> (a "property" of <code>x</code>). The basic class is: <pre class="prettyprint"><code>class C { double x; public: C() : x(0.0) { } void inc() { x += 1; } void dec() { x -= 1; } void set(double x_) { x = x_; } }; </code></pre> But now I need to keep track of <code>2^x</code> and to keep updating this value every time <code>x</code> changes. So I end up with <pre class="prettyprint"><code>class expC { double expx; public: expC(const double &x) { recompute(x); } void inc() { expx *= 2; } // fast incremental change void dec() { expx /= 2; } // fast incremental change void recompute(const double &x) { expx = std::pow(2, x); // slow recomputation from scratch } }; class C { double x; expC prop1; // XX public: C() : x(0.0), prop1(x) // XX { } void inc() { x += 1; prop1.inc(); // XX } void dec() { x -= 1; prop1.dec(); // XX } void set(double x_) { x = x_; prop1.recompute(x); // XX } }; </code></pre> <code>XX</code> marks changes I needed to make to the class <code>C</code>. That's a lot of changes, which is error prone. It becomes even more complicated with several properties, which my even depend on each other. <pre class="prettyprint"><code>class C { double x; expC prop1; // XX someC prop2; // XX public: C() : x(0.0), prop1(x), prop2(x, prop1) // XX { } void inc() { x += 1; prop1.inc(); // XX prop2.inc(); // XX } void dec() { x -= 1; prop1.dec(); // XX prop2.dec(); // XX } void set(double x_) { x = x_; prop1.recompute(x); // XX prop2.recompute(x, prop1); // XX } }; </code></pre> Question: What is a good design for such a program? I'm sure it's possible to do better than the above. The goals are: 1) Make it easy to add/remove such properties or turn on/off their computation 2) Performance is critical. <code>inc</code> and <code>dec</code> are called in tight inner loops and do relatively little. They cannot be made virtual for performance reasons. In reality <code>x</code> is a more complex data structure. Think e.g. adding/removing edges to/from a graph and keeping track of its degree sequence during the process. <hr> Update @tobi303 asked that I show how this class would be used. It's in a manner similar to this: <pre class="prettyprint"><code>void simulate(C &c) { for (/* lots of iterations */) { c.inc(); double p1 = c.prop1.value(); double p2 = c.prop2.value(); if (condition(p1,p2)) c.dec(); } } </code></pre> Or in words: <ul> <li>make a (random) change</li> <li>get property values after the change</li> <li>depending on the new property values, decide whether to accept or undo the change.</li> </ul> It's actually a Monte-Carlo simulation similar to a Metropolis-Hasting algorithm. A concrete example could be where the "data" in class <code>C</code> (state) is the spin state of an Ising model (for those familiar with it) and properties are the total energy and total magnetization of the system. These are much faster to update after a single spin flip than to recompute from scratch. In practice I don't have an Ising model, I have something a bit more complicated. I have several properties, some fast to compute and some slow (actually I have some auxiliary data structures that help compute properties). I need to experiment with combinations of different properties, so I often change what I include in the code. Sometimes I implement new properties. When I don't need an already implemented property, I need to be able to turn off its computation for performance reasons (some are really slow to compute).

Here is an approach that may work for you: <pre class="prettyprint"><code>class C { double x; expC prop1; someC prop2; . . . template <typename F> void for_each_property(const F &f) { f(prop1,x); f(prop2,x,prop1); . . . } public: C() : x(0.0), prop1(x), prop2(x, prop1) { } void inc() { x += 1; for_each_property([](auto &prop,auto&& ...) { prop.inc(); }); } void dec() { x -= 1; for_each_property([](auto &prop,auto&& ...) { prop.dec(); }); } void set(double x_) { x = x_; for_each_property([](auto &prop,auto&& ... args) { prop.recompute(args...); }); } }; </code></pre> When you add a new property, you only need to add one call in <code>for_each_property()</code>. The use of variadics avoids the need to provide new overloads for different parameters as long as you stick to the same formula. This doesn't eliminate the duplication in the constructor, unless you are willing to switch to doing default initialization of the properties and then call <code>set(0.0)</code>.

How to design this program with easy modification in mind?

Tags:

c++

This question is about how to design a program so it will be easy to make certain modifications.

I have a class which holds some (non-trivial) data and has several member functions that change this data.

Sometimes I need to compute some property of this data. But it is slow to recompute it from scratch on every change. It is much faster to compute a small update to these properties instead.

I have several such properties which I need to be able to easily add or remove to/from my class (or turn on/off) to carry out some numerical experiments. The class is only modified by myself and is used for numerical simulations (scientific code).

Concrete example

Let's say I have a class that holds a number x. But I also need 2^x (a "property" of x). The basic class is:

class C {
    double x;

public:
    C() : x(0.0) 
    { }

    void inc() { x += 1; } 
    void dec() { x -= 1; } 
    void set(double x_) { x = x_; } 
};

But now I need to keep track of 2^x and to keep updating this value every time x changes. So I end up with

class expC {
    double expx;

public:        
    expC(const double &x) {
        recompute(x);
    }

    void inc() { expx *= 2; } // fast incremental change
    void dec() { expx /= 2; } // fast incremental change
    void recompute(const double &x) {
        expx = std::pow(2, x); // slow recomputation from scratch
    }
};


class C {
    double x;

    expC prop1; // XX

public:
    C() : x(0.0), prop1(x) // XX 
    { }

    void inc() { 
        x += 1;
        prop1.inc(); // XX 
    }
    void dec() { 
        x -= 1; 
        prop1.dec(); // XX
    }
    void set(double x_) { 
        x = x_;
        prop1.recompute(x); // XX
    }
};

XX marks changes I needed to make to the class C. That's a lot of changes, which is error prone. It becomes even more complicated with several properties, which my even depend on each other.

class C {
    double x;

    expC  prop1; // XX
    someC prop2; // XX

public:
    C() : x(0.0), prop1(x), prop2(x, prop1) // XX 
    { }

    void inc() { 
        x += 1;
        prop1.inc(); // XX 
        prop2.inc(); // XX 
    }
    void dec() { 
        x -= 1; 
        prop1.dec(); // XX
        prop2.dec(); // XX
    }
    void set(double x_) { 
        x = x_;
        prop1.recompute(x); // XX
        prop2.recompute(x, prop1); // XX
    }
};

Question: What is a good design for such a program? I'm sure it's possible to do better than the above. The goals are: 1) Make it easy to add/remove such properties or turn on/off their computation 2) Performance is critical. inc and dec are called in tight inner loops and do relatively little. They cannot be made virtual for performance reasons.

In reality x is a more complex data structure. Think e.g. adding/removing edges to/from a graph and keeping track of its degree sequence during the process.

Update

@tobi303 asked that I show how this class would be used. It's in a manner similar to this:

void simulate(C &c) {
    for (/* lots of iterations */) {
        c.inc();
        double p1 = c.prop1.value();
        double p2 = c.prop2.value();
        if (condition(p1,p2))
            c.dec();
    }
}

Or in words:

make a (random) change
get property values after the change
depending on the new property values, decide whether to accept or undo the change.

It's actually a Monte-Carlo simulation similar to a Metropolis-Hasting algorithm.

A concrete example could be where the "data" in class C (state) is the spin state of an Ising model (for those familiar with it) and properties are the total energy and total magnetization of the system. These are much faster to update after a single spin flip than to recompute from scratch. In practice I don't have an Ising model, I have something a bit more complicated. I have several properties, some fast to compute and some slow (actually I have some auxiliary data structures that help compute properties). I need to experiment with combinations of different properties, so I often change what I include in the code. Sometimes I implement new properties. When I don't need an already implemented property, I need to be able to turn off its computation for performance reasons (some are really slow to compute).

355

asked May 24 '16 16:05

Szabolcs

2 Answers

Just be lazy and don't calculate the properties when you need to. It will remove plenty of code and unnecessary computation.

When you do need your property, compute it if it's not already in cache. So you need a boolean for each property to tell if the cache is up-to-date, and you need to invalidate the booleans each time x itself is updated.

Basically:

class C {
    double x;

    template <typename Value> struct cachedProp {
        bool cache = false;
        Value value;
    }

    cachedProp<expC> prop1;
    cachedProp<someC> prop2;
    //...

    void invalidateCache() {
         prop1.cache = false;
         prop2.cache = false;
         //...
    }
public:
    expC getProperty1() {
        if (!prop1.cache) {
            recalculateProp1();
            prop1.cache = true;
        }
        return prop1.value;
    }

    void inc() {
        x += 1;
        invalidateCache();
    }
};

Edit: an even lazier solution is to instead of storing a boolean in cache, store an integer correponding to the last update and maintain a counter in C. Each time the cache is invalidated, the counter in C is increased. When getting propX, if the counter doesn't match propX.lastUpdate then update `propX.

That way, invalidating cache is just one operation and doesn't have to update all the properties' cache.

160

answered Oct 08 '22 09:10

coyotte508

Here is an approach that may work for you:

class C {
    double x;

    expC prop1;
    someC prop2;
    .
    .
    .

    template <typename F>
    void for_each_property(const F &f)
    {
        f(prop1,x);
        f(prop2,x,prop1);
        .
        .
        .
    }

public:
    C() : x(0.0), prop1(x), prop2(x, prop1)
    { }

    void inc()
    {
        x += 1;

        for_each_property([](auto &prop,auto&& ...) {
            prop.inc();
        });
    }

    void dec()
    {
        x -= 1;

        for_each_property([](auto &prop,auto&& ...) {
            prop.dec();
        });
    }

    void set(double x_)
    {
        x = x_;

        for_each_property([](auto &prop,auto&& ... args) {
            prop.recompute(args...);
        });
    }
};

When you add a new property, you only need to add one call in for_each_property(). The use of variadics avoids the need to provide new overloads for different parameters as long as you stick to the same formula.

This doesn't eliminate the duplication in the constructor, unless you are willing to switch to doing default initialization of the properties and then call set(0.0).

answered Oct 08 '22 11:10

Vaughn Cato

Related questions
                            
                                Matches overlapping lookahead on LZ77/LZSS with suffix trees
                            
                                Finding a "movement direction" (angle) of a point
                            
                                data member with the class name
                            
                                Initializing templated, recursive, POD struct
                            
                                Using Eigen Array-of-Arrays for RGB images
                            
                                Why is this declaration of a function in template class invalid?
                            
                                How to get all possible matches of std::regex
                            
                                OpenCV triangulatePoints varying distance
                            
                                using declaration for concrete output operator (with concrete signature)
                            
                                Assigning a value to a constant syntax or semantic error?
                            
                                Visual C++: forward an array as a pointer
                            
                                Do inline namespace variables have internal linkage? If not, why does the code below work?
                            
                                Choose luminosity (exposure) from HDR image
                            
                                std::is_constructible doesn't give the correct result [duplicate]
                            
                                Why atomic_flag default constructor leaves state unspecified?
                            
                                Using non-exporting functions inside templates in C++ modules
                            
                                does passing lambda by value or reference make it easier to inline?
                            
                                How to autocomplete code with C/C++ in Android Studio with Android NDK
                            
                                Confusion while deriving from std::tuple, can not handle std::get
                            
                                deprecated warnings while using boost.spirit

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With