(Yes, I know that one machine instruction usually doesn't matter. I'm asking this question because I want to understand the pimpl idiom, and use it in the best possible way; and because sometimes I do care about one machine instruction.) In the sample code below, there are two classes, <code>Thing</code> and <code>OtherThing</code>. Users would include "thing.hh". <code>Thing</code> uses the pimpl idiom to hide it's implementation. <code>OtherThing</code> uses a C style – non-member functions that return and take pointers. This style produces slightly better machine code. I'm wondering: is there a way to use C++ style – ie, make the functions into member functions – and yet still save the machine instruction. I like this style because it doesn't pollute the namespace outside the class. Note: I'm only looking at calling member functions (in this case, <code>calc</code>). I'm not looking at object allocation. Below are the files, commands, and the machine code, on my Mac. thing.hh: <pre class="prettyprint"><code>class ThingImpl; class Thing { ThingImpl *impl; public: Thing(); int calc(); }; class OtherThing; OtherThing *make_other(); int calc(OtherThing *); </code></pre> thing.cc: <pre class="prettyprint"><code>#include "thing.hh" struct ThingImpl { int x; }; Thing::Thing() { impl = new ThingImpl; impl->x = 5; } int Thing::calc() { return impl->x + 1; } struct OtherThing { int x; }; OtherThing *make_other() { OtherThing *t = new OtherThing; t->x = 5; } int calc(OtherThing *t) { return t->x + 1; } </code></pre> main.cc (just to test the code actually works...) <pre class="prettyprint"><code>#include "thing.hh" #include <cstdio> int main() { Thing *t = new Thing; printf("calc: %d\n", t->calc()); OtherThing *t2 = make_other(); printf("calc: %d\n", calc(t2)); } </code></pre> Makefile: <pre class="prettyprint"><code>all: main thing.o : thing.cc thing.hh g++ -fomit-frame-pointer -O2 -c thing.cc main.o : main.cc thing.hh g++ -fomit-frame-pointer -O2 -c main.cc main: main.o thing.o g++ -O2 -o $@ $^ clean: rm *.o rm main </code></pre> Run <code>make</code> and then look at the machine code. On the mac I use <code>otool -tv thing.o | c++filt</code>. On linux I think it's <code>objdump -d thing.o</code>. Here is the relevant output: <blockquote> Thing::calc(): 0000000000000000 movq (%rdi),%rax 0000000000000003 movl (%rax),%eax 0000000000000005 incl %eax 0000000000000007 ret calc(OtherThing*): 0000000000000010 movl (%rdi),%eax 0000000000000012 incl %eax 0000000000000014 ret </blockquote> Notice the extra instruction because of the pointer indirection. The first function looks up two fields (impl, then x), while the second only needs to get x. What can be done?

Not too hard, just use the same technique inside your class. Any halfway decent optimizer will inline the trivial wrapper. <pre class="prettyprint"><code>class ThingImpl; class Thing { ThingImpl *impl; static int calc(ThingImpl*); public: Thing(); int calc() { calc(impl); } }; </code></pre>

C++ pimpl idiom wastes an instruction vs. C style?

Tags:

c++

optimization

pimpl-idiom

(Yes, I know that one machine instruction usually doesn't matter. I'm asking this question because I want to understand the pimpl idiom, and use it in the best possible way; and because sometimes I do care about one machine instruction.)

In the sample code below, there are two classes, Thing and OtherThing. Users would include "thing.hh". Thing uses the pimpl idiom to hide it's implementation. OtherThing uses a C style – non-member functions that return and take pointers. This style produces slightly better machine code. I'm wondering: is there a way to use C++ style – ie, make the functions into member functions – and yet still save the machine instruction. I like this style because it doesn't pollute the namespace outside the class.

Note: I'm only looking at calling member functions (in this case, calc). I'm not looking at object allocation.

Below are the files, commands, and the machine code, on my Mac.

thing.hh:

class ThingImpl;
class Thing
{
    ThingImpl *impl;
public:
    Thing();
    int calc();
};

class OtherThing;    
OtherThing *make_other();
int calc(OtherThing *);

thing.cc:

#include "thing.hh"

struct ThingImpl
{
    int x;
};

Thing::Thing()
{
    impl = new ThingImpl;
    impl->x = 5;
}

int Thing::calc()
{
    return impl->x + 1;
}

struct OtherThing
{
    int x;
};

OtherThing *make_other()
{
    OtherThing *t = new OtherThing;
    t->x = 5;
}

int calc(OtherThing *t)
{
    return t->x + 1;
}

main.cc (just to test the code actually works...)

#include "thing.hh"
#include <cstdio>

int main()
{
    Thing *t = new Thing;
    printf("calc: %d\n", t->calc());

    OtherThing *t2 = make_other();
    printf("calc: %d\n", calc(t2));
}

Makefile:

all: main

thing.o : thing.cc thing.hh
    g++ -fomit-frame-pointer -O2 -c thing.cc

main.o : main.cc thing.hh
    g++ -fomit-frame-pointer -O2 -c main.cc

main: main.o thing.o
    g++ -O2 -o $@ $^

clean: 
    rm *.o
    rm main

Run make and then look at the machine code. On the mac I use otool -tv thing.o | c++filt. On linux I think it's objdump -d thing.o. Here is the relevant output:

Thing::calc():
0000000000000000 movq (%rdi),%rax
0000000000000003 movl (%rax),%eax
0000000000000005 incl %eax
0000000000000007 ret
calc(OtherThing*):
0000000000000010 movl (%rdi),%eax
0000000000000012 incl %eax
0000000000000014 ret

Notice the extra instruction because of the pointer indirection. The first function looks up two fields (impl, then x), while the second only needs to get x. What can be done?

911

asked May 21 '10 08:05

Rob N

3 Answers

One instruction is rarely a thing to spend much time worrying over. Firstly, the compiler may cache the pImpl in a more complex use case, thus amortising the cost in a real-world scenario. Secondly, pipelined architectures make it almost impossible to predict the real cost in clock cycles. You'll get a much more realistic idea of the cost if you run these operations in a loop and time the difference.

136

answered Oct 16 '22 06:10

Marcelo Cantos

Not too hard, just use the same technique inside your class. Any halfway decent optimizer will inline the trivial wrapper.

class ThingImpl;
class Thing
{
    ThingImpl *impl;
    static int calc(ThingImpl*);
public:
    Thing();
    int calc() { calc(impl); }
};

answered Oct 16 '22 05:10

MSalters

There's the nasty way, which is to replace the pointer to ThingImpl with a big-enough array of unsigned chars and then placement/new reinterpret cast/explicitly destruct the ThingImpl object.

Or you could just pass the Thing around by value, since it should be no larger than the pointer to the ThingImpl, though may require a little more than that (reference counting of the ThingImpl would defeat the optimisation, so you need some way of flagging the 'owning' Thing, which might require extra space on some architectures).

answered Oct 16 '22 07:10

Pete Kirkham

Related questions
                            
                                Difference between a program that crashes and program that hangs
                            
                                Trusting the Return Value Optimization
                            
                                const cast to allow read lock, does this smell bad?
                            
                                C++ compilation for iPhone (STL issue?)
                            
                                What alternatives to the Windows registry exist to store software configuration settings [closed]
                            
                                'long long int' is interpreted as 'long int'. How do I get round this?
                            
                                C++ Variable declarable in function body, but not class member?
                            
                                Storing objects in STL vector - minimal set of methods
                            
                                Which programming languages support constant methods?
                            
                                Class declaration confusion - name between closing brace and semi-colon
                            
                                How do I compile for windows XP under windows 7 / visual studio 2008
                            
                                C++ Class Access Specifier Verbosity
                            
                                Convert char pointer (char*) to struct
                            
                                Check my anagram code from a job interview in the past
                            
                                How to debug packet loss?
                            
                                C++ OOP: Which functions to put into the class?
                            
                                Why would you use umask?
                            
                                x86-64 long double precision
                            
                                Why can't QFile read from the "~" directory?
                            
                                C++ Preprocessor string literal concatenation

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With