
The cost of passing by shared_ptr

Tags: c++, performance, shared-ptr

I use std::tr1::shared_ptr extensively throughout my application. This includes passing objects in as function arguments. Consider the following:

    class Dataset {...};

    void f( shared_ptr< Dataset const > pds ) {...}
    void g( shared_ptr< Dataset const > pds ) {...}
    ...

While passing a dataset object around via shared_ptr guarantees its existence inside f and g, the functions may be called millions of times, which causes a lot of shared_ptr objects to be created and destroyed. Here's a snippet of the flat gprof profile from a recent run:

    Each sample counts as 0.01 seconds.
      %   cumulative   self              self     total
     time   seconds   seconds    calls   s/call   s/call  name
      9.74    295.39    35.12 2451177304     0.00     0.00  std::tr1::__shared_count::__shared_count(std::tr1::__shared_count const&)
      8.03    324.34    28.95 2451252116     0.00     0.00  std::tr1::__shared_count::~__shared_count()

So, roughly 18% of the runtime (9.74% + 8.03%) was spent on reference counting with shared_ptr objects. Is this normal?

A large portion of my application is single-threaded and I was thinking about re-writing some of the functions as

    void f( const Dataset& ds ) {...}

and replacing the calls

    shared_ptr< Dataset > pds( new Dataset(...) );
    f( pds );

with

    f( *pds );

in places where I know for sure the object will not get destroyed while the flow of the program is inside f(). But before I run off to change a bunch of function signatures and calls, I wanted to know what the typical performance hit of passing by shared_ptr is. It seems that shared_ptr should not be passed by value to functions that get called very often.

Any input would be appreciated. Thanks for reading.

-Artem

Update: After changing a handful of functions to accept const Dataset&, the new profile looks like this:

    Each sample counts as 0.01 seconds.
      %   cumulative   self              self     total
     time   seconds   seconds    calls   s/call   s/call  name
      0.15    241.62     0.37 24981902     0.00     0.00  std::tr1::__shared_count::~__shared_count()
      0.12    241.91     0.30 28342376     0.00     0.00  std::tr1::__shared_count::__shared_count(std::tr1::__shared_count const&)

I'm a little puzzled by the number of destructor calls being smaller than the number of copy constructor calls, but overall I'm very pleased with the decrease in the associated run-time. Thanks to all for their advice.

asked Mar 23 '10 18:03

Artem Sokolov

2 Answers

Always pass your shared_ptr by const reference:

    void f(const shared_ptr<Dataset const>& pds) {...}
    void g(const shared_ptr<Dataset const>& pds) {...}

Edit: Regarding the safety issues mentioned by others:

- When shared_ptr is used heavily throughout an application, passing it by value can consume a large share of the run time (I've seen it exceed 50%).
- Use const T& instead of const shared_ptr<T const>& when the argument must not be null.
- Using const shared_ptr<T const>& is safer than const T* when performance is an issue.
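
To make the difference between the three parameter styles concrete, here is a minimal sketch that observes the reference count from inside each callee. It assumes C++11 `std::shared_ptr` rather than `std::tr1::shared_ptr`, and the `Dataset` struct, its `value` member, and all function names are hypothetical stand-ins, not code from the question.

```cpp
#include <cassert>
#include <memory>

// Hypothetical stand-in for the question's Dataset class.
struct Dataset {
    int value = 42;
};

// By value: the shared_ptr is copied, so the reference count is
// incremented on entry and decremented again when pds is destroyed.
long use_count_by_value(std::shared_ptr<const Dataset> pds) {
    return pds.use_count();
}

// By const reference: no shared_ptr copy is made, so the count is untouched
// (provided the argument already has exactly this type).
long use_count_by_const_ref(const std::shared_ptr<const Dataset>& pds) {
    return pds.use_count();
}

// By plain reference: the callee never sees the smart pointer at all.
int read_by_ref(const Dataset& ds) {
    return ds.value;
}

// Exercises the three styles and checks the observed counts.
void check_reference_counts() {
    std::shared_ptr<const Dataset> pds(new Dataset());
    assert(pds.use_count() == 1);
    assert(use_count_by_value(pds) == 2);      // a temporary extra owner
    assert(use_count_by_const_ref(pds) == 1);  // no extra owner
    assert(read_by_ref(*pds) == 42);           // no shared_ptr involved
    assert(pds.use_count() == 1);
}
```

Calling `check_reference_counts()` from `main` should pass every assertion. One caveat worth keeping in mind: if the caller holds a `shared_ptr<Dataset>` while the parameter is `const shared_ptr<Dataset const>&`, the pointer types differ, so a temporary `shared_ptr` is created by conversion and the reference count is still updated for that call.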

answered Sep 27 '22 21:09

Sam Harwell

You need shared_ptr only when passing it to functions or objects that keep it for future use. For example, a class may keep a shared_ptr for use in a worker thread. For simple synchronous calls, a plain pointer or reference is enough. shared_ptr should not completely replace plain pointers.
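
The distinction above can be sketched roughly as follows, assuming C++11 `std::shared_ptr`. The `Dataset` struct, the `inspect` function, and the `Worker` class are all hypothetical names introduced for illustration, and the worker thread itself is omitted: a deferred call to `run()` stands in for work done later.

```cpp
#include <memory>
#include <utility>

// Hypothetical stand-in for the question's Dataset class.
struct Dataset {
    int value = 7;
};

// Synchronous call: the caller keeps the object alive for the whole call,
// so a plain reference is enough and no reference counting takes place.
int inspect(const Dataset& ds) {
    return ds.value;
}

// Hypothetical object that retains the dataset for later use. It takes
// shared ownership, so the dataset outlives the caller's own pointer.
class Worker {
public:
    explicit Worker(std::shared_ptr<const Dataset> ds) : ds_(std::move(ds)) {}

    // Runs some time after construction, possibly after the caller has
    // released its own shared_ptr.
    int run() const {
        return ds_->value;
    }

private:
    std::shared_ptr<const Dataset> ds_;
};
```

With this split, a hot function such as `inspect` is called as `inspect(*pds)`, while `Worker w(pds)` pays for one reference-count increment at construction and can still call `run()` safely after the caller does `pds.reset()`.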

answered Sep 27 '22 19:09

Alex F
