
Is there any preference linker gives to static symbols or dynamic symbols?

Tags: c++, dynamic, static, linker, shared-libraries

I have two headers and two cpp files:

//f1.h
int f1();

//f1.cpp
#include "f1.h"
int f1() {return 1;}

//f2.h
int f2();

//f2.cpp
#include "f2.h"
#include "f1.h"
int f2() {return f1() + 1;}

//main.cpp
#include "f2.h"
int main() {return f2();}

First I compile a shared object from f1 and f2 and create a binary from main.cpp depending on that shared object:

g++ -c -fPIC -shared f1.cpp f2.cpp
g++ -shared -fPIC -o libf.so f2.o f1.o
g++ -o dynamic main.cpp libf.so

Now I introduce some changes to f1.cpp (say f1 now returns 2):

//f1.cpp
#include "f1.h"
int f1() {return 2;}

And compile a binary as follows:

g++ -o semistatic main.cpp f1.cpp libf.so

The question is whether the 'semistatic' binary will use the definition of f1() from libf (in which f1 returns 1) or the statically linked one (in which f1 returns 2). Does this differ across systems, and can I rely on it being consistent within a single system?


asked Jun 28 '18 06:06

senx

1 Answer

As has been pointed out, you are violating the one-definition rule. This is not the end of the world, but in this case the C++ standard gives no guarantees about what will happen, and the behavior depends on implementation details of the linker and loader.

Toolchains and operating systems differ considerably, so the above will not even link on Windows. But if you are talking about Linux with the usual linker/loader pair, the changed version will be used, and this holds for every Linux installation.

That is how the linker and loader work on Linux (and this behavior is widely relied upon, for example by the LD_PRELOAD trick):

At link time, definitions coming from a *.so are treated much like weak symbols: they are simply ignored if the linker finds another definition somewhere else (in your case in the updated version of f1.o).
At run time, the loader ignores the definitions from the shared object if the symbol is already bound, i.e. another definition is known. In your case the symbol f1 (because of name mangling it actually has a different name, but let's ignore that for simplicity) is already bound to the definition in the main program, which is therefore also used when f1 is called from within the *.so.

However, this way of doing things is very brittle and some minor changes can lead to a different result.

A: changing the visibility to hidden.

It's recommended to hide symbols which are not part of the public interface, for example:

__attribute__ ((visibility ("hidden")))
int f1() {return 1;}

In this case the old version is used, not the overriding one. The difference is that when the linker sees a hidden symbol being used, it no longer leaves it to the loader to resolve the symbol's address but uses the address at hand directly. Later on, there is no way to change which definition is called.

B: making f1 an inline function.

That would lead to really funny things, because in some parts of the shared object the old version would be used and in other parts the new one.

-fPIC prevents the inlining of functions which are not marked inline (at least with GCC's defaults), so the above holds only for functions which are explicitly marked as inline.

In a nutshell: this trick can be used on Linux. However, in bigger projects you don't want the additional complexity and should try to stick to the simpler and more sustainable one-definition-rule framework.


answered Nov 15 '22 12:11

ead
