Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to find duplicate definitions from template specializations?

I have a template class with a specialization, defined in another file. Therefore it is possible to generate two versions of the same class: one time by replacing the template parameter and one time using the specialization. My current understanding is, that this can lead to two instanciations of the same type to have different sizes in memory, resulting in segmentation faults.

I created a minimal example and the following code is to illustrate the question:

Create a template class:

// - templateexample.h ---------------
#ifndef TEMPLATEEXAMPLE_H
#define TEMPLATEEXAMPLE_H
template<typename T> class Example
{
    public:
        Example(){}
        int doWork() {return 42;}
};
#endif
// -----------------------------------

Template specialization in another file:

// - templatespecialization.h --------
#ifndef TEMPLATESPECIALIZATION_H
#define TEMPLATESPECIALIZATION_H    
#include "templateexample.h"
template<> class Example<int>
{
    public:
        Example() : a(0), b(1), c(2), d(3) {} 
        int doWork() {return a+b+c+d;}

    private:
        int a; //<== the specialized object will be larger in memory
        int b;
        int c;
        int d;
};
#endif
// --------------------------------

Have a class which includes only the template class definition, but should include the specialization.

// - a.h --------------------------
#ifndef A_H
#define A_H   
#include "templateexample.h"
class A
{
    public:
        Example<int> returnSmallExample();
};
#endif

// - a.cpp ------------------------
#include "a.h"
Example<int> A::returnSmallExample() {return Example<int>();}
// --------------------------------

The main class now knows two versions of Example<int> the one from A and the one from the templatespecialization.h.

// - main.cpp ---------------------
#include <iostream>
#include "a.h"
#include "templatespecialization.h"

int main()
{
    A a;
    Example<int> test = a.returnSmallExample();
    std::cout<<test.doWork()<<std::endl;
}
// --------------------------------

Please note, this problem will only occur when compiling class A separately, this example from ideone outputs 6, whereas using separate files can result in a segmentation fauls, or output 42 (https://ideone.com/3RTzlC). On my machine the example compiles successfully and outputs 2013265920:

proof

In the production version of the above example everything is build into a shared library which is used by main.

Question 1: Why doesn't the linker detect this problem? This should be easy to spot by comparing the size of objects.

Question 2: is there a way to examine the object files or the shared library to detect multiple implementations of the same type like in the example above?


Edit: please note: the code above is a minimal example to explain the problem. The reason for the situation is that the template class is from one library and I cannot edit the files from this library. Finally the whole thing is used all over the place in the executable and now I need to find out if the problem above occurs.


Edit: code above can be compiled like this:

#!/bin/bash 
g++ -g -c a.cpp 
g++ -g -c main.cpp 
g++ -o test a.o main.o 
like image 739
Beginner Avatar asked Jan 08 '15 09:01

Beginner


2 Answers

You have different definition of the same template and its specializations in different translation units. This leads to One Definition Rule violation.

A fix would be to put the specialization in the same header file where the primary class template is defined.

Question 1: Why doesn't the linker detect this problem? This should be easy to spot by comparing the size of objects.

Different types may have the same size (e.g. double and int64_t), so, obviously, just comparing sizes of objects does not work.

Question 2: is there a way to examine the object files or the shared library to detect multiple implementations of the same type like in the example above?

You should use gold linker for linking your C++ applications, if you do not use it already. One nice feature of it is --detect-odr-violations command line switch which does exactly what you ask:

gold uses a heuristic to find potential ODR violations: if the same symbol is seen defined in two different input files, and the two symbols have different sizes, then gold looks at the debugging information in the input objects. If the debugging information suggests that the symbols were defined in different source files, gold reports a potential ODR violation. This approach has both false negatives and false positives. However, it is reasonably reliable at detecting problems when linking unoptimized code. It is much easier to find these problems at link time than to debug cases where the wrong symbol.

See Enforcing One Definition Rule for more details.

like image 50
Maxim Egorushkin Avatar answered Nov 03 '22 10:11

Maxim Egorushkin


Question 1: Why doesn't the linker detect this problem? This should be easy to spot by comparing the size of objects.

Because this is not a linker's problem. In case of templates, the main declaration and all the other specializations (be it class or function) should be visible upfront.

Question 2: is there a way to examine the object files or the shared library to detect multiple implementations of the same type like in the example above?

At least I am not aware of any.

To further simplify this situation, look at a similar broken code:

// foo.h
inline void foo () {
#ifdef FOO
  return;
#else
  throw 0;
#endif
}

// foo1.cpp
#define FOO
#include"foo.h"
// ... uses foo() with "return"

// foo2.cpp
#include"foo.h"
// ... uses foo() with "throw"

It's possible that you get different results based on the way of compilation being used.

Update:
Having multiple body definitions for a same function is undefined behavior. That's the reason why you are getting an awkward output like 2013265920 and the same happens in my machine as well. The output should be either 42 or 6. I gave you above example because with inline functions you can create such race conditions.

With my limited knowledge on linking stage, the responsibility undertaken by a typical linker is limited only till rejecting more than 1 non-inline functions definitions with the same signature. e.g.

// Header.h is included in multiple .cpp files
void foo () {}  // rejected due to multiple definitions
inline void bar () {}  // ok because `inline` keyword is found

Beyond that it doesn't check if the function body is similar or not, because that is already parsed during earlier stage and linker doesn't parse the function body.
Having said above, now pay attention to this statement:

template functions are always inline by nature

Hence linker may not get a chance to reject them.

The safest way is to #include the read-only header into your specialized header and include that specialized header everywhere.

like image 36
iammilind Avatar answered Nov 03 '22 10:11

iammilind