In this question the OP asked about limiting what classes a template will accept. A summary of the sentiment that followed is that the equivalent facility in Java is bad; and don't do this. I don't understand why this is bad. Duck typing is certainly a powerful tool; but in my mind it lends itself confusing runtime issues when a class looks close (same function names) but has slightly different behavior. And you can't necessarily rely on compile time checking because of examples like this: <pre class="prettyprint"><code>struct One { int a; int b }; struct Two { int a; }; template <class T> class Worker{ T data; void print() { cout << data.a << endl; } template <class X> void usually_important () { int a = data.a; int b = data.b; } } int main() { Worker<Two> w; w.print(); } </code></pre> Type <code>Two</code> will allow <code>Worker</code> to compile only if <code>usually_important</code> is not called. This could lead to some instantiations of <code>Worker</code> compiling and others not even in the same program. In a case like this, though. The responsibility is put on to the designer of <code>ENGINE</code> to ensure that it is a valid type (after which they should inherit <code>ENGINE_BASE</code>). If they don't, there will be a compiler error. To me this seems much safer while not imposing any restrictions or adding much additional work. <pre class="prettyprint"><code>class ENGINE_BASE {}; // Empty class, all engines should extend this template <class ENGINE> class NeedsAnEngine { BOOST_STATIC_ASSERT((is_base_of<ENGINE_BASE, ENGINE>)); // Do stuff with ENGINE... }; </code></pre>

This is too long, but it might be informative. Generics in Java are a type erasure mechanism, and automatic code generation of type casts and type checks. <code>template</code>s in C++ are code generation and pattern matching mechanisms. You can use C++ <code>template</code>s to do what Java generics do with a bit of effort. <code>std::function< A(B) ></code> behaves in a covariant/contravariant fashion with regards to <code>A</code> and <code>B</code> types and conversion to other <code>std::function< X(Y) ></code>. But the primary design of the two is not the same. A Java <code>List<X></code> will be a <code>List<Object></code> with some thin wrapping on it so users don't have to do type casts on extraction. If you pass it as a <code>List<? extends Bar></code>, it again is getting a <code>List<Object></code> in essence, it just has some extra type information that changes how the casts work and which methods can be invoked. This means you can extract elements from the <code>List</code> into a <code>Bar</code> and know it works (and check it). Only one method is generated for all <code>List<? extends Bar></code>. A C++ <code>std::vector<X></code> is not in essence a <code>std::vector<Object></code> or <code>std::vector<void*></code> or anything else. Each instance of a C++ <code>template</code> is an unrelated type (except template pattern matching). In fact, <code>std::vector<bool></code> uses a completely different implementation than any other <code>std::vector</code> (this is now considered a mistake because the implementation differences "leak" in annoying ways in this case). Each method and function is generated independently for the particular type you pass it. In Java, it is assumed that all objects will fit into some hierarchy. In C++, that is sometimes useful, but it has been discovered it is often ill fitting to a problem. A C++ container need not inherit from a common interface. A <code>std::list<int></code> and <code>std::vector<int></code> are unrelated types, but you can act on them uniformly -- they both are sequential containers. The question "is the argument a sequential container" is a good question. This allows anyone to implement a sequential container, and such sequential containers can as high performance as hand-crafted C code with utterly different implementations. If you created a common root <code>std::container<T></code> which all containers inherited from, it would either be full of <code>virtual</code> table cruft or it would be useless other than as a tag type. As a tag type, it would intrusively inject itself into all non-<code>std</code> containers, requiring that they inherit from <code>std::container<T></code> to be a real container. The traits approach instead means that there are specifications as to what a container (sequential, associative, etc) is. You can test these specifications at compile time, and/or allow types to note that they qualify for certain axioms via traits of some kind. The C++03/11 standard library does this with iterators. <code>std::iterator_traits<T></code> is a traits class that exposes iterator information about an arbitrary type <code>T</code>. Someone completely unconnected to the standard library can write their own iterator, and use <code>std::iterator<...></code> to auto-work with <code>std::iterator_traits</code>, add their own type aliases manually, or specialize <code>std::iterator_traits</code> to pass on the information required. C++11 goes a step further. <code>for( auto&& x : y )</code> can work with things that where written long before the range-based iteration was designed, without touching the class itself. You simply write a free <code>begin</code> and <code>end</code> function in the namespace that the class belongs to that returns a valid forward iterator (note: even invalid forward iterators that are close enough work), and suddenly <code>for ( auto&& x : y )</code> starts working. <code>std::function< A(B) ></code> is an example of using these techniques together with type erasure. It has a constructor that accepts anything that can be copied, destroyed, invoked with <code>(B)</code> and whose return type can be converted to <code>A</code>. The types it can take can be completely unrelated -- only that which is required is tested for. Because of <code>std::function</code>s design, we can have lambda invokables that are unrelated types that can be type-erased into a common <code>std::function</code> if needed, but when not type erased their invokation action is known from there type. So a <code>template</code> function that takes a lambda knows at the point of invokation what will happen, which makes inlining an easy local operation. This technique is not new -- it was in C++ since <code>std::sort</code>, a high level algorithm that is faster than C's <code>qsort</code> due to the ease of inlining invokable objects passed as comparators. In short, if you need a common runtime type, type erase. If you need certain properties, test for those properties, don't force a common base. If you need certain axioms to hold (untestable properties), either document or require callers to claim those properties via tags or traits classes (see how the standard library handles iterator categories -- again, not inheritance). When in doubt, use free functions with ADL enabled to access properties of your arguments, and have your default free functions use SFINAE to look for a method and invoke if it exists, and fail otherwise. Such a mechanism removes the central responsibility of a common base class, allows existing classes to be adapted without modification to pass your requirements (if reasonable), places type erasure only where it is needed, avoids <code>virtual</code> overhead, and ideally generates clear errors when properties are found to not hold. If your <code>ENGINE</code> has certain properites it needs to pass, write a traits class that tests for those. If there are properties that cannot be tested for, create tags that describe such properties. Use specialization of a traits class, or canonical typedefs, to let the class describe which axioms hold for the type. (See iterator tags). If you have a type like <code>ENGINE_BASE</code>, don't demand it, but instead use it as a helper for said tags and traits and axiom typedefs, like <code>std::iterator<...></code> (you never have to inherit from it, it simply acts as a helper). Avoid over specifying requirements. If <code>usually_important</code> is never invoked on your <code>Worker<X></code>, probably your <code>X</code> doesn't need a <code>b</code> in that context. But do test for properties in a way clearer than "method does not compile". And sometimes, just punt. Following such practices might make things harder for you -- so do an easier way. Most code is written and discarded. Know when your code will persist, and write it better and more extendably and more maintainably. Know that you need to practice those techniques on disposable code so you can write it correctly when you have to.

<blockquote> I don't understand why this is bad. Duck typing is certainly a powerful tool; but in my mind it lends itself confusing runtime issues when a class looks close (same function names) but has slightly different behavior. </blockquote> The probability that you can define a non-trivial interface and then by accident have another interface that has different semantics but can be substituted is minimal. This never, ever happens. <blockquote> Type Two will allow Worker to compile only if usually_important is not called. </blockquote> That is a good thing. We depend on it all the time. It makes class templates more flexible. Matching a compile-time interface is strictly superior to a run-time one. This is because run-time interfaces can't differ in key ways that compile-time ones can (e.g. different types in the interface), and require a bunch of run-time abstraction like dynamic allocation that may be unnecessary. <blockquote> In a case like this, though. The responsibility is put on to the designer of ENGINE to ensure that it is a valid type (after which they should inherit ENGINE_BASE). If they don't, there will be a compiler error. To me this seems much safer while not imposing any restrictions or adding much additional work. </blockquote> It is not safer. It is utterly pointless. It is stupendously unlikely that the user will accidentally instantiate the class with the wrong type but it will compile successfully due to circumstantial interface match. What it really boils down to is this: you should only require what you really need. Absolutely definitely must have in order to function. Everything else, don't require it. This is a core tenet of making software maintainable. You cannot possibly imagine what shenanigans I might conceive of long after you have written this class to use it in ways that you never thought it could be used for.

Why is it bad to impose type constraints on templates in C++?

Tags:

c++

inheritance

templates

In this question the OP asked about limiting what classes a template will accept. A summary of the sentiment that followed is that the equivalent facility in Java is bad; and don't do this.

I don't understand why this is bad. Duck typing is certainly a powerful tool; but in my mind it lends itself confusing runtime issues when a class looks close (same function names) but has slightly different behavior. And you can't necessarily rely on compile time checking because of examples like this:

struct One { int a; int b };
struct Two { int a; };

template <class T>
class Worker{
    T data;

    void print() { cout << data.a << endl; }

    template <class X>
    void usually_important () { int a = data.a; int b = data.b; }
}

int main() {
    Worker<Two> w;
    w.print();
}

Type Two will allow Worker to compile only if usually_important is not called. This could lead to some instantiations of Worker compiling and others not even in the same program.

In a case like this, though. The responsibility is put on to the designer of ENGINE to ensure that it is a valid type (after which they should inherit ENGINE_BASE). If they don't, there will be a compiler error. To me this seems much safer while not imposing any restrictions or adding much additional work.

class ENGINE_BASE {}; // Empty class, all engines should extend this

template <class ENGINE>
class NeedsAnEngine {
    BOOST_STATIC_ASSERT((is_base_of<ENGINE_BASE, ENGINE>));
    // Do stuff with ENGINE...
};

817

asked Aug 19 '14 15:08

John Smith

3 Answers

This is too long, but it might be informative.

Generics in Java are a type erasure mechanism, and automatic code generation of type casts and type checks.

templates in C++ are code generation and pattern matching mechanisms.

You can use C++ templates to do what Java generics do with a bit of effort. std::function< A(B) > behaves in a covariant/contravariant fashion with regards to A and B types and conversion to other std::function< X(Y) >.

But the primary design of the two is not the same.

A Java List<X> will be a List<Object> with some thin wrapping on it so users don't have to do type casts on extraction. If you pass it as a List<? extends Bar>, it again is getting a List<Object> in essence, it just has some extra type information that changes how the casts work and which methods can be invoked. This means you can extract elements from the List into a Bar and know it works (and check it). Only one method is generated for all List<? extends Bar>.

A C++ std::vector<X> is not in essence a std::vector<Object> or std::vector<void*> or anything else. Each instance of a C++ template is an unrelated type (except template pattern matching). In fact, std::vector<bool> uses a completely different implementation than any other std::vector (this is now considered a mistake because the implementation differences "leak" in annoying ways in this case). Each method and function is generated independently for the particular type you pass it.

In Java, it is assumed that all objects will fit into some hierarchy. In C++, that is sometimes useful, but it has been discovered it is often ill fitting to a problem.

A C++ container need not inherit from a common interface. A std::list<int> and std::vector<int> are unrelated types, but you can act on them uniformly -- they both are sequential containers.

The question "is the argument a sequential container" is a good question. This allows anyone to implement a sequential container, and such sequential containers can as high performance as hand-crafted C code with utterly different implementations.

If you created a common root std::container<T> which all containers inherited from, it would either be full of virtual table cruft or it would be useless other than as a tag type. As a tag type, it would intrusively inject itself into all non-std containers, requiring that they inherit from std::container<T> to be a real container.

The traits approach instead means that there are specifications as to what a container (sequential, associative, etc) is. You can test these specifications at compile time, and/or allow types to note that they qualify for certain axioms via traits of some kind.

The C++03/11 standard library does this with iterators. std::iterator_traits<T> is a traits class that exposes iterator information about an arbitrary type T. Someone completely unconnected to the standard library can write their own iterator, and use std::iterator<...> to auto-work with std::iterator_traits, add their own type aliases manually, or specialize std::iterator_traits to pass on the information required.

C++11 goes a step further. for( auto&& x : y ) can work with things that where written long before the range-based iteration was designed, without touching the class itself. You simply write a free begin and end function in the namespace that the class belongs to that returns a valid forward iterator (note: even invalid forward iterators that are close enough work), and suddenly for ( auto&& x : y ) starts working.

std::function< A(B) > is an example of using these techniques together with type erasure. It has a constructor that accepts anything that can be copied, destroyed, invoked with (B) and whose return type can be converted to A. The types it can take can be completely unrelated -- only that which is required is tested for.

Because of std::functions design, we can have lambda invokables that are unrelated types that can be type-erased into a common std::function if needed, but when not type erased their invokation action is known from there type. So a template function that takes a lambda knows at the point of invokation what will happen, which makes inlining an easy local operation.

This technique is not new -- it was in C++ since std::sort, a high level algorithm that is faster than C's qsort due to the ease of inlining invokable objects passed as comparators.

In short, if you need a common runtime type, type erase. If you need certain properties, test for those properties, don't force a common base. If you need certain axioms to hold (untestable properties), either document or require callers to claim those properties via tags or traits classes (see how the standard library handles iterator categories -- again, not inheritance). When in doubt, use free functions with ADL enabled to access properties of your arguments, and have your default free functions use SFINAE to look for a method and invoke if it exists, and fail otherwise.

Such a mechanism removes the central responsibility of a common base class, allows existing classes to be adapted without modification to pass your requirements (if reasonable), places type erasure only where it is needed, avoids virtual overhead, and ideally generates clear errors when properties are found to not hold.

If your ENGINE has certain properites it needs to pass, write a traits class that tests for those.

If there are properties that cannot be tested for, create tags that describe such properties. Use specialization of a traits class, or canonical typedefs, to let the class describe which axioms hold for the type. (See iterator tags).

If you have a type like ENGINE_BASE, don't demand it, but instead use it as a helper for said tags and traits and axiom typedefs, like std::iterator<...> (you never have to inherit from it, it simply acts as a helper).

Avoid over specifying requirements. If usually_important is never invoked on your Worker<X>, probably your X doesn't need a b in that context. But do test for properties in a way clearer than "method does not compile".

And sometimes, just punt. Following such practices might make things harder for you -- so do an easier way. Most code is written and discarded. Know when your code will persist, and write it better and more extendably and more maintainably. Know that you need to practice those techniques on disposable code so you can write it correctly when you have to.

103

answered Oct 04 '22 23:10

Yakk - Adam Nevraumont

Let me turn the question around on you: Why is it bad that the code compiles for Two if usually_important isn't called? The type you gave it meets all the needs for that particular instantiation and the compiler will immediately tell you if a particular instantiation no longer meets the interface needed for the needed functionality in the template.

That said if you insist that you need an Engine object, don't do it with templates at all, instead treat it as a sort of strategy pattern with a non-template (using this approach enforces at compile time that the user-defined type adheres to a specific interface, not just that it looks like a duck):

class Worker
{
public:
    explicit Worker(EngineBase* data) : data_(data) {}
    void print() { cout << data_->a() << endl; }

    template <class X>
    void usually_important () { int a = data_->a(); int b = data_->b(); }

private:
    EngineBase* data_;
}

int main() 
{
    Worker w(new ConcreteEngine);
    w.print();
}

answered Oct 05 '22 01:10

Mark B

I don't understand why this is bad. Duck typing is certainly a powerful tool; but in my mind it lends itself confusing runtime issues when a class looks close (same function names) but has slightly different behavior.

The probability that you can define a non-trivial interface and then by accident have another interface that has different semantics but can be substituted is minimal. This never, ever happens.

Type Two will allow Worker to compile only if usually_important is not called.

That is a good thing. We depend on it all the time. It makes class templates more flexible.

Matching a compile-time interface is strictly superior to a run-time one. This is because run-time interfaces can't differ in key ways that compile-time ones can (e.g. different types in the interface), and require a bunch of run-time abstraction like dynamic allocation that may be unnecessary.

In a case like this, though. The responsibility is put on to the designer of ENGINE to ensure that it is a valid type (after which they should inherit ENGINE_BASE). If they don't, there will be a compiler error. To me this seems much safer while not imposing any restrictions or adding much additional work.

It is not safer. It is utterly pointless. It is stupendously unlikely that the user will accidentally instantiate the class with the wrong type but it will compile successfully due to circumstantial interface match.

What it really boils down to is this: you should only require what you really need. Absolutely definitely must have in order to function. Everything else, don't require it. This is a core tenet of making software maintainable. You cannot possibly imagine what shenanigans I might conceive of long after you have written this class to use it in ways that you never thought it could be used for.

answered Oct 05 '22 00:10

Puppy

Related questions
                            
                                C++ template - error: expected initializer before '<' token
                            
                                How to check if a void* is to a pointer on a valid instance of an object type?
                            
                                Can't call a static method in Qt
                            
                                Qt Signal Slot Architecture Unwanted Infinite Loop
                            
                                void ** equivalent to void * return?
                            
                                ignore escape sequence c++
                            
                                Objects of size 0
                            
                                Importance of "this"
                            
                                Passing a CMake variable to C++ source code [duplicate]
                            
                                Use of Curly Brackets (Braces) Around a variable C++
                            
                                'constexpr' non-static member function will not be implicitly 'const' in C++1y; add 'const' to avoid a change in behavior
                            
                                OpenCV: Normalizing pixel values of an image
                            
                                Re-generating source code from LLVM parse tree?
                            
                                How to copy grayscale image to RGB images red channel in OpenCV?
                            
                                How to check if object is const or not?
                            
                                Why is a const variable declared as a class member still modifiable?
                            
                                Multiple OR or AND conditions in IF statement
                            
                                Using << operator to write to both a file and cout
                            
                                How can I get the innermost template parameter type?
                            
                                How does a "for each" loop in C++ know the length of an array

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With