In the official PyTorch C++ examples on GitHub, you can see a strange definition of a class:
class CustomDataset : public torch::data::datasets::Dataset<CustomDataset> {...}
My understanding is that this defines a class CustomDataset which "inherits from" or "extends" torch::data::datasets::Dataset<CustomDataset>. This is weird to me, since the class we're creating inherits from another class which is parameterized by the class we're creating... How does this even work? What does it mean? This seems to me like an Integer class inheriting from vector<Integer>, which seems absurd.
This is the curiously recurring template pattern, or CRTP for short. A major advantage of this technique is that it enables so-called static polymorphism, meaning that functions in torch::data::datasets::Dataset can call into functions of CustomDataset without needing to make those functions virtual (and thus deal with the runtime mess of virtual method dispatch and so on). You can also perform compile-time metaprogramming, such as compile-time enable_ifs that depend on the properties of the custom dataset type.
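To make the pattern concrete, here is a minimal, self-contained sketch of CRTP-based static polymorphism. The Base/CustomThing names are invented for illustration and are not part of the PyTorch API; the point is only that the base class calls a method of the derived class through a static_cast, with no virtual functions involved.

#include <iostream>

// Base is parameterized by the class that derives from it (CRTP).
template <typename Derived>
struct Base {
  // Calls a method of the derived class without any virtual dispatch:
  // the cast is legal because, by construction, *this really is a Derived.
  void greet() {
    static_cast<Derived&>(*this).name();
  }
};

// The derived class passes itself as the template argument.
struct CustomThing : Base<CustomThing> {
  void name() { std::cout << "CustomThing\n"; }
};

int main() {
  CustomThing t;
  t.greet();  // prints "CustomThing"; resolved entirely at compile time
}

Because greet() is resolved at compile time, a missing or misnamed name() in the derived class shows up as a compile-time error rather than as a runtime failure of virtual dispatch.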
In the case of PyTorch, BaseDataset (the superclass of Dataset) uses this technique heavily to support operations such as mapping and filtering:
template <typename TransformType>
MapDataset<Self, TransformType> map(TransformType transform) & {
  return datasets::map(static_cast<Self&>(*this), std::move(transform));
}
Note the static cast of this to the derived type (legal as long as CRTP is properly applied); datasets::map constructs a MapDataset object which is also parametrized by the dataset type, allowing the MapDataset implementation to statically call methods such as get_batch (or encounter a compile-time error if they do not exist).
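To see how this looks from the user's side, here is a hedged sketch of a typical custom dataset written against the C++ frontend data API (the tensor shapes and the dataset size are placeholders invented for this example); calling .map() on it goes through the CRTP member shown above, so the resulting MapDataset is parameterized by CustomDataset:

#include <torch/torch.h>

// A minimal custom dataset; shapes and size are illustrative placeholders.
class CustomDataset : public torch::data::datasets::Dataset<CustomDataset> {
 public:
  // Return one (data, target) example for the given index.
  torch::data::Example<> get(size_t index) override {
    return {torch::randn({3}), torch::tensor(static_cast<int64_t>(index))};
  }

  // The number of examples in the dataset.
  torch::optional<size_t> size() const override {
    return 100;
  }
};

// map() statically casts *this to CustomDataset& and builds a
// MapDataset<CustomDataset, Stack<...>> -- no virtual dispatch is needed.
auto make_dataset() {
  return CustomDataset().map(torch::data::transforms::Stack<>());
}

If CustomDataset were missing a method that MapDataset expects, the failure would surface as a compile-time error, exactly as described above.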
Furthermore, since MapDataset receives the custom dataset type as a type parameter, compile-time metaprogramming is possible:
/// The implementation of `get_batch()` for the stateless case, which simply
/// applies the transform to the output of `get_batch()` from the dataset.
template <
    typename D = SourceDataset,
    typename = torch::disable_if_t<D::is_stateful>>
OutputBatchType get_batch_impl(BatchRequestType indices) {
  return transform_.apply_batch(dataset_.get_batch(std::move(indices)));
}

/// The implementation of `get_batch()` for the stateful case. Here, we follow
/// the semantics of `Optional.map()` in many functional languages, which
/// applies a transformation to the optional's content when the optional
/// contains a value, and returns a new optional (of a different type) if the
/// original optional returned by `get_batch()` was empty.
template <typename D = SourceDataset>
torch::enable_if_t<D::is_stateful, OutputBatchType> get_batch_impl(
    BatchRequestType indices) {
  if (auto batch = dataset_.get_batch(std::move(indices))) {
    return transform_.apply_batch(std::move(*batch));
  }
  return nullopt;
}
Notice that the conditional enable depends on SourceDataset, which we only have available because the dataset is parametrized with this CRTP pattern.
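As a standalone illustration of that idea, here is a minimal sketch using only the standard library (the StatelessSource/StatefulSource types and the is_stateful flag are invented for this example, not PyTorch code), showing how enable_if-style dispatch selects an implementation based on a compile-time property of the type parameter:

#include <iostream>
#include <type_traits>

struct StatelessSource { static constexpr bool is_stateful = false; };
struct StatefulSource  { static constexpr bool is_stateful = true; };

template <typename SourceDataset>
struct Wrapper {
  // Selected at compile time when SourceDataset::is_stateful is false.
  template <typename D = SourceDataset,
            std::enable_if_t<!D::is_stateful, int> = 0>
  void get_batch_impl() {
    std::cout << "stateless path\n";
  }

  // Selected at compile time when SourceDataset::is_stateful is true.
  template <typename D = SourceDataset,
            std::enable_if_t<D::is_stateful, int> = 0>
  void get_batch_impl() {
    std::cout << "stateful path\n";
  }
};

int main() {
  Wrapper<StatelessSource>{}.get_batch_impl();  // prints "stateless path"
  Wrapper<StatefulSource>{}.get_batch_impl();   // prints "stateful path"
}

MapDataset does essentially the same thing with torch::enable_if_t/torch::disable_if_t keyed off SourceDataset::is_stateful, as in the snippet quoted above.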