Given the following tensor (which is the result of a network [note the grad_fn]):
tensor([121., 241., 125., 1., 108., 238., 125., 121., 13., 117., 121., 229.,
        161., 13., 0., 202., 161., 121., 121., 0., 121., 121., 242., 125.],
       grad_fn=<MvBackward>)
Which we will define as:
xx = torch.tensor([121., 241., 125., 1., 108., 238., 125., 121., 13., 117., 121., 229.,
                   161., 13., 0., 202., 161., 121., 121., 0., 121., 121., 242., 125.]).requires_grad_(True)
I would like to define an operation which counts the number of occurrences of each value in such a way that the operation will output the following tensor:
tensor([2, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
        0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
        0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
        0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
        0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0,
        0, 7, 0, 0, 0, 3, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
        0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2, 0, 0, 0, 0, 0, 0,
        0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
        0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
        0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0,
        0, 1, 1])
i.e. there are 2 zeros, 1 one, 2 thirteens, etc. The total number of possible values is set upstream, but in this example it is 243.
So far I have tried the following approaches, which successfully produce the desired tensor, but do not do so in a way that allows computing gradients back through the network:
tt = []
for i in range(243):
    tt.append((xx == i).unsqueeze(0))  # (xx == i) returns a bool tensor with no grad_fn
torch.cat(tt, dim=0).sum(dim=1)
tvtensor = torch.tensor([i for i in range(243)]).unsqueeze(1).repeat(1,xx.shape[0]).float().requires_grad_(True)
(xx==tvtensor).sum(dim=1)
EDIT: Added attempt -- didn't really expect this to backprop, but figured I would give it a try anyway:
ll = torch.zeros((1, 243))
for x in xx:
    ll[0, x.long()] += 1  # .long() casts to an integer index, which severs the graph
Any help is appreciated.
EDIT: As requested, the end goal of this is the following:

I am using a technique for calculating structural similarity between two time sequences, one real and one generated. The technique is outlined in this paper (https://link.springer.com/chapter/10.1007/978-3-642-02279-1_33): a time series is converted to a sequence of code words, and the distribution of code words (similar to the way Bag of Words is used in NLP) represents the time series. Two series are considered similar when their code-word distributions are similar. This is what the counting-statistics tensor is for.

What is desired is to construct a loss function which consumes this tensor and measures the distance between the distributions of the real and generated signals (the Euclidean norm on the time-domain data directly does not work well, and this approach claimed better results), so that it can update the generator appropriately.
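For concreteness, a non-differentiable version of the intended loss would look roughly like this (torch.bincount, the random stand-in signals, and the L1 distance are just an illustration of the idea, not my actual pipeline):

import torch

# Stand-in codeword sequences; in practice these come from the two signals.
real = torch.randint(0, 243, (24,))
fake = torch.randint(0, 243, (24,))
# Hard histograms, normalized to distributions. torch.bincount produces
# exactly the counting tensor above, but it is not differentiable.
h_real = torch.bincount(real, minlength=243).float() / real.numel()
h_fake = torch.bincount(fake, minlength=243).float() / fake.numel()
distance = (h_real - h_fake).abs().sum()  # e.g. an L1 distance between distributions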
I would do it with the unique method (only to count occurrences): if you want to count the occurrences, you have to add the parameter return_counts=True. I did it in version 1.3.1.
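A minimal illustration, with a shortened input for readability:

import torch

xx = torch.tensor([121., 241., 125., 1., 13., 13., 0., 0.])
values, counts = torch.unique(xx, return_counts=True)
print(values)  # tensor([  0.,   1.,  13., 121., 125., 241.])
print(counts)  # tensor([2, 1, 2, 1, 1, 1])

Note that unique only returns counts for values that actually occur; to get the fixed-length tensor of 243 counts from the question you would still have to scatter these counts into a zero tensor, or use torch.bincount(xx.long(), minlength=243).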
This is the fast way to count occurrences; however, it is a non-differentiable operation, so it is not suitable here (anyway, I have described the way to count occurrences). To do what you want, I think you should turn the input into a distribution by means of a differentiable function (softmax is the most used) and then use some way to measure the distance between the two distributions (output and target), such as cross-entropy, KL (Kullback-Leibler), JS, or Wasserstein.
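A sketch of that idea (the helper soft_distribution, the temperature, and the distance-based logits are my own choices here, not the only option):

import torch
import torch.nn.functional as F

def soft_distribution(x, num_values=243, temperature=1.0):
    # Soft-assign every sample to every candidate value: the closer a
    # sample is to a value, the larger its softmax weight for that bin.
    values = torch.arange(num_values, dtype=x.dtype, device=x.device)
    logits = -torch.abs(x.unsqueeze(1) - values.unsqueeze(0)) / temperature
    probs = F.softmax(logits, dim=1)      # shape (n_samples, num_values)
    return probs.sum(dim=0) / x.shape[0]  # normalized, sums to 1

p_fake = soft_distribution(xx)       # xx from the question
p_real = soft_distribution(xx_real)  # xx_real: code words of the real signal (assumed given)
loss = F.kl_div(p_fake.log(), p_real, reduction='sum')
loss.backward()  # gradients now flow back into xx

As the temperature goes to zero, the soft assignments approach the hard counts from the question, at the cost of vanishing gradients.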
You will not be able to do that, as unique is simply a non-differentiable operation. Furthermore, only floating-point tensors can have a gradient, since the gradient is defined only over the real-number domain, not for integers.
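For example, this fails (the exact error text varies between PyTorch versions):

import torch

torch.tensor([1, 2, 3]).requires_grad_(True)
# RuntimeError: only Tensors of floating point dtype can require gradients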
Still, there might be another, differentiable way to do what you want to achieve, but that's a different question.