I have been through the official doc and this but it is hard to understand what is going on. I am trying to understand a DQN source code and it uses the gather function on line 197. Could someone explain in simple terms what the gather function does? What is the purpose of that function?

<code>torch.gather</code> creates a new tensor from the input tensor by taking the values from each row along the input dimension <code>dim</code>. The values in <code>torch.LongTensor</code>, passed as <code>index</code>, specify which value to take from each 'row'. The dimension of the output tensor is same as the dimension of index tensor. Following illustration from the official docs explains it more clearly: <img src="https://i.stack.imgur.com/nudGq.png" alt="Pictoral representation from the docs"> (Note: In the illustration, indexing starts from 1 and not 0). In first example, the dimension given is along rows (top to bottom), so for (1,1) position of <code>result</code>, it takes row value from the <code>index</code> for the <code>src</code> that is <code>1</code>. At (1,1) in source value is <code>1</code> so, outputs <code>1</code> at (1,1) in <code>result</code>. Similarly for (2,2) the row value from the index for <code>src</code> is <code>3</code>. At (3,2) the value in <code>src</code> is <code>8</code> and hence outputs <code>8</code> and so on. Similarly for second example, indexing is along columns, and hence at (2,2) position of the <code>result</code>, the column value from the index for <code>src</code> is <code>3</code>, so at (2,3) from <code>src</code> ,<code>6</code> is taken and outputs to <code>result</code> at (2,2)

What does the gather function do in pytorch in layman terms?

1 Answers

torch.gather creates a new tensor from the input tensor by taking the values from each row along the input dimension dim. The values in torch.LongTensor, passed as index, specify which value to take from each 'row'. The dimension of the output tensor is same as the dimension of index tensor. Following illustration from the official docs explains it more clearly: Pictoral representation from the docs

(Note: In the illustration, indexing starts from 1 and not 0).

In first example, the dimension given is along rows (top to bottom), so for (1,1) position of result, it takes row value from the index for the src that is 1. At (1,1) in source value is 1 so, outputs 1 at (1,1) in result. Similarly for (2,2) the row value from the index for src is 3. At (3,2) the value in src is 8 and hence outputs 8 and so on.

Similarly for second example, indexing is along columns, and hence at (2,2) position of the result, the column value from the index for src is 3, so at (2,3) from src ,6 is taken and outputs to result at (2,2)

136

answered Sep 28 '22 00:09

Ritesh

Related questions
                            
                                Pytorch reshape tensor dimension
                            
                                How to do gradient clipping in pytorch?
                            
                                PyTorch: How to get the shape of a Tensor as a list of int
                            
                                PyTorch: How to use DataLoaders for custom Datasets
                            
                                Convert Pandas dataframe to PyTorch tensor?
                            
                                Convert PyTorch tensor to python list
                            
                                PyTorch: How to change the learning rate of an optimizer at any given moment (no LR schedule)
                            
                                How to load a list of numpy arrays to pytorch dataset loader?
                            
                                What does the parameter retain_graph mean in the Variable's backward() method?
                            
                                Data Augmentation in PyTorch
                            
                                Adding L1/L2 regularization in PyTorch?
                            
                                Understanding torch.nn.Parameter
                            
                                How do I split a custom dataset into training and test datasets?
                            
                                What's the difference between torch.stack() and torch.cat() functions?
                            
                                What does "unsqueeze" do in Pytorch?
                            
                                How to do product of matrices in PyTorch
                            
                                What's the difference between "hidden" and "output" in PyTorch LSTM?
                            
                                pytorch - connection between loss.backward() and optimizer.step()
                            
                                Pytorch tensor to numpy array
                            
                                RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What does the gather function do in pytorch in layman terms?

Tags:

pytorch

amitection

People also ask

1 Answers

Ritesh

Recent Activity

Donate For Us