The function torch.nn.functional.softmax takes two parameters: input and dim. According to its documentation, the softmax operation is applied to all slices of input along the specified dim, and will rescale them so that the elements lie in the range (0, 1) and sum to 1.
Let input be:
input = torch.randn((3, 4, 5, 6))
Suppose I want the following, so that every entry in that array is 1:
sum = torch.sum(input, dim = 3) # sum's size is (3, 4, 5)
How should I apply softmax?
softmax(input, dim = 0) # Way Number 0
softmax(input, dim = 1) # Way Number 1
softmax(input, dim = 2) # Way Number 2
softmax(input, dim = 3) # Way Number 3
My intuition tells me it is the last one, but I am not sure, since English is not my first language and the use of the word "along" seemed confusing to me.
I am not very clear on what "along" means, so I will use an example that could clarify things: suppose we have a tensor of size (s1, s2, s3, s4), and I want the entries to sum to 1 along the last dimension (the one of size s4) after applying softmax.
softmax(x, dim=-1)
The dim argument is required unless your input tensor is a vector. It specifies the axis along which to apply the softmax activation. Passing dim=-1 applies softmax to the last dimension, so after this the elements of the last dimension will sum to 1.
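For instance, a quick sanity check on the question's tensor (this snippet is only an illustrative sketch, not part of the answer):
import torch
import torch.nn.functional as F

input = torch.randn((3, 4, 5, 6))
out = F.softmax(input, dim=-1)  # for a 4-D tensor, dim=-1 is the same as dim=3
print(out.sum(dim=-1))          # shape (3, 4, 5); every entry is (numerically) 1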
The softmax function is used as the activation function in the output layer of neural network models that predict a multinomial probability distribution. That is, softmax is used as the activation function for multi-class classification problems where class membership is required on more than two class labels.
The main purpose of the softmax function is to transform the (unnormalised) output of K units of a fully-connected layer (e.g. represented as a vector of K elements) into a probability distribution (a normalised output), often represented as a vector of K elements, each of which lies between 0 and 1 and which together sum to 1.
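As a rough illustration of that use (a minimal sketch with made-up layer sizes, not taken from any answer above):
import torch
import torch.nn as nn
import torch.nn.functional as F

fc = nn.Linear(10, 3)            # hypothetical output layer: 10 features in, K = 3 classes
x = torch.randn(5, 10)           # a batch of 5 examples
logits = fc(x)                   # unnormalised outputs, shape (5, 3)
probs = F.softmax(logits, dim=1) # each row becomes a probability distribution
print(probs.sum(dim=1))          # every row sums to (numerically) 1
Note that for training, nn.CrossEntropyLoss expects the raw logits and applies log-softmax internally, so the explicit softmax is usually only needed at inference time.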
Steven's answer is not correct. See the snapshot below; it is actually the other way around.
Image transcribed as code:
>>> x = torch.tensor([[1, 2], [3, 4]], dtype=torch.float)
>>> F.softmax(x, dim=0)
tensor([[0.1192, 0.1192],
        [0.8808, 0.8808]])
>>> F.softmax(x, dim=1)
tensor([[0.2689, 0.7311],
        [0.2689, 0.7311]])
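In other words, dim=0 normalises down each column and dim=1 normalises across each row. A quick check of that (added here for illustration, not part of the transcribed image):
import torch
import torch.nn.functional as F

x = torch.tensor([[1, 2], [3, 4]], dtype=torch.float)
print(F.softmax(x, dim=0).sum(dim=0))  # each column sums to 1
print(F.softmax(x, dim=1).sum(dim=1))  # each row sums to 1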