I have a byte tensor of integer class labels, e.g. from the MNIST data set. <pre class="prettyprint"><code> 1 7 5 [torch.ByteTensor of size 3] </code></pre> How do use it to create a tensor of 1-hot vectors? <pre class="prettyprint"><code> 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 [torch.DoubleTensor of size 3x10] </code></pre> I know I could do this with a loop, but I'm wondering if there's any clever Torch indexing that will get it for me in a single line.

An alternate method is to shuffle rows from an identity matrix: <pre class="prettyprint"><code>indicies = torch.LongTensor{1,7,5} one_hot = torch.eye(10):index(1, indicies) </code></pre> This was not my idea, I found it in karpathy/char-rnn.

<pre class="prettyprint"><code>indices = torch.LongTensor{1,7,5}:view(-1,1) one_hot = torch.zeros(3, 10) one_hot:scatter(2, indices, 1) </code></pre> You can find the documentation for <code>scatter</code> in the torch/torch7 github readme (in the master branch).

In Torch how do I create a 1-hot tensor from a list of integer labels?

Tags:

indexing

one-hot-encoding

torch

I have a byte tensor of integer class labels, e.g. from the MNIST data set.

 1
 7
 5
[torch.ByteTensor of size 3]

How do use it to create a tensor of 1-hot vectors?

 1  0  0  0  0  0  0  0  0  0
 0  0  0  0  0  0  1  0  0  0
 0  0  0  0  1  0  0  0  0  0
[torch.DoubleTensor of size 3x10]

I know I could do this with a loop, but I'm wondering if there's any clever Torch indexing that will get it for me in a single line.

694

asked Aug 14 '15 15:08

W.P. McNeill

2 Answers

An alternate method is to shuffle rows from an identity matrix:

indicies = torch.LongTensor{1,7,5}
one_hot = torch.eye(10):index(1, indicies)

This was not my idea, I found it in karpathy/char-rnn.

answered Sep 23 '22 14:09

Tarquinnn

indices = torch.LongTensor{1,7,5}:view(-1,1)
one_hot = torch.zeros(3, 10)
one_hot:scatter(2, indices, 1)

You can find the documentation for scatter in the torch/torch7 github readme (in the master branch).

answered Sep 20 '22 14:09

smhx

Related questions
                            
                                How to avoid Deadlock between Insert/Delete statements due to non clustered non unique indexes!
                            
                                SQL Index on Multiple tables, can it be done?
                            
                                Declaring indexes together or separately, what is the difference?
                            
                                NumPy 2D array: selecting indices in a circle
                            
                                Search of Dictionary Keys python
                            
                                Any easy way to tell if mongodb indexes are still being used or not?
                            
                                How can I access MySQL InnoDB index values directly without the MySQL client?
                            
                                Why are all indexes in Rust of type usize?
                            
                                MySQL Query does not use index in table join
                            
                                Role of selectivity in index scan/seek
                            
                                Why is MySQL query using join buffer?
                            
                                fast way to get index of top-k elements of every column in a pandas dataframe
                            
                                Pandas groupby and filter
                            
                                Index not used when LIMIT is used in postgres
                            
                                Index already exists with different options error while using createIndex() in latest MongoDB java driver
                            
                                How to Create Unique Index for Existing table in MySQL which contains Records
                            
                                How to index a string array column for pg_trgm `'term' % ANY (array_column)` query?
                            
                                Non-Clustered Index on a Clustered Index column improves performance?
                            
                                Does creating a nonclustered index on a SQL Server 2005 table prevent selects?
                            
                                Unable to add files with name containing tilde, '~' followed by a number

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With