I am trying to learn how to build an RNN for speech recognition using TensorFlow. As a start, I wanted to try out some of the example models put up on the TensorFlow page, TF-RNN.
As advised there, I had taken some time to understand how word IDs are embedded into a dense representation (a vector representation) by working through the basic version of the word2vec model code. I had an understanding of what tf.nn.embedding_lookup actually does, until I encountered the same function being used with a two-dimensional array of ids in TF-RNN ptb_word_lm.py, at which point it no longer made sense.
What tf.nn.embedding_lookup does: given a 2-d array params and a 1-d array ids, the function fetches the rows of params corresponding to the indices given in ids, which is consistent with the dimension of the output it returns.
When tried with the same params and a 2-d array ids, tf.nn.embedding_lookup returns a 3-d array instead of a 2-d one, and I do not understand why.
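Roughly what I tried, as a minimal sketch with made-up values (TF 1.x style):

import numpy as np
import tensorflow as tf

# A made-up "embedding matrix": 5 words, embedding_size 3.
params = tf.constant(np.arange(15, dtype=np.float32).reshape(5, 3))

lookup_1d = tf.nn.embedding_lookup(params, [0, 2])            # 1-d ids
lookup_2d = tf.nn.embedding_lookup(params, [[0, 2], [1, 4]])  # 2-d ids

with tf.Session() as sess:
    print(sess.run(lookup_1d).shape)  # (2, 3)    -> 2-d, as I expected
    print(sess.run(lookup_2d).shape)  # (2, 2, 3) -> 3-d, which is what confuses me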
I looked up the manual for Embedding Lookup, but I still find it difficult to understand how the partitioning works and what result is returned. I recently tried a simple example with tf.nn.embedding_lookup and it appears to return different values each time. Is this behaviour due to the randomness involved in partitioning?
Please help me understand how tf.nn.embedding_lookup works, and why it is used in both word2vec_basic.py and ptb_word_lm.py, i.e., what is the purpose of using it at all?
In an actual implementation of the embedding matrix (layer), we simply look up a vector inside the matrix based on the word index. As such, we sometimes call such an operation “embedding lookup”. A word index is a unique integer representing the word inside the vocabulary. The output is a vector of continuous values.
An embedding is a dense vector of floating point values (the length of the vector is a parameter you specify). Instead of specifying the values for the embedding manually, they are trainable parameters (weights learned by the model during training, in the same way a model learns weights for a dense layer).
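As a rough sketch of that idea (TF 1.x style, with hypothetical sizes), the embedding layer is just a trainable matrix and the lookup is an index into its rows:

import tensorflow as tf

vocab_size, embedding_size = 10000, 128  # hypothetical sizes

# Trainable embedding matrix: one dense row vector per word in the vocabulary.
embeddings = tf.Variable(
    tf.random_uniform([vocab_size, embedding_size], -1.0, 1.0))

word_ids = tf.placeholder(tf.int32, shape=[None])            # a batch of word indices
word_vectors = tf.nn.embedding_lookup(embeddings, word_ids)  # shape [batch, embedding_size]

The rows of embeddings are then updated by the optimizer during training like any other weights.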
There is already an answer on what tf.nn.embedding_lookup does here.
When tried with the same params and a 2-d array ids, tf.nn.embedding_lookup returns a 3-d array instead of a 2-d one, and I do not understand why.
When you had a 1-d list of ids [0, 1], the function would return a list of embeddings [embedding_0, embedding_1], where embedding_0 is an array of shape embedding_size. For instance, the list of ids could be a batch of words.
Now you have a matrix of ids, or a list of lists of ids. For instance, you now have a batch of sentences, i.e. a batch of lists of words, i.e. a list of lists of words.
If your list of sentences is [[0, 1], [0, 3]] (sentence 1 is [0, 1], sentence 2 is [0, 3]), the function will compute a matrix of embeddings, which will be of shape [2, 2, embedding_size] and will look like:

[[embedding_0, embedding_1],
 [embedding_0, embedding_3]]
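A small sketch of that case (TF 1.x style, with a made-up embedding matrix):

import tensorflow as tf

embedding_size = 4
params = tf.random_uniform([5, embedding_size])  # 5-word vocabulary, random values

sentences = [[0, 1],   # sentence 1
             [0, 3]]   # sentence 2
embedded = tf.nn.embedding_lookup(params, sentences)

with tf.Session() as sess:
    print(sess.run(tf.shape(embedded)))  # [2, 2, 4] == [batch, sentence_length, embedding_size]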
Concerning the partition_strategy argument, you don't have to worry about it. Basically, it allows you to pass a list of embedding matrices as params instead of a single matrix, in case you have computational or memory constraints.
So, you could split your embedding matrix of shape [1000, embedding_size] into ten matrices of shape [100, embedding_size] and pass this list of Variables as params. The partition_strategy argument handles how the vocabulary (the 1000 words) is distributed among the 10 matrices.
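A hedged sketch of what that can look like (TF 1.x style, hypothetical shapes; partition_strategy can be 'mod' or 'div'):

import tensorflow as tf

embedding_size = 64
# Ten shards of shape [100, embedding_size], standing in for one [1000, embedding_size] matrix.
shards = [tf.Variable(tf.random_uniform([100, embedding_size]))
          for _ in range(10)]

ids = tf.constant([3, 250, 999])
# partition_strategy decides which shard holds which word id.
vectors = tf.nn.embedding_lookup(shards, ids, partition_strategy='mod')
# vectors has shape [3, embedding_size], the same as with a single un-split matrix.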