Keras- Embedding layer

Question

What does input_dim, output_dim and input_length mean in:

Embedding(input_dim, output_dim, input_length)

From the documentation I understand:

input_dim: int > 0. Size of the vocabulary
output_dim: int >= 0. Dimension of the dense embedding.
input_length: Length of input sequences

So, when my input is a word like google.com each character represented by an integer [5, 2, 2, 5, 8, 3, 4, 1, 2, 9] and maximum word length possible is 75. Maximum characters possible is 38. How should I decide the input_dim, output_dim and input_length?

Sam - Founder of AceAINow.com · Accepted Answer

It order to use words for natural language processing or machine learning tasks, it is necessary to first map them onto a continuous vector space, thus creating word vectors or word embeddings. The Keras Embedding layer is useful for constructing such word vectors.

input_dim : the vocabulary size. This is how many unique words are represented in your corpus.

output_dim : the desired dimension of the word vector. For example, if output_dim = 100, then every word will be mapped onto a vector with 100 elements, whereas if output_dim = 300, then every word will be mapped onto a vector with 300 elements.

input_length : the length of your sequences. For example, if your data consists of sentences, then this variable represents how many words there are in a sentence. As disparate sentences typically contain different number of words, it is usually required to pad your sequences such that all sentences are of equal length. The keras.preprocessing.pad_sequence method can be used for this (https://keras.io/preprocessing/sequence/).

In Keras, it is possible to either 1) use pretrained word vectors such as GloVe or word2vec representations, or 2) learn the word vectors as part of the training process. This blog post (https://blog.keras.io/using-pre-trained-word-embeddings-in-a-keras-model.html) offers a tutorial on how to use GloVe pretrained word vectors. For option 2, Keras will randomly initialize vectors as the default option, and then learn optimal word vectors during the training process.

Keras- Embedding layer

Tags:

python

deep-learning

keras

AKSHAYAA VAIDYANATHAN

1 Answers

Sam - Founder of AceAINow.com

Recent Activity

Donate For Us

Keras- Embedding layer

Tags:

python

deep-learning

keras

AKSHAYAA VAIDYANATHAN

1 Answers

Sam - Founder of AceAINow.com

Related questions

Recent Activity

Donate For Us