 

Keras embedding layers: how do they work?

I am starting to use Keras to build neural network models.

I have a classification problem where the features are discrete. To handle this case, the standard procedure consists of converting the discrete features into binary arrays with a one-hot encoding.

However, it seems that with Keras this step is not necessary, since one can simply use an Embedding layer to create a feature-vector representation of these discrete features.
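For example, a minimal sketch like the following (assuming the tf.keras API, with made-up sizes) seems to consume the integer codes directly:

import numpy as np
import tensorflow as tf

# a discrete feature with 4 possible values, mapped to 2-dimensional vectors (sizes made up)
emb = tf.keras.layers.Embedding(input_dim=4, output_dim=2)
codes = np.array([0, 1, 2, 1])   # integer codes of the feature, no one-hot needed
print(emb(codes).shape)          # (4, 2): one 2-dimensional vector per input value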

How are these embeddings performed?

My understanding is that, if the discrete feature f can assume k values, then an embedding layer creates a matrix with k columns. Every time I receive a value for that feature, say i, during the training phase, only the i-th column of the matrix will be updated.

Is my understanding correct?

asked Mar 13 '17 by Ulderique Demoitre


People also ask

What does embedding layer do?

The Embedding layer takes the integer-encoded vocabulary and looks up the embedding vector for each word-index. These vectors are learned as the model trains. The vectors add a dimension to the output array. The resulting dimensions are: (batch, sequence, embedding).

What is Keras embedding?

Keras Embedding Layer. Keras offers an Embedding layer that can be used for neural networks on text data. It requires that the input data be integer encoded, so that each word is represented by a unique integer. This data preparation step can be performed using the Tokenizer API also provided with Keras.

What is the difference between embedding and dense layer?

An embedding layer is faster because it is essentially a dense layer with simplifying assumptions: it just looks up the vector for each index. A Dense layer, by contrast, would treat the one-hot input as actual values to multiply against its weight matrix.

Does embedding layer get trained?

Most of the time when you use embeddings, you'll use them already trained and available - you won't be training them yourself. However, to understand what they are better, we'll mock up a dataset based on colour combinations, and learn the embeddings to turn a colour name into a location in both 2D and 3D space.


2 Answers

Suppose you have N objects that do not directly have a mathematical representation, for example words.

Since neural networks can only work with tensors, you need some way to translate those objects into tensors. The solution is a large matrix (the embedding matrix) that relates each object's index to its tensor representation:

object_index_1: vector_1
object_index_2: vector_2
...
object_index_n: vector_n

Selecting the vector of a specific object can be translated to a matrix product in the following way:

object_vector = M · v

where v is the one-hot vector that selects which word to translate, and M is the embedding matrix.

If we lay out the usual pipeline, it would be the following:

  1. We have a list of objects:
objects = ['cat', 'dog', 'snake', 'dog', 'mouse', 'cat', 'dog', 'snake', 'dog']

  2. We transform these objects into indices (first computing the unique objects):
unique = ['cat', 'dog', 'snake', 'mouse']   # list(dict.fromkeys(objects)) keeps first-seen order
objects_index = [0, 1, 2, 1, 3, 0, 1, 2, 1] # [unique.index(o) for o in objects]

  3. We transform these indices into one-hot vectors (remember there is a single 1 at the index position):
objects_one_hot = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 1, 0, 0],
     [0, 0, 0, 1], [1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 1, 0, 0]]
# [[int(i == x) for i in range(len(unique))] for x in objects_index]
# objects_one_hot is a 9x4 matrix (one row per object, one column per unique value)

  4. We create or use the embedding matrix:
import numpy as np
# M is a matrix of dim x 4, where dim is the number of dimensions you want the vectors to have.
# In this case dim = 2.
M = np.array([[1, 1], [1, 2], [2, 2], [3, 3]]).T   # or np.random.rand(2, 4)
# objects_vectors = the one-hot matrix projected through M
objects_vectors = [[1, 1], [1, 2], [2, 2], [1, 2],
    [3, 3], [1, 1], [1, 2], [2, 2], [1, 2]]        # M.dot(np.array(objects_one_hot).T).T

Normally the embedding matrix is learned during model training, so that the vectors adapt to best represent each object. With that, we have a mathematical representation of the objects!

As you have seen, we used a one-hot vector and then a matrix product. What this really does is select the column of M that represents that word.
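To make that concrete, here is a small check (repeating the arrays from above so it runs on its own) showing that indexing the columns of M gives the same vectors as the one-hot matrix product:

import numpy as np

M = np.array([[1, 1], [1, 2], [2, 2], [3, 3]]).T   # the 2x4 embedding matrix from above
objects_index = [0, 1, 2, 1, 3, 0, 1, 2, 1]

# Indexing the columns of M gives exactly the same result as the one-hot matrix product
print(M[:, objects_index].T)
# [[1 1] [1 2] [2 2] [1 2] [3 3] [1 1] [1 2] [2 2] [1 2]]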

During training, this M is adapted to improve the representation of the objects, and as a consequence the loss goes down.
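In Keras, all of the above collapses into a single Embedding layer whose weight matrix plays the role of M (stored transposed, one row per object) and is updated by the optimizer. A minimal training sketch, assuming tf.keras and made-up toy labels:

import numpy as np
import tensorflow as tf

objects_index = np.array([[0], [1], [2], [1], [3], [0], [1], [2], [1]])   # cat, dog, snake, ...
labels = np.array([0, 1, 1, 1, 0, 0, 1, 1, 1])                            # made-up toy targets

embedding = tf.keras.layers.Embedding(input_dim=4, output_dim=2)   # its 4x2 weight matrix plays the role of M
model = tf.keras.Sequential([
    tf.keras.Input(shape=(1,)),
    embedding,
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(1, activation='sigmoid'),
])
model.compile(optimizer='adam', loss='binary_crossentropy')

print(embedding.get_weights()[0])   # M before training: random initial values
model.fit(objects_index, labels, epochs=10, verbose=0)
print(embedding.get_weights()[0])   # M after training: the vectors have been adapted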

answered Sep 28 '22 by Adria Ciurana


The Embedding layer in Keras (and in general) is a way to create a dense word encoding. You can think of it as a matrix multiplication with a one-hot-encoded (OHE) matrix, or simply as a linear layer over an OHE matrix.

It is always used as the first layer of the model, attached directly to the input.

Sparse and dense word encodings differ in how efficiently they represent the data.

One-hot-encoding (OHE) is a sparse word encoding. For example, if we have 1000 input activations, each input feature is represented by a 1000-dimensional OHE vector containing a single 1.

Let's say we know some input activations are dependent, and we have 64 latent features. We would have this embedding:

e = Embedding(1000, 64, input_length=50)

Here 1000 means we plan to encode 1000 words in total, 64 means each word is mapped into a 64-dimensional vector space, and 50 means each input document has 50 words.
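To see this concretely, a small sketch (assuming the tf.keras import path and a random batch of integer-encoded documents, made up here) can be used to inspect the output:

import numpy as np
from tensorflow.keras.layers import Embedding

e = Embedding(1000, 64, input_length=50)
docs = np.random.randint(0, 1000, size=(32, 50))   # a batch of 32 documents, 50 word indices each
print(e(docs).shape)                               # (32, 50, 64): batch, sequence, embedding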

The embedding matrix is initialized with random non-zero values, and these parameters are then learned during training.

There are other parameters you can set when creating the Embedding layer; see the Keras documentation for the full list.

What is the output from the Embedding layer?

For each input document, the output of the Embedding layer is a 2D matrix with one embedding vector per word in the input sequence, so the full output tensor has shape (batch, sequence, embedding).

NOTE: If you wish to connect a Dense layer directly to an Embedding layer, you must first flatten the 2D output matrix to a 1D vector using the Flatten layer.
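For example, a minimal model along those lines might look like this (a sketch assuming tf.keras; the layer sizes are only for illustration):

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, Flatten, Dense

model = Sequential([
    Embedding(1000, 64, input_length=50),   # (batch, 50) integer indices -> (batch, 50, 64)
    Flatten(),                              # -> (batch, 50 * 64) = (batch, 3200)
    Dense(1, activation='sigmoid'),         # -> (batch, 1)
])
model.compile(optimizer='adam', loss='binary_crossentropy')
model.summary()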

answered Sep 28 '22 by prosti