
How to use additional features along with word embeddings in Keras?

I am training an LSTM model with Keras on a dataset that looks like the following. The variable "Description" is a text field, "Gender" is a categorical field, and "Age" is a continuous field.

Age, Gender, Description
22, M, "purchased a phone"
35, F, "shopping for kids"

I am using word embeddings to convert the text field into word vectors, which are then fed into the Keras model. The code is given below:

from keras.models import Sequential
from keras.layers import Embedding, LSTM, Dense, Dropout, Activation

model = Sequential()
# pre-trained, frozen 300-d word vectors; sequences padded to length 70
model.add(Embedding(word_index, 300, weights=[embedding_matrix], input_length=70, trainable=False))
model.add(LSTM(300, dropout=0.3, recurrent_dropout=0.3))
model.add(Dropout(0.6))
model.add(Dense(1))
model.add(Activation('sigmoid'))
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

This model runs successfully, but I also want to feed the "Age" and "Gender" variables in as features. What changes are required in the code to use these features?

asked Mar 08 '18 by userxxx

People also ask

Which function in Keras is used to add an embedding layer to the model?

Again, we can do this with a built-in Keras function, in this case the pad_sequences() function. We are now ready to define our Embedding layer as part of our neural network model. The Embedding layer has a vocabulary of 50 and an input length of 4.
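As a concrete illustration, here is a minimal sketch with made-up integer-encoded documents, matching the vocabulary of 50 and input length of 4 mentioned above:

from keras.preprocessing.sequence import pad_sequences
from keras.layers import Embedding

# three integer-encoded documents of uneven length
docs = [[4, 9, 2], [7, 1], [12, 5, 33, 6]]
padded = pad_sequences(docs, maxlen=4, padding='post')
# padded:
# [[ 4  9  2  0]
#  [ 7  1  0  0]
#  [12  5 33  6]]

# vocabulary of 50 words, 8-dimensional vectors, inputs of length 4
embedding = Embedding(input_dim=50, output_dim=8, input_length=4)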

Is an embedding layer a dense layer?

An embedding layer is faster because it is essentially the equivalent of a dense layer that makes simplifying assumptions: since the input is a one-hot vector, the matrix multiplication reduces to selecting a single row of the weight matrix, so the layer can be implemented as a table lookup. A Dense layer would treat the one-hot input as ordinary activations and perform the full matrix multiplication.
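A minimal runnable sketch of that equivalence (the layer sizes here are arbitrary):

import numpy as np
from keras.models import Sequential
from keras.layers import Embedding

vocab_size, emb_dim = 50, 8
model = Sequential()
model.add(Embedding(vocab_size, emb_dim, input_length=1))

token = np.array([[3]])                  # a single integer index
lookup = model.predict(token)[0, 0]      # embedding lookup of token 3

W = model.layers[0].get_weights()[0]     # the (50, 8) weight matrix
one_hot = np.zeros((1, vocab_size))
one_hot[0, 3] = 1.0
matmul = one_hot.dot(W)[0]               # one-hot vector times the same matrix

assert np.allclose(lookup, matmul)       # identical results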

Is embedding layer in Keras trainable?

We load this embedding matrix into an Embedding layer. Note that we set trainable=False to prevent the weights from being updated during training. An Embedding layer should be fed sequences of integers, i.e. a 2D input of shape (samples, indices).
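A minimal sketch of building and loading such a matrix; here word_index maps words to integer ids (as in the question) and pretrained is a hypothetical dictionary of 300-d word vectors (e.g. loaded from GloVe):

import numpy as np
from keras.layers import Embedding

vocab_size = len(word_index) + 1
embedding_matrix = np.zeros((vocab_size, 300))
for word, i in word_index.items():
    vector = pretrained.get(word)        # hypothetical word -> vector lookup
    if vector is not None:
        embedding_matrix[i] = vector

# frozen layer: the pre-trained weights are not updated during training
embedding = Embedding(vocab_size, 300,
                      weights=[embedding_matrix],
                      input_length=70,
                      trainable=False)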


2 Answers

You want to add more input layers, which is not possible with a Sequential model. You have to use the functional API instead:

from keras.models import Model

which allows you to have multiple inputs and indirect connections.

from keras.layers import Input, Embedding, LSTM, Dense, Dropout, Activation, Concatenate

# text branch: integer sequences of length 70 through frozen embeddings and an LSTM
nlp_input = Input(shape=(70,))
embed = Embedding(word_index, 300, weights=[embedding_matrix], input_length=70, trainable=False)(nlp_input)
lstm = LSTM(300, dropout=0.3, recurrent_dropout=0.3)(embed)

# auxiliary branch: a single scalar feature such as age
agei = Input(shape=(1,))

# merge the LSTM output with the auxiliary feature, then classify
conc = Concatenate()([lstm, agei])
drop = Dropout(0.6)(conc)
dens = Dense(1)(drop)
acti = Activation('sigmoid')(dens)

model = Model([nlp_input, agei], acti)
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

You cannot concatenate the extra features before the LSTM layer: the embedding layer outputs a 3D tensor of shape (samples, timesteps, features), while the auxiliary input is a 2D tensor, so the merge only makes sense after the LSTM has collapsed the time dimension into a single vector.
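For completeness, training this two-input model means passing one array per Input, in the same order as in the Model call. A minimal sketch, where X_text, X_age and y are hypothetical arrays:

# X_text: padded integer sequences, shape (samples, 70)
# X_age:  continuous feature, shape (samples, 1)
# y:      binary labels, shape (samples,)
model.fit([X_text, X_age], y, epochs=10, batch_size=32)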

answered Sep 17 '22 by Suba Selvandran

I wrote about how to do this in Keras. It is basically a functional multiple-input model, which concatenates both feature vectors like this:

from keras.models import Model
from keras.layers import Input, Embedding, LSTM, Bidirectional, Dense, concatenate

# seq_length, embedding_size and classifier_neurons are placeholders set elsewhere
# text branch: integer-encoded sequences through an embedding and a bidirectional LSTM
nlp_input = Input(shape=(seq_length,), name='nlp_input')
# auxiliary branch: 10 additional numeric features (e.g. age, encoded gender)
meta_input = Input(shape=(10,), name='meta_input')
emb = Embedding(output_dim=embedding_size, input_dim=100, input_length=seq_length)(nlp_input)
nlp_out = Bidirectional(LSTM(128))(emb)

# merge both feature vectors and classify
x = concatenate([nlp_out, meta_input])
x = Dense(classifier_neurons, activation='relu')(x)
x = Dense(1, activation='sigmoid')(x)
model = Model(inputs=[nlp_input, meta_input], outputs=[x])
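Since the inputs are named, the training data can also be passed as a dictionary keyed by those names. A minimal sketch, where X_seq, X_meta and y are hypothetical arrays matching the shapes above:

model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

# the keys match the name= arguments of the Input layers above
model.fit({'nlp_input': X_seq, 'meta_input': X_meta}, y, epochs=5, batch_size=32)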
answered Sep 20 '22 by ixeption