I am trying to write a sequence-to-sequence RNN in Keras. I coded this program using what I understood from the web. I first tokenized the text, then converted the text into sequences and padded them to form the feature variable X. The target variable Y was obtained by first shifting x to the left and then padding it. Lastly, I fed my feature and target variables to my LSTM model.
This is the code I have written in Keras for that purpose.
from keras.preprocessing.text import Tokenizer, base_filter
from keras.preprocessing.sequence import pad_sequences
from keras.models import Sequential
from keras.layers import Dense, Activation, Dropout, Embedding
from keras.layers import LSTM
def shift(seq, n):
    # rotate the sequence left by n positions
    n = n % len(seq)
    return seq[n:] + seq[:n]
txt="abcdefghijklmn"*100
tk = Tokenizer(nb_words=2000, filters=base_filter(), lower=True, split=" ")
tk.fit_on_texts(txt)
x = tk.texts_to_sequences(txt)
# shifting to the left
y = shift(x,1)
# padding the sequences
max_len = 100
max_features=len(tk.word_counts)
X = pad_sequences(x, maxlen=max_len)
Y = pad_sequences(y, maxlen=max_len)
# LSTM model
model = Sequential()
model.add(Embedding(max_features, 128, input_length=max_len, dropout=0.2))
model.add(LSTM(128, dropout_W=0.2, dropout_U=0.2))
model.add(Dense(max_len))
model.add(Activation('softmax'))
model.compile(loss='binary_crossentropy', optimizer='rmsprop')
model.fit(X, Y, batch_size=200, nb_epoch=10)
The problem is that it's showing this error:
Epoch 1/10
IndexError: index 14 is out of bounds for size 14
Apply node that caused the error: AdvancedSubtensor1(if{inplace}.0, Reshape{1}.0)
Toposort index: 80
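For reference, here is a minimal sketch that reproduces the same error, assuming Keras 1.x with the Theano backend (which the AdvancedSubtensor1 node in the trace suggests); the layer sizes are made up purely for the demonstration:

import numpy as np
from keras.models import Sequential
from keras.layers import Embedding

# An Embedding with input_dim=14 only maps indices 0..13, so feeding
# it index 14 raises the same out-of-bounds IndexError on Theano.
m = Sequential()
m.add(Embedding(14, 8, input_length=1))
m.compile(optimizer='rmsprop', loss='mse')
m.predict(np.array([[14]]))  # IndexError: index 14 is out of bounds for size 14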
The problem lies in:
model.add(Embedding(max_features, 128, input_length=max_len, dropout=0.2))
In the Embedding documentation you can see that the first argument should be set to the size of the vocabulary + 1. That's because there must always be a place for a null word, whose index is 0. Because of that, you need to change this line to:
model.add(Embedding(max_features + 1, 128, input_length=max_len, dropout=0.2))
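As a quick sanity check, a sketch like the following (written against the same Keras 1.x API the question uses; the sizes are only illustrative) shows why the + 1 matters: the Tokenizer assigns word indices starting from 1, so with max_features distinct words the largest index is max_features itself, and the Embedding layer must accept indices 0..max_features.

import numpy as np
from keras.models import Sequential
from keras.layers import Embedding

max_features = 14  # number of distinct words in the text
model = Sequential()
model.add(Embedding(max_features + 1, 8, input_length=4))  # accepts indices 0..14
model.compile(optimizer='rmsprop', loss='mse')

# index 0 is the null/padding word, index 14 is the highest word index
print(model.predict(np.array([[0, 1, 7, 14]])).shape)  # (1, 4, 8)

Index 0 is also what pad_sequences fills with, which is exactly the null word the documentation reserves.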