 

Long sequences in a seq2seq model with attention?

I am following along with this PyTorch tutorial and trying to apply the same principle to summarization, where the encoder input would be around 1000 words and the decoder target around 200 words.

How do I apply seq2seq to this? I know it would be very expensive and almost infeasible to run through the whole sequence of 1000 words at once. Dividing the sequence into, say, 20 subsequences and running them in parallel could be an answer, but I'm not sure how to implement it; I also want to incorporate attention into it.

asked Jun 04 '17 by vijendra rana


People also ask

What is the attention mechanism in the Seq2Seq model?

A Seq2Seq model with an attention mechanism consists of an encoder, a decoder, and an attention layer. The decoder decides which part of the source sentence it needs to pay attention to, instead of having the encoder compress all the information of the source sentence into a fixed-length vector.
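As a rough sketch of that idea in PyTorch (the module name `AdditiveAttention`, the single shared hidden size, and the tensor shapes are assumptions for illustration, not code from the tutorial), an additive attention layer scores each encoder step against the current decoder state and returns a weighted context:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdditiveAttention(nn.Module):
    """Scores every encoder step against the current decoder state."""
    def __init__(self, hidden_size):
        super().__init__()
        self.W_enc = nn.Linear(hidden_size, hidden_size, bias=False)
        self.W_dec = nn.Linear(hidden_size, hidden_size, bias=False)
        self.v = nn.Linear(hidden_size, 1, bias=False)

    def forward(self, decoder_state, encoder_outputs):
        # decoder_state:   (batch, hidden)
        # encoder_outputs: (batch, src_len, hidden)
        scores = self.v(torch.tanh(
            self.W_enc(encoder_outputs) + self.W_dec(decoder_state).unsqueeze(1)
        )).squeeze(-1)                          # (batch, src_len)
        weights = F.softmax(scores, dim=-1)     # attention over source steps
        context = torch.bmm(weights.unsqueeze(1), encoder_outputs).squeeze(1)
        return context, weights                 # (batch, hidden), (batch, src_len)
```

The decoder would call this at every output step and feed the returned context into its next prediction.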

How large is the context matrix in an attention Seq2Seq model?

The encoder processes all the inputs by transforming them into a single vector, called context (usually with a length of 256, 512, or 1024). The context contains all the information that the encoder was able to detect from the input (remember that the input is the sentence to be translated in this case).
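As a quick illustration of that fixed length (the GRU encoder, the 300-dimensional embeddings, and the hidden size of 512 below are assumptions, not values from the question), the encoder's final hidden state, i.e. the context, has the same shape whether the input is 10 or 1000 steps long:

```python
import torch
import torch.nn as nn

encoder = nn.GRU(input_size=300, hidden_size=512, batch_first=True)

short_input = torch.randn(1, 10, 300)     # a 10-token sentence (already embedded)
long_input = torch.randn(1, 1000, 300)    # a 1000-token document (already embedded)

_, ctx_short = encoder(short_input)
_, ctx_long = encoder(long_input)
print(ctx_short.shape, ctx_long.shape)    # both torch.Size([1, 1, 512])
```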

What is attention in RNNs?

Attention is a mechanism combined with an RNN that allows it to focus on certain parts of the input sequence when predicting a certain part of the output sequence, enabling easier and higher-quality learning.

What is attention in encoder decoder?

Attention is a powerful mechanism developed to enhance the performance of the Encoder-Decoder architecture on neural network-based machine translation tasks.


1 Answer

You cannot parallelize an RNN over time (the 1000 steps here) because RNNs are inherently sequential.
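To see why, here is what a hand-unrolled recurrent loop looks like (a minimal sketch with made-up shapes, using `nn.GRUCell`): each step needs the hidden state from the previous step, so the 1000 steps cannot be computed simultaneously.

```python
import torch
import torch.nn as nn

cell = nn.GRUCell(input_size=300, hidden_size=512)
inputs = torch.randn(1000, 1, 300)   # (seq_len, batch, input_size), already embedded
h = torch.zeros(1, 512)

for x_t in inputs:                   # step t needs h from step t-1,
    h = cell(x_t, h)                 # so this loop is inherently serial
```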

You can use a lightweight RNN, something like QRNN or SRU, as a faster alternative (which is still sequential).

Other common sequence-processing modules are TCNs and Transformers, which are both parallelizable over time.
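As a minimal sketch of the Transformer option using PyTorch's built-in `nn.TransformerEncoder` (the hyperparameters are arbitrary, and a real model would also need an embedding layer and positional encodings):

```python
import torch
import torch.nn as nn

# Self-attention looks at all 1000 positions at once,
# so the whole sequence is processed in parallel on the GPU.
layer = nn.TransformerEncoderLayer(d_model=512, nhead=8)
encoder = nn.TransformerEncoder(layer, num_layers=4)

src = torch.randn(1000, 1, 512)   # (seq_len, batch, d_model), already embedded
memory = encoder(src)             # (1000, 1, 512) states for a decoder to attend over
```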

Also, note that all of them can be used with attention and work perfectly fine with text.

answered Oct 13 '22 by Separius