TensorFlow: getting all states from a RNN

Tags: python, machine-learning, tensorflow, deep-learning

How do you get all the hidden states from tf.nn.rnn() or tf.nn.dynamic_rnn() in TensorFlow? The API only gives me the final state.

The first alternative would be to write a loop when building a model that operates directly on RNNCell. However, the number of timesteps is not fixed for me, and depends on the incoming batch.

Some options are to either use a GRU or to write my own RNNCell that concatenates the state to the output. The former choice isn't general enough and the latter sounds too hacky.

Another option is to do something like the answers in this question, getting all the variables from an RNN. However, I'm not sure how to separate the hidden states from other variables in a standard fashion here.

Is there a nice way to get all the hidden states from an RNN while still using the library-provided RNN APIs?

asked Sep 27 '16 04:09

Ankit Vani

1 Answer

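tf.nn.dynamic_rnn (and also tf.nn.static_rnn) returns two values, "outputs" and "state" (https://www.tensorflow.org/api_docs/python/tf/nn/dynamic_rnn).

As you said, "state" is the final state of the RNN, but "outputs" holds the cell's output at every time step. For tf.nn.dynamic_rnn with the default time_major=False, its shape is [batch_size, max_time, cell.output_size]; tf.nn.static_rnn instead returns a Python list with one [batch_size, cell.output_size] tensor per time step. If you pass sequence_length, the outputs past each sequence's length are zeros, so a variable number of timesteps per batch is not a problem.

You can use "outputs" as the hidden states of the RNN, because in most library-provided RNNCells the per-step output and the state are the same tensor. The exception is the LSTM cells: their state is by default an LSTMStateTuple (c, h), and "outputs" contains only h, not the cell state c.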
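To make the relationship between "outputs" and "state" concrete, here is a minimal framework-free sketch in plain Python. It is not TensorFlow code: `rnn_cell`, `unroll`, and the scalar weights are hypothetical names and values chosen for illustration. It only assumes what is described above, namely a cell that returns the same value as output and as new state, and an unroll loop that zeroes outputs on padded steps while carrying the state through.

```python
import math


def rnn_cell(x, h, w_x=0.5, w_h=0.8):
    """Toy scalar tanh cell (hypothetical weights).

    Returns (output, new_state); both are the same value, which is the
    property the answer relies on for the basic and GRU cells.
    """
    new_h = math.tanh(w_x * x + w_h * h)
    return new_h, new_h


def unroll(cell, sequences, lengths, h0=0.0):
    """Run `cell` over a padded batch laid out as sequences[b][t].

    Returns (outputs, final_states):
      outputs[b][t]   is the output at step t, or 0.0 on padded steps
      final_states[b] is the state after the last valid step
    """
    outputs, final_states = [], []
    for seq, length in zip(sequences, lengths):
        h = h0
        row = []
        for t, x in enumerate(seq):
            if t < length:
                out, h = cell(x, h)
            else:
                out = 0.0  # padded step: zero output, state carried through
            row.append(out)
        outputs.append(row)
        final_states.append(h)
    return outputs, final_states


# Two sequences padded to max_time = 3; the second has only one valid step.
batch = [[1.0, -0.5, 0.25], [0.3, 0.0, 0.0]]
lengths = [3, 1]
outputs, final_states = unroll(rnn_cell, batch, lengths)

for b, n in enumerate(lengths):
    # The output at the last valid step is exactly the final state.
    assert outputs[b][n - 1] == final_states[b]
    print(b, outputs[b], final_states[b])
```

In this sketch `outputs` plays the role of the [batch_size, max_time, output_size] tensor and `final_states` the role of "state": every hidden state is already present in `outputs`, and the final state is just its last valid entry.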

Basic https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/ops/rnn_cell_impl.py#L347
GRU https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/ops/rnn_cell_impl.py#L441
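
For the LSTM exception, the sketch below shows why "outputs" alone is not the full state, and how the wrapper idea mentioned in the question (a cell that emits its state as its output) would recover it. This is again plain Python under stated assumptions, not TensorFlow's implementation: `lstm_cell`, `expose_state`, `unroll`, and the gate arithmetic are hypothetical and simplified to scalars.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def lstm_cell(x, state):
    """Toy scalar LSTM-style cell with arbitrary fixed gate biases.

    The state is a (c, h) pair, but the output is only h.
    """
    c, h = state
    z = x + h
    i, f, o = sigmoid(z), sigmoid(z + 1.0), sigmoid(z - 1.0)
    new_c = f * c + i * math.tanh(z)
    new_h = o * math.tanh(new_c)
    return new_h, (new_c, new_h)


def expose_state(cell):
    """Wrap a cell so that its per-step output is the full state."""
    def wrapped(x, state):
        _, new_state = cell(x, state)
        return new_state, new_state
    return wrapped


def unroll(cell, inputs, state):
    """Apply `cell` step by step, collecting every per-step output."""
    outputs = []
    for x in inputs:
        out, state = cell(x, state)
        outputs.append(out)
    return outputs, state


inputs = [1.0, -0.5, 0.25]
h_only, final = unroll(lstm_cell, inputs, (0.0, 0.0))
full, final_wrapped = unroll(expose_state(lstm_cell), inputs, (0.0, 0.0))

# Plain cell: outputs hold h at every step, c is visible only at the end.
assert h_only[-1] == final[1]
# Wrapped cell: outputs hold (c, h) at every step.
assert [h for _, h in full] == h_only
assert full[-1] == final
```

If only h is needed at every step, the unwrapped outputs are enough; the wrapper is worth the extra code only when the cell state c is needed per step as well.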

answered Sep 20 '22 16:09

Junyeop Lee
