The TensorFlow docs describe several ways to read data using TFRecordReader, TextLineReader, QueueRunner, and queues.
What I would like to do is much, much simpler: I have a python generator function that produces an infinite sequence of training data as (X, y) tuples (both are numpy arrays, and the first dimension is the batch size). I just want to train a network using that data as inputs.
Is there a simple self-contained example of training a TensorFlow network using a generator which produces the data? (along the lines of the MNIST or CIFAR examples)
The TensorFlow docs describe three main ways of getting data into a program:

- **Feeding**: Python code provides the data when running each step.
- **Reading from files**: an input pipeline reads the data from files at the beginning of a TensorFlow graph.
- **Preloaded data**: a constant or variable in the TensorFlow graph holds all the data (for small data sets).
Suppose you have a function that generates data:
```python
def generator(data):
    ...
    yield (X, y)
```
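As a concrete sketch of such a generator (the names `batch_generator`, `batch_size`, and the random-sampling strategy are illustrative choices, not anything the question specifies), one way to produce an infinite stream of NumPy batches is:

```python
import numpy as np

def batch_generator(X_data, y_data, batch_size):
    """Yield an endless stream of (X, y) mini-batches sampled from the data."""
    n = X_data.shape[0]
    while True:  # infinite sequence, as the question describes
        idx = np.random.randint(0, n, size=batch_size)
        yield X_data[idx], y_data[idx]
```

Because it is a plain Python generator, you can pull batches from it with `next()` or iterate over it directly in the training loop shown below.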
Now you need another function that describes your model architecture. It can be any function that processes X and predicts y as output (say, a neural network).
Suppose your function accepts X and y as inputs, computes a prediction for y from X in some way, and returns the loss (e.g. cross-entropy, or MSE in the case of regression) between y and the predicted y:
```python
def neural_network(X, y):
    # computation of prediction for y using X
    ...
    return loss(y, y_pred)
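To make "returns the loss between y and the predicted y" concrete, here is a minimal NumPy illustration using a hypothetical linear model with MSE loss (`W` and `b` are assumed parameters; in TensorFlow they would be `tf.Variable` objects and the same arithmetic would build graph ops instead of computing values directly):

```python
import numpy as np

def linear_model_loss(X, y, W, b):
    """Illustration only: a linear prediction plus mean-squared-error loss."""
    y_pred = X @ W + b                  # prediction for y using X
    return np.mean((y - y_pred) ** 2)   # MSE between y and predicted y
```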
To make your model work, you need to define placeholders for both X and y and then run a session:
```python
X = tf.placeholder(tf.float32, shape=(batch_size, x_dim))
y = tf.placeholder(tf.float32, shape=(batch_size, y_dim))
```
Placeholders are something like "free variables" which you need to specify via `feed_dict` when running the session:
```python
with tf.Session() as sess:
    # variables need to be initialized before any sess.run() calls
    tf.global_variables_initializer().run()
    for X_batch, y_batch in generator(data):
        feed_dict = {X: X_batch, y: y_batch}
        _, loss_value = sess.run([train_op, loss], feed_dict)
        # train_op here stands for the optimization operation you have defined,
        # and loss for the loss function (the return value of neural_network)
```
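To show the shape of this loop end to end without the TensorFlow machinery, here is a self-contained NumPy stand-in: the same structure (pull a batch from an infinite generator, run one training step on it), with an explicit gradient-descent step playing the role of `sess.run([train_op, ...])`. All names and hyperparameters (`x_dim`, `lr`, `n_steps`) are illustrative choices:

```python
import numpy as np

def batches(batch_size, x_dim, true_W, seed=0):
    """Infinite generator of (X, y) batches from a synthetic linear problem."""
    rng = np.random.default_rng(seed)
    while True:
        X = rng.normal(size=(batch_size, x_dim))
        y = X @ true_W
        yield X, y

x_dim, lr, n_steps = 3, 0.1, 500
true_W = np.array([[1.0], [-2.0], [0.5]])
W = np.zeros((x_dim, 1))  # model parameter, analogous to a tf.Variable

gen = batches(batch_size=32, x_dim=x_dim, true_W=true_W)
for step, (X_batch, y_batch) in zip(range(n_steps), gen):
    y_pred = X_batch @ W                                        # forward pass
    grad = 2 * X_batch.T @ (y_pred - y_batch) / len(X_batch)    # dMSE/dW
    W -= lr * grad                                              # one "train_op" step
```

The TensorFlow version replaces the forward pass and gradient step with a single `sess.run` call, but the generator-driven loop around it is identical.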
Hope you find this useful. Bear in mind, though, that this is not a fully working implementation but pseudocode, since you specified almost no details.