How can I train a network in TensorFlow using minibatches of data? In the Deep-MNIST tutorial, they use:
for i in range(1000):
    batch = mnist.train.next_batch(50)
    train_step.run(feed_dict={x: batch[0], y_: batch[1]})
My question is: are x and y_ variables with dimensions suitable to a single example, and are batch[0], batch[1] lists of such inputs and outputs? In that case, will TensorFlow automatically add the gradients for each training example in these lists, or should I create my model so that x and y_ get an entire minibatch?
My problem is that when I try to feed a list for each placeholder, TensorFlow tries to input the entire list into the placeholder, and I therefore get a size mismatch: Cannot feed value of shape (n, m) for Tensor u'ts:0', which has shape '(m,)', where n is the minibatch size and m is the individual input size.
Thanks.
The results confirm that using small batch sizes achieves the best generalization performance, for a given computation cost. In all cases, the best results have been obtained with batch sizes of 32 or smaller. Often mini-batch sizes as small as 2 or 4 deliver optimal results.
Batch means that you use all your data to compute the gradient during one iteration. Mini-batch means you only take a subset of all your data during one iteration.
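To make the difference concrete, here is a minimal sketch (the data array and batch_size are illustrative, not from the tutorial):

import numpy as np

data = np.random.rand(1000, 784)  # illustrative dataset: 1000 examples, 784 features each

# Batch: a single gradient step would see all 1000 examples at once,
# e.g. feed_dict={x: data}

# Mini-batch: each gradient step sees only a slice of the data
batch_size = 50
for start in range(0, len(data), batch_size):
    mini_batch = data[start:start + batch_size]  # shape (50, 784)
    # e.g. feed_dict={x: mini_batch}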
On the other hand, a big batch size can really speed up your training, and can even give better generalization performance. A good way to find a suitable batch size is the Simple Noise Scale metric introduced in “An Empirical Model of Large-Batch Training”.
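As a rough summary of that metric (from memory; see the paper for the precise estimator): the simple noise scale is B_simple = tr(Σ) / |G|², where G is the true gradient of the loss and Σ is the covariance matrix of the per-example gradients. Batch sizes well below B_simple gain almost linear speedup as you grow the batch, while sizes far above it give diminishing returns per example.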
In the MNIST tutorial, x and y_ are placeholders with a defined shape:
x = tf.placeholder(tf.float32, shape=[None, 784])
y_ = tf.placeholder(tf.float32, shape=[None, 10])
The shape=[None, 784] means that this placeholder has 2 dimensions.
So, to answer your first question:
are x and y_ variables with dimensions suitable to a single example
The first dimension can contain an undefined number of elements (so 1, 2, ... 50, ...) and the second dimension must contain exactly 784 = 28*28 elements (the features of a single MNIST image).
Feeding the graph a Python list (or numpy array) with shape [1, 784] or with shape [50, 784] is exactly the same for TensorFlow: it handles both without any problem.
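A self-contained sketch of that (the doubled op is just a stand-in for the tutorial's model):

import numpy as np
import tensorflow as tf

x = tf.placeholder(tf.float32, shape=[None, 784])
doubled = x * 2.0  # any op that consumes x; illustrative only

with tf.Session() as sess:
    one = np.random.rand(1, 784)     # a single example
    fifty = np.random.rand(50, 784)  # a mini-batch of 50 examples
    # the None dimension matches either batch size
    print(sess.run(doubled, feed_dict={x: one}).shape)    # (1, 784)
    print(sess.run(doubled, feed_dict={x: fifty}).shape)  # (50, 784)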
batch[0], batch[1] are lists of such inputs and outputs?

In the tutorial they define batch by calling batch = mnist.train.next_batch(50). Thus batch[0] is a list with shape [50, 784] (the 50 input images, one per row) and batch[1] is a list with shape [50, 10] (the corresponding one-hot labels).
will TensorFlow automatically add the gradients for each training example in these lists? or should I create my model so that x and y_ get an entire minibatch?
TensorFlow handles this for you: the tutorial's loss is reduced over the batch dimension, so a single training step already averages the gradient over every example in the mini-batch.
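For example, in the softmax-regression version of the tutorial the loss looks roughly like this (a sketch from memory, not a verbatim copy):

import tensorflow as tf

x = tf.placeholder(tf.float32, shape=[None, 784])
y_ = tf.placeholder(tf.float32, shape=[None, 10])
W = tf.Variable(tf.zeros([784, 10]))
b = tf.Variable(tf.zeros([10]))
y = tf.nn.softmax(tf.matmul(x, W) + b)

# per-example cross-entropy: one value per row of the feed, shape [None]
per_example = -tf.reduce_sum(y_ * tf.log(y), axis=1)
# reduce_mean collapses the batch dimension into a scalar, so the gradient
# the optimizer computes already averages over the whole mini-batch
cross_entropy = tf.reduce_mean(per_example)
train_step = tf.train.GradientDescentOptimizer(0.5).minimize(cross_entropy)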
The error you're reporting, Cannot feed value of shape (n, m) for Tensor u'ts:0', which has shape '(m,)', is a shape mismatch: your placeholder was declared with the one-dimensional shape (m,) of a single example, so it cannot accept a whole mini-batch of shape (n, m). Either declare the placeholder with a leading None dimension, as in the tutorial, or reshape what you feed so it matches the placeholder's shape.
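A minimal sketch of the fix (ts comes from your error message; m = 784 is just MNIST's input size):

import numpy as np
import tensorflow as tf

# broken: shape (784,) accepts exactly one flat example
# ts = tf.placeholder(tf.float32, shape=[784], name='ts')

# fixed: a leading None lets the same placeholder take any mini-batch
ts = tf.placeholder(tf.float32, shape=[None, 784], name='ts')

batch = np.random.rand(50, 784)  # an (n, m) mini-batch
with tf.Session() as sess:
    print(sess.run(tf.shape(ts), feed_dict={ts: batch}))  # [50 784]

If the placeholder really must stay one-dimensional, the alternative is to feed one example at a time, with each feed having shape (784,).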