I have created a deep convolutional neural network to classify individual pixels in an image. My training data will always be the same size (32x32x7), but my testing data can be any size.
Github Repository
Currently, my model only works on images of one fixed size. I have used the TensorFlow MNIST tutorial extensively to help me construct my model. In that tutorial, we only use 28x28 images. How would the following MNIST model be changed to accept images of any size?
x = tf.placeholder(tf.float32, shape=[None, 784])   # flattened 28x28 input
y_ = tf.placeholder(tf.float32, shape=[None, 10])   # one-hot labels, 10 classes
W = tf.Variable(tf.zeros([784, 10]))                 # weights tied to the 784-pixel input
b = tf.Variable(tf.zeros([10]))
x_image = tf.reshape(x, [-1, 28, 28, 1])             # back to 28x28x1 for the conv layers
To make things a little more complicated, my model has transpose convolutions where the output shape needs to be specified. How would I adjust the following line of code so that the transpose convolution outputs a shape the same size as the input?
DeConnv1 = tf.nn.conv3d_transpose(layer1, filter = w, output_shape = [1,32,32,7,1], strides = [1,2,2,2,1], padding = 'SAME')
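One common workaround (a sketch only; x_volume is a hypothetical stand-in for the 5-D tensor the layer should upsample back to, not a name from the linked repository) is to build output_shape at run time from tf.shape instead of hard-coding it:
# Sketch: derive the output shape dynamically instead of hard-coding [1,32,32,7,1]
dyn = tf.shape(x_volume)                                   # [batch, d, h, w, channels] at run time
out_shape = tf.stack([dyn[0], dyn[1], dyn[2], dyn[3], 1])  # keep spatial dims, 1 output channel
DeConnv1 = tf.nn.conv3d_transpose(layer1, filter=w, output_shape=out_shape,
                                  strides=[1, 2, 2, 2, 1], padding='SAME')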
Conventionally, when dealing with images of different sizes in a CNN (which happens very often in real-world problems), we either resize the images to the size of the smallest image with the help of an image-manipulation library (OpenCV, PIL, etc.) or pad the unequal-sized images up to the desired size; a rough sketch of both follows.
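A minimal sketch of the two pre-processing options, assuming H x W x C NumPy arrays (the target sizes and function names here are made up for illustration):
import numpy as np
import cv2

def resize_image(img, target_h, target_w):
    # cv2.resize takes the target size in (width, height) order
    return cv2.resize(img, (target_w, target_h), interpolation=cv2.INTER_AREA)

def pad_image(img, target_h, target_w):
    # zero-pad the bottom/right of the spatial dimensions up to the target size
    pad_h = max(target_h - img.shape[0], 0)
    pad_w = max(target_w - img.shape[1], 0)
    return np.pad(img, ((0, pad_h), (0, pad_w), (0, 0)), mode='constant')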
Usually around 100 images are sufficient to train a class. If the images in a class are very similar, fewer images might be sufficient. It is important that the training images are representative of the variation typically found within the class.
100 images is quite low for a CNN. The appropriate number of samples depends on the specific problem and should be tested for each case individually, but a rough rule of thumb is to train a CNN on a data set of more than 5,000 samples for effective generalization.
Resizing images is a critical pre-processing step in computer vision, principally because deep learning models train faster on small images. Doubling the height and width of the input requires the network to learn from four times as many pixels, which increases training time.
Unfortunately there's no way to build dynamic graphs in TensorFlow (you could try TensorFlow Fold, but that's outside the scope of the question). This leaves you with two options:
Bucketing: you create multiple input tensors in a few hand-picked sizes and then at runtime you choose the right bucket (see the seq2seq-with-bucketing example; a rough sketch also follows the resizing code below). Either way you'll probably need the second option as well.
Resize the input and output images. Assuming the images all maintain the same aspect ratio, you can try resizing them before inference. Not sure why you care about the output, since MNIST is a classification task.
Either way you can use the same approach:
from PIL import Image
basewidth = 28  # MNIST image width
img = Image.open('your_input_img.jpg')
wpercent = basewidth / float(img.size[0])    # scale factor relative to the original width
hsize = int(float(img.size[1]) * wpercent)   # scale the height by the same factor
# Image.ANTIALIAS was removed in newer Pillow; LANCZOS is the equivalent filter
img = img.resize((basewidth, hsize), Image.LANCZOS)
# Save the image or feed it directly to TensorFlow
img.save('feed_to_tf.jpg')
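For the bucketing option mentioned above, a rough sketch (the bucket sizes, the 7-channel placeholder shape, and the pad-to-bucket policy are all made up for illustration):
# Sketch of bucketing: a few hand-picked spatial sizes, one placeholder per size,
# and each image is padded up to the smallest bucket it fits into at run time.
import numpy as np
import tensorflow as tf

bucket_sizes = [32, 64, 128]
placeholders = {s: tf.placeholder(tf.float32, shape=[None, s, s, 7]) for s in bucket_sizes}

def pick_bucket(img):
    # smallest bucket the image fits into (assumes it fits into the largest one)
    size = next(s for s in bucket_sizes if s >= max(img.shape[0], img.shape[1]))
    pad_h, pad_w = size - img.shape[0], size - img.shape[1]
    return size, np.pad(img, ((0, pad_h), (0, pad_w), (0, 0)), mode='constant')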
The MNIST model code you mentioned is an example of a fully connected (FC) network, not a convolutional one. The input shape of [None, 784] corresponds to the MNIST image size (28 x 28), and such an FC network has a fixed input size.
What you are asking for is not possible with FC networks because the number of weights and biases depends on the input shape. It is possible with a fully convolutional architecture, so my suggestion is to use a fully convolutional network, where the weights and biases do not depend on the input shape (see the sketch below).
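A minimal sketch of that idea for the 7-band data in the question (the layer widths and class count are made up; the point is the [None, None, None, 7] placeholder and the absence of any dense layer):
# Fully convolutional sketch: no flattening or dense layers, so any H x W works.
import tensorflow as tf

x = tf.placeholder(tf.float32, shape=[None, None, None, 7])   # any height/width, 7 bands
conv1 = tf.layers.conv2d(x, filters=32, kernel_size=3, padding='same', activation=tf.nn.relu)
conv2 = tf.layers.conv2d(conv1, filters=32, kernel_size=3, padding='same', activation=tf.nn.relu)
logits = tf.layers.conv2d(conv2, filters=2, kernel_size=1, padding='same')  # per-pixel class scores
If the transpose convolutions are built with the tf.layers API (tf.layers.conv3d_transpose / conv2d_transpose) instead of tf.nn.conv3d_transpose, the output shape is inferred automatically, so the hard-coded [1,32,32,7,1] is no longer needed.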