After using TensorFlow's retrain.py https://github.com/tensorflow/tensorflow/blob/master/tensorflow/examples/image_retraining/retrain.py I've successfully generated the "retrained_labels.txt" and "retrained_graph.pb" files. For anybody not familiar with this process, I'm essentially following this tutorial: https://codelabs.developers.google.com/codelabs/tensorflow-for-poets/#0 which is effectively the same steps as this popular video: https://www.youtube.com/watch?v=QfNvhPx5Px8 After the retraining process, I'm attempting to write a Python script that opens all the images in a test images directory, and successively shows each image in an OpenCV window and also runs TensorFlow to classify the image. Problem is, I can't seem to work out how to open the image as a NumPy array (this is the format that the Python OpenCV wrapper uses) and then convert it into a format I can pass into TensorFlow's sess.run(). Currently I'm opening the image with cv2.imread() and then opening it again with tf.gfile.FastGFile(). This is a very poor practice; I'd much rather open the image once and then convert it. Here is the relevant portion of the code where I'm stuck: <pre class="prettyprint"><code># open the image with OpenCV openCVImage = cv2.imread(imageFileWithPath) # show the OpenCV image cv2.imshow(fileName, openCVImage) # get the final tensor from the graph finalTensor = sess.graph.get_tensor_by_name('final_result:0') # open the image in TensorFlow tfImage = tf.gfile.FastGFile(imageFileWithPath, 'rb').read() # run the network to get the predictions predictions = sess.run(finalTensor, {'DecodeJpeg/contents:0': tfImage}) </code></pre> After reading these posts: How to convert numpy arrays to standard TensorFlow format? Feeding image data in tensorflow for transfer learning I've tried the following: <pre class="prettyprint"><code># show the OpenCV image cv2.imshow(fileName, openCVImage) # get the final tensor from the graph finalTensor = sess.graph.get_tensor_by_name('final_result:0') # convert the NumPy array / OpenCV image to a TensorFlow image openCVImageAsArray = np.asarray(openCVImage, np.float32) tfImage = tf.convert_to_tensor(openCVImageAsArray, np.float32) # run the network to get the predictions predictions = sess.run(finalTensor, {'DecodeJpeg/contents:0': tfImage}) </code></pre> This results in this error on the sess.run() line: <pre class="prettyprint"><code>TypeError: The value of a feed cannot be a tf.Tensor object. Acceptable feed values include Python scalars, strings, lists, numpy ndarrays, or TensorHandles. </code></pre> I've also tried this: <pre class="prettyprint"><code># show the OpenCV image cv2.imshow(fileName, openCVImage) # get the final tensor from the graph finalTensor = sess.graph.get_tensor_by_name('final_result:0') # convert the NumPy array / OpenCV image to a TensorFlow image tfImage = np.array(openCVImage)[:, :, 0:3] # run the network to get the predictions predictions = sess.run(finalTensor, {'DecodeJpeg/contents:0': tfImage}) </code></pre> which results in this error: <pre class="prettyprint"><code>ValueError: Cannot feed value of shape (257, 320, 3) for Tensor 'DecodeJpeg/contents:0', which has shape '()' </code></pre> --- EDIT --- I've also tried this: <pre class="prettyprint"><code># show the OpenCV image cv2.imshow(fileName, openCVImage) # get the final tensor from the graph finalTensor = sess.graph.get_tensor_by_name('final_result:0') # convert the NumPy array / OpenCV image to a TensorFlow image tfImage = np.expand_dims(openCVImage, axis=0) # run the network to get the predictions predictions = sess.run(finalTensor, feed_dict={finalTensor: tfImage}) </code></pre> which results in this error: <pre class="prettyprint"><code>ValueError: Cannot feed value of shape (1, 669, 1157, 3) for Tensor 'final_result:0', which has shape '(?, 2)' </code></pre> and I've also tried this: <pre class="prettyprint"><code># show the OpenCV image cv2.imshow(fileName, openCVImage) # get the final tensor from the graph finalTensor = sess.graph.get_tensor_by_name('final_result:0') # convert the NumPy array / OpenCV image to a TensorFlow image tfImage = np.expand_dims(openCVImage, axis=0) # run the network to get the predictions predictions = sess.run(finalTensor, feed_dict={'DecodeJpeg/contents:0': tfImage}) </code></pre> which results in this error: <pre class="prettyprint"><code>ValueError: Cannot feed value of shape (1, 669, 1157, 3) for Tensor 'DecodeJpeg/contents:0', which has shape '()' </code></pre> I'm not sure if this is necessary, but if anyone is curious here is the entire script. Note that this works great except for having to open the image twice: <pre class="prettyprint"><code># test.py import os import tensorflow as tf import numpy as np import cv2 # module-level variables ############################################################################################## RETRAINED_LABELS_TXT_FILE_LOC = os.getcwd() + "/" + "retrained_labels.txt" RETRAINED_GRAPH_PB_FILE_LOC = os.getcwd() + "/" + "retrained_graph.pb" TEST_IMAGES_DIR = os.getcwd() + "/test_images" ####################################################################################################################### def main(): # get a list of classifications from the labels file classifications = [] # for each line in the label file . . . for currentLine in tf.gfile.GFile(RETRAINED_LABELS_TXT_FILE_LOC): # remove the carriage return classification = currentLine.rstrip() # and append to the list classifications.append(classification) # end for # show the classifications to prove out that we were able to read the label file successfully print("classifications = " + str(classifications)) # load the graph from file with tf.gfile.FastGFile(RETRAINED_GRAPH_PB_FILE_LOC, 'rb') as retrainedGraphFile: # instantiate a GraphDef object graphDef = tf.GraphDef() # read in retrained graph into the GraphDef object graphDef.ParseFromString(retrainedGraphFile.read()) # import the graph into the current default Graph, note that we don't need to be concerned with the return value _ = tf.import_graph_def(graphDef, name='') # end with # if the test image directory listed above is not valid, show an error message and bail if not os.path.isdir(TEST_IMAGES_DIR): print("the test image directory does not seem to be a valid directory, check file / directory paths") return # end if with tf.Session() as sess: # for each file in the test images directory . . . for fileName in os.listdir(TEST_IMAGES_DIR): # if the file does not end in .jpg or .jpeg (case-insensitive), continue with the next iteration of the for loop if not (fileName.lower().endswith(".jpg") or fileName.lower().endswith(".jpeg")): continue # end if # show the file name on std out print(fileName) # get the file name and full path of the current image file imageFileWithPath = os.path.join(TEST_IMAGES_DIR, fileName) # attempt to open the image with OpenCV openCVImage = cv2.imread(imageFileWithPath) # if we were not able to successfully open the image, continue with the next iteration of the for loop if openCVImage is None: print("unable to open " + fileName + " as an OpenCV image") continue # end if # show the OpenCV image cv2.imshow(fileName, openCVImage) # get the final tensor from the graph finalTensor = sess.graph.get_tensor_by_name('final_result:0') # ToDo: find a way to convert from a NumPy array / OpenCV image to a TensorFlow image # instead of opening the file twice, these attempts don't work # attempt 1: # openCVImageAsArray = np.asarray(openCVImage, np.float32) # tfImage = tf.convert_to_tensor(openCVImageAsArray, np.float32) # attempt 2: # tfImage = np.array(openCVImage)[:, :, 0:3] # open the image in TensorFlow tfImage = tf.gfile.FastGFile(imageFileWithPath, 'rb').read() # run the network to get the predictions predictions = sess.run(finalTensor, {'DecodeJpeg/contents:0': tfImage}) # sort predictions from most confidence to least confidence sortedPredictions = predictions[0].argsort()[-len(predictions[0]):][::-1] print("---------------------------------------") # keep track of if we're going through the next for loop for the first time so we can show more info about # the first prediction, which is the most likely prediction (they were sorted descending above) onMostLikelyPrediction = True # for each prediction . . . for prediction in sortedPredictions: strClassification = classifications[prediction] # if the classification (obtained from the directory name) ends with the letter "s", remove the "s" to change from plural to singular if strClassification.endswith("s"): strClassification = strClassification[:-1] # end if # get confidence, then get confidence rounded to 2 places after the decimal confidence = predictions[0][prediction] # if we're on the first (most likely) prediction, state what the object appears to be and show a % confidence to two decimal places if onMostLikelyPrediction: scoreAsAPercent = confidence * 100.0 print("the object appears to be a " + strClassification + ", " + "{0:.2f}".format(scoreAsAPercent) + "% confidence") onMostLikelyPrediction = False # end if # for any prediction, show the confidence as a ratio to five decimal places print(strClassification + " (" + "{0:.5f}".format(confidence) + ")") # end for # pause until a key is pressed so the user can see the current image (shown above) and the prediction info cv2.waitKey() # after a key is pressed, close the current window to prep for the next time around cv2.destroyAllWindows() # end for # end with # write the graph to file so we can view with TensorBoard tfFileWriter = tf.summary.FileWriter(os.getcwd()) tfFileWriter.add_graph(sess.graph) tfFileWriter.close() # end main ####################################################################################################################### if __name__ == "__main__": main() </code></pre>

You were pretty close: <code>{'DecodeJpeg/contents:0': tfImage}</code> decodes a binary jpeg image. You need to use <code>{'DecodeJpeg:0': tfImage}</code> instead if the image is already decoded. Read more here So your code should look like this: <pre class="prettyprint"><code>tfImage = np.array(openCVImage)[:, :, 0:3] # run the network to get the predictions predictions = sess.run(finalTensor, {'DecodeJpeg:0': tfImage}) </code></pre>

How to convert NumPy array image to TensorFlow image?

Tags:

python

opencv

numpy

tensorflow

After using TensorFlow's retrain.py

https://github.com/tensorflow/tensorflow/blob/master/tensorflow/examples/image_retraining/retrain.py

I've successfully generated the "retrained_labels.txt" and "retrained_graph.pb" files. For anybody not familiar with this process, I'm essentially following this tutorial:

https://codelabs.developers.google.com/codelabs/tensorflow-for-poets/#0

which is effectively the same steps as this popular video:

https://www.youtube.com/watch?v=QfNvhPx5Px8

After the retraining process, I'm attempting to write a Python script that opens all the images in a test images directory, and successively shows each image in an OpenCV window and also runs TensorFlow to classify the image.

Problem is, I can't seem to work out how to open the image as a NumPy array (this is the format that the Python OpenCV wrapper uses) and then convert it into a format I can pass into TensorFlow's sess.run().

Currently I'm opening the image with cv2.imread() and then opening it again with tf.gfile.FastGFile(). This is a very poor practice; I'd much rather open the image once and then convert it.

Here is the relevant portion of the code where I'm stuck:

# open the image with OpenCV
openCVImage = cv2.imread(imageFileWithPath)

# show the OpenCV image
cv2.imshow(fileName, openCVImage)

# get the final tensor from the graph
finalTensor = sess.graph.get_tensor_by_name('final_result:0')

# open the image in TensorFlow
tfImage = tf.gfile.FastGFile(imageFileWithPath, 'rb').read()

# run the network to get the predictions
predictions = sess.run(finalTensor, {'DecodeJpeg/contents:0': tfImage})

After reading these posts:

How to convert numpy arrays to standard TensorFlow format?

Feeding image data in tensorflow for transfer learning

I've tried the following:

# show the OpenCV image
cv2.imshow(fileName, openCVImage)

# get the final tensor from the graph
finalTensor = sess.graph.get_tensor_by_name('final_result:0')

# convert the NumPy array / OpenCV image to a TensorFlow image
openCVImageAsArray = np.asarray(openCVImage, np.float32)
tfImage = tf.convert_to_tensor(openCVImageAsArray, np.float32)

# run the network to get the predictions
predictions = sess.run(finalTensor, {'DecodeJpeg/contents:0': tfImage})

This results in this error on the sess.run() line:

TypeError: The value of a feed cannot be a tf.Tensor object. Acceptable feed values include Python scalars, strings, lists, numpy ndarrays, or TensorHandles.

I've also tried this:

# show the OpenCV image
cv2.imshow(fileName, openCVImage)

# get the final tensor from the graph
finalTensor = sess.graph.get_tensor_by_name('final_result:0')

# convert the NumPy array / OpenCV image to a TensorFlow image
tfImage = np.array(openCVImage)[:, :, 0:3]

# run the network to get the predictions
predictions = sess.run(finalTensor, {'DecodeJpeg/contents:0': tfImage})

which results in this error:

ValueError: Cannot feed value of shape (257, 320, 3) for Tensor 'DecodeJpeg/contents:0', which has shape '()'

--- EDIT ---

I've also tried this:

# show the OpenCV image
cv2.imshow(fileName, openCVImage)

# get the final tensor from the graph
finalTensor = sess.graph.get_tensor_by_name('final_result:0')

# convert the NumPy array / OpenCV image to a TensorFlow image
tfImage = np.expand_dims(openCVImage, axis=0)

# run the network to get the predictions
predictions = sess.run(finalTensor, feed_dict={finalTensor: tfImage})

which results in this error:

ValueError: Cannot feed value of shape (1, 669, 1157, 3) for Tensor 'final_result:0', which has shape '(?, 2)'

and I've also tried this:

# show the OpenCV image
cv2.imshow(fileName, openCVImage)

# get the final tensor from the graph
finalTensor = sess.graph.get_tensor_by_name('final_result:0')

# convert the NumPy array / OpenCV image to a TensorFlow image
tfImage = np.expand_dims(openCVImage, axis=0)

# run the network to get the predictions
predictions = sess.run(finalTensor, feed_dict={'DecodeJpeg/contents:0': tfImage})

which results in this error:

ValueError: Cannot feed value of shape (1, 669, 1157, 3) for Tensor 'DecodeJpeg/contents:0', which has shape '()'

I'm not sure if this is necessary, but if anyone is curious here is the entire script. Note that this works great except for having to open the image twice:

# test.py

import os
import tensorflow as tf
import numpy as np
import cv2

# module-level variables ##############################################################################################
RETRAINED_LABELS_TXT_FILE_LOC = os.getcwd() + "/" + "retrained_labels.txt"
RETRAINED_GRAPH_PB_FILE_LOC = os.getcwd() + "/" + "retrained_graph.pb"

TEST_IMAGES_DIR = os.getcwd() + "/test_images"

#######################################################################################################################
def main():
    # get a list of classifications from the labels file
    classifications = []
    # for each line in the label file . . .
    for currentLine in tf.gfile.GFile(RETRAINED_LABELS_TXT_FILE_LOC):
        # remove the carriage return
        classification = currentLine.rstrip()
        # and append to the list
        classifications.append(classification)
    # end for

    # show the classifications to prove out that we were able to read the label file successfully
    print("classifications = " + str(classifications))

    # load the graph from file
    with tf.gfile.FastGFile(RETRAINED_GRAPH_PB_FILE_LOC, 'rb') as retrainedGraphFile:
        # instantiate a GraphDef object
        graphDef = tf.GraphDef()
        # read in retrained graph into the GraphDef object
        graphDef.ParseFromString(retrainedGraphFile.read())
        # import the graph into the current default Graph, note that we don't need to be concerned with the return value
        _ = tf.import_graph_def(graphDef, name='')
    # end with

    # if the test image directory listed above is not valid, show an error message and bail
    if not os.path.isdir(TEST_IMAGES_DIR):
        print("the test image directory does not seem to be a valid directory, check file / directory paths")
        return
    # end if

    with tf.Session() as sess:
        # for each file in the test images directory . . .
        for fileName in os.listdir(TEST_IMAGES_DIR):
            # if the file does not end in .jpg or .jpeg (case-insensitive), continue with the next iteration of the for loop
            if not (fileName.lower().endswith(".jpg") or fileName.lower().endswith(".jpeg")):
                continue
            # end if

            # show the file name on std out
            print(fileName)

            # get the file name and full path of the current image file
            imageFileWithPath = os.path.join(TEST_IMAGES_DIR, fileName)
            # attempt to open the image with OpenCV
            openCVImage = cv2.imread(imageFileWithPath)

            # if we were not able to successfully open the image, continue with the next iteration of the for loop
            if openCVImage is None:
                print("unable to open " + fileName + " as an OpenCV image")
                continue
            # end if

            # show the OpenCV image
            cv2.imshow(fileName, openCVImage)

            # get the final tensor from the graph
            finalTensor = sess.graph.get_tensor_by_name('final_result:0')

            # ToDo: find a way to convert from a NumPy array / OpenCV image to a TensorFlow image
            # instead of opening the file twice, these attempts don't work
            # attempt 1:
            # openCVImageAsArray = np.asarray(openCVImage, np.float32)
            # tfImage = tf.convert_to_tensor(openCVImageAsArray, np.float32)
            # attempt 2:
            # tfImage = np.array(openCVImage)[:, :, 0:3]

            # open the image in TensorFlow
            tfImage = tf.gfile.FastGFile(imageFileWithPath, 'rb').read()

            # run the network to get the predictions
            predictions = sess.run(finalTensor, {'DecodeJpeg/contents:0': tfImage})

            # sort predictions from most confidence to least confidence
            sortedPredictions = predictions[0].argsort()[-len(predictions[0]):][::-1]

            print("---------------------------------------")

            # keep track of if we're going through the next for loop for the first time so we can show more info about
            # the first prediction, which is the most likely prediction (they were sorted descending above)
            onMostLikelyPrediction = True
            # for each prediction . . .
            for prediction in sortedPredictions:
                strClassification = classifications[prediction]

                # if the classification (obtained from the directory name) ends with the letter "s", remove the "s" to change from plural to singular
                if strClassification.endswith("s"):
                    strClassification = strClassification[:-1]
                # end if

                # get confidence, then get confidence rounded to 2 places after the decimal
                confidence = predictions[0][prediction]

                # if we're on the first (most likely) prediction, state what the object appears to be and show a % confidence to two decimal places
                if onMostLikelyPrediction:
                    scoreAsAPercent = confidence * 100.0
                    print("the object appears to be a " + strClassification + ", " + "{0:.2f}".format(scoreAsAPercent) + "% confidence")
                    onMostLikelyPrediction = False
                # end if

                # for any prediction, show the confidence as a ratio to five decimal places
                print(strClassification + " (" +  "{0:.5f}".format(confidence) + ")")
            # end for

            # pause until a key is pressed so the user can see the current image (shown above) and the prediction info
            cv2.waitKey()
            # after a key is pressed, close the current window to prep for the next time around
            cv2.destroyAllWindows()
        # end for
    # end with

    # write the graph to file so we can view with TensorBoard
    tfFileWriter = tf.summary.FileWriter(os.getcwd())
    tfFileWriter.add_graph(sess.graph)
    tfFileWriter.close()

# end main

#######################################################################################################################
if __name__ == "__main__":
    main()

886

asked Feb 11 '18 01:02

cdahms

1 Answers

You were pretty close:

{'DecodeJpeg/contents:0': tfImage} decodes a binary jpeg image.

You need to use {'DecodeJpeg:0': tfImage} instead if the image is already decoded. Read more here

So your code should look like this:

tfImage = np.array(openCVImage)[:, :, 0:3]
# run the network to get the predictions
predictions = sess.run(finalTensor, {'DecodeJpeg:0': tfImage})

151

answered Nov 04 '22 09:11

ias

Related questions
                            
                                Matplotlib Table- Assign different text alignments to different columns
                            
                                scikit learn: custom classifier compatible with GridSearchCV
                            
                                How can I overload operators so that type on the left/right does not matter?
                            
                                Sum of distances from a point to all other points
                            
                                OSRM giving wrong response for distance between 2 points
                            
                                Socket Java client - Python Server
                            
                                Understanding Scipy Convolution
                            
                                How to tell if a python module is intended to be python 2 or python 3?
                            
                                PyTorch: training with GPU gives worse error than training the same thing with CPU
                            
                                PySpark Numeric Window Group By
                            
                                Pandas Datetime Interval Resample to Seconds
                            
                                Matching strings in a column of a data frame with the strings in a column of another data frame using R or Python
                            
                                Django : Maintaining option selected in HTML template
                            
                                Text Extraction from image after detecting text region with contours
                            
                                What is event_loop_policy and why is it needed in python asyncio?
                            
                                1d CNN audio in keras
                            
                                Keras MSE definition
                            
                                How do I obtain the second highest value in a row?
                            
                                AttributeError: 'str' object has no attribute 'ndim' [closed]
                            
                                Is there a copy constructor for Map Fields in Python Protocol Buffers?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to convert NumPy array image to TensorFlow image?

Tags:

python

opencv

numpy

tensorflow

cdahms

People also ask

1 Answers

ias

Recent Activity

Donate For Us