Inconsistency between image resizing with Keras (PIL) and TensorFlow?

Tags:

I am bugged by an apparent inconsistency between:

image resizing functionalities from keras.preprocessing, which are wrappers around PIL functions
image resizing functions in TensorFlow's tf.image.

I am training a deep learning model for a computer vision task with Keras (actually, with tf.keras, but that doesn't matter here). I then serve the model with TF Serving, which requires me to send images to the model as encoded byte-strings, where they are decoded using tf.image.decode_png before going through the model graph.

The problem occurs when I resize the images. Resizing with bilinear interpolation (or any other method) gives different results with PIL compared to with tf.image, to such extent that the classification by the model changes depending on which function I use.

The code below provides a reproducible example.

import numpy as np 
from PIL import Image
from keras.preprocessing.image import load_img, img_to_array
import tensorflow as tf

# Generate an 'image' with numpy, save as png
np.random.seed(42)
image = np.random.randint(0, 255, size=(256, 256, 3)).astype(np.uint8)
Image.fromarray(image).convert("RGB").save('my_image.png')

Now, let's load the image in two ways. First with the PIL wrappers from Keras, as during the model training, then encoded as a binary string and decoded with TensorFlow functions, as in my model server.

# Using Keras PIL wrappers
keras_image = img_to_array(load_img('./my_image.png')) 

# Using TF functionalities
with tf.Session() as sess:
    with open('./my_image.png', 'rb') as f:
        tf_image_ = tf.image.decode_png(f.read())
    tf_image = sess.run(tf_image_)

So far so good, as both images are exactly the same (apart from the dtype, as Keras has casted the image to float32):

# Assert equality
np.array_equal(keras_image, tf_image)
> True

Repeating this code with resizing however gives a different result:

# Using Keras PIL wrappers, with resizing
keras_image_rs = img_to_array(load_img('./my_image.png',
                             target_size=(224, 224),
                             interpolation='bilinear'))

# Using TF functionalities, with resizing
with tf.Session() as sess:
    with open('./my_image.png', 'rb') as f:
        tf_image_ = tf.image.decode_png(f.read())
        # Add and remove dimension
        # As tf.image.resize_* requires a batch dimension
        tf_image_ = tf.expand_dims(tf_image_, 0)
        tf_image_ = tf.image.resize_bilinear(tf_image_,
                                            [224, 224], 
                                             align_corners=True)
        tf_image_ = tf.squeeze(tf_image_, axis=[0])

    tf_image_rs = sess.run(tf_image_)

# Assert equality
np.array_equal(keras_image_rs, tf_image_rs)
> False

The mean absolute difference between the two images is non-negligible:

np.mean(np.abs(keras_image_rs - tf_image_rs))
7.982703

I played with the align_corners argument, and tried other available interpolation methods as well. None give the same output as when resizing the image with PIL. This is quite annoying as it gives me a skew between training and testing results. Does anyone have an idea as to what causes this behavior, or on how to fix it?

508

asked Feb 02 '19 20:02

sdcbr

1 Answers

The behavior described completely matches what's written in this article: How Tensorflow’s tf.image.resize stole 60 days of my life

In short: yes, PIL/sklearn/OpenCV and other common libraries for image manipulation have the correct behavior, while tf.image.resize has a different behavior that won't be changed in order to not break old trained models.

Hence, you should always preprocess your image using the same library outside the computational graph.

Link to the relevant github thread: https://github.com/tensorflow/tensorflow/issues/6720

answered Sep 27 '22 20:09

nessuno

Related questions
                            
                                What is the Python equivalent of Java's UnsupportedOperationException?
                            
                                Likelihood ratio test in Python
                            
                                itertools not defined when used inside module
                            
                                ZipFile.testzip() returning different results on Python 2 and Python 3
                            
                                Pip: could not find a version. No matching distribution found
                            
                                Wrapping column names in Python Pandas DataFrame or Jupyter Notebooks
                            
                                How to convert a series of one value to float only?
                            
                                Airflow no module named for directory in airflow_home directory
                            
                                TypeError: src is not a numpy array, neither a scalar
                            
                                Can you overload the Python 3.6 f-string's "operator"?
                            
                                Understanding apyori's output
                            
                                size legend for plotly bubble map/chart
                            
                                Find (only) the first row satisfying a given condition in pandas DataFrame
                            
                                Fastest way to compute image dataset channel wise mean and standard deviation in Python
                            
                                What does "Client failed to connect to the D-BUS daemon" mean?
                            
                                Generic NamedTuple in Python 3.6
                            
                                How to type hint a dictionary with values of different types
                            
                                Python Visual Studio Code autoDocstring Configuration
                            
                                HTTP PUT request in Python using JSON data
                            
                                Why is numpy sum 10 times slower than the + operator?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Inconsistency between image resizing with Keras (PIL) and TensorFlow?

Tags:

python

tensorflow

python-imaging-library

deep-learning

keras

sdcbr

People also ask

1 Answers

nessuno

Recent Activity

Donate For Us