 

How to read images with different sizes in a TFRecord file

I have created a dataset and saved it into a TFRecord file. The thing is, the pictures have different sizes, so I want to save the size along with each image. I used the TFRecordWriter and defined the features like this:

example = tf.train.Example(features=tf.train.Features(feature={
  'rows': _int64_feature(image.shape[0]),
  'cols': _int64_feature(image.shape[1]),
  'image_raw': _bytes_feature(image_raw)}))
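
For reference, _int64_feature and _bytes_feature are the usual small wrapper helpers from the TensorFlow examples; I'm assuming definitions like:

def _int64_feature(value):
  # Wrap an integer (e.g. a row/column count) in a Feature proto.
  return tf.train.Feature(int64_list=tf.train.Int64List(value=[value]))

def _bytes_feature(value):
  # Wrap raw bytes (e.g. the encoded image) in a Feature proto.
  return tf.train.Feature(bytes_list=tf.train.BytesList(value=[value]))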

I expected to be able to read and decode the image using TFRecordReader, but I cannot get the values of rows and cols from the file because they are tensors. So how am I supposed to read the size dynamically and reshape the image accordingly? Thanks, guys.

asked Jan 27 '16 by Tong Shen



2 Answers

You can call tf.reshape with a dynamic shape parameter.

# 'features' comes from tf.parse_single_example on the serialized record.
image_rows = tf.cast(features['rows'], tf.int32)
image_cols = tf.cast(features['cols'], tf.int32)
image_data = tf.decode_raw(features['image_raw'], tf.uint8)
# tf.pack was renamed tf.stack in TensorFlow 1.0.
image = tf.reshape(image_data, tf.pack([image_rows, image_cols, 3]))
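
For context, here is a minimal sketch of the TF1 queue-based reading code this snippet assumes; the feature spec mirrors the writer in the question:

def read_and_decode(filename_queue):
  # Read one serialized example from the queue of TFRecord files.
  reader = tf.TFRecordReader()
  _, serialized_example = reader.read(filename_queue)
  features = tf.parse_single_example(
      serialized_example,
      features={
          'rows': tf.FixedLenFeature([], tf.int64),
          'cols': tf.FixedLenFeature([], tf.int64),
          'image_raw': tf.FixedLenFeature([], tf.string),
      })
  rows = tf.cast(features['rows'], tf.int32)
  cols = tf.cast(features['cols'], tf.int32)
  image = tf.decode_raw(features['image_raw'], tf.uint8)
  # The shape argument is itself a tensor, so the reshape adapts per record.
  return tf.reshape(image, tf.pack([rows, cols, 3]))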
answered Nov 02 '22 by bgshi

I suggest a workflow like:

TARGET_HEIGHT = 500
TARGET_WIDTH = 500

image = tf.image.decode_jpeg(image_buffer, channels=3)
image = tf.image.convert_image_dtype(image, dtype=tf.float32)

# Choose your bbox here.
bbox_begin = ...  # should be (h_start, w_start, 0); see below for computing it dynamically
bbox_size = tf.constant((TARGET_HEIGHT, TARGET_WIDTH, 3), dtype=tf.int32)

cropped_image = tf.slice(image, bbox_begin, bbox_size)

cropped_image now has a statically known shape, and can then be thrown into a shuffle batch, for example (the batch_size and capacity numbers here are just illustrative):
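
cropped_image.set_shape([TARGET_HEIGHT, TARGET_WIDTH, 3])  # make the static shape explicit for batching
# batch_size/capacity values below are placeholders; tune them for your pipeline.
images = tf.train.shuffle_batch([cropped_image], batch_size=32,
                                capacity=2000, min_after_dequeue=1000)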

You can dynamically access the size of the decoded image using tf.shape(image). You can do computations on the resulting sub-elements and then stitch them back together using something like bbox_begin = tf.pack([bbox_h_start, bbox_w_start, 0]). You just need to insert your own logic for determining the start points of the crop, and for what to do if the image starts out smaller than you want for your pipeline.
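
For example, a hypothetical center crop (the start-point logic and variable names here are mine, not from the answer):

shape = tf.shape(image)  # dynamic [height, width, channels], known only at run time
bbox_h_start = tf.maximum((shape[0] - TARGET_HEIGHT) // 2, 0)  # clamp at 0 for small images
bbox_w_start = tf.maximum((shape[1] - TARGET_WIDTH) // 2, 0)
bbox_begin = tf.pack([bbox_h_start, bbox_w_start, 0])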

If you want to upsize only when the image is smaller than your target dimensions, you'll need tf.control_flow_ops.cond (tf.cond) or an equivalent. But you can instead use min and max operations to set the size of your crop window so that it returns the full image iff it's smaller than the requested dimensions, and then unconditionally resize up to 500x500. When the image was already large enough, the crop is already 500x500, so the resize is effectively a no-op.
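
A rough sketch of that clamp-then-resize idea, continuing from the snippet above (the two-argument resize_images call assumes the TF1 API):

# Clamp the crop window so it never exceeds the actual image.
crop_size = tf.pack([tf.minimum(shape[0], TARGET_HEIGHT),
                     tf.minimum(shape[1], TARGET_WIDTH), 3])
cropped_image = tf.slice(image, bbox_begin, crop_size)
# Unconditional resize: effectively a no-op when the crop is already 500x500.
cropped_image = tf.image.resize_images(cropped_image, [TARGET_HEIGHT, TARGET_WIDTH])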

answered Nov 02 '22 by dga