 

How to load pickle files by tensorflow's tf.data API

I have my data stored in multiple pickle files on disk. I want to use TensorFlow's tf.data.Dataset to load it into my training pipeline. My code is:

def _parse_file(path):
    image, label = *load pickle file*
    return image, label
paths = glob.glob('*.pkl')
print(len(paths))
dataset = tf.data.Dataset.from_tensor_slices(paths)
dataset = dataset.map(_parse_file)
iterator = dataset.make_one_shot_iterator()

The problem is that I don't know how to implement the _parse_file function. Its argument, path, is a tensor of string type. I tried

def _parse_file(path):
    with tf.Session() as s:
        p = s.run(path)
        image, label = pickle.load(open(p, 'rb'))
    return image, label

and got error message:

InvalidArgumentError (see above for traceback): You must feed a value for placeholder tensor 'arg0' with dtype string
     [[Node: arg0 = Placeholder[dtype=DT_STRING, shape=<unknown>, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]

After some searching on the Internet I still have no idea how to do it. I would be grateful to anyone who can provide a hint.

Zhao Chen, asked Jun 15 '18

People also ask

What does TF data dataset From_tensor_slices do?

With the tf.data.Dataset.from_tensor_slices() method, we can get the slices of an array in the form of dataset objects: each entry along the first dimension becomes one element of the dataset.
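A minimal sketch of this behavior, assuming TF 2.x eager execution (the file names are placeholders; the files need not exist, since from_tensor_slices only slices the string tensor):

```python
import tensorflow as tf

# Each entry of the list becomes one element of the dataset.
dataset = tf.data.Dataset.from_tensor_slices(['a.pkl', 'b.pkl', 'c.pkl'])

for path in dataset:
    # Each element is a scalar string tensor holding one path.
    print(path.numpy().decode())
```
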

Which are the three main methods of getting data into a TensorFlow program?

Feeding: Python code provides the data when running each step. Reading from files: an input pipeline reads the data from files at the beginning of a TensorFlow graph. Preloaded data: a constant or variable in the TensorFlow graph holds all the data (for small data sets).


1 Answer

I have solved this myself. The solution is to wrap the Python loading code with tf.py_func, as described in the TensorFlow documentation.

Zhao Chen, answered Oct 03 '22