<p>I'm trying to write a function, that reads json files in tensorflow. The json files have the following structure: </p> <pre class="prettyprint"><code>{ "bounding_box": { "y": 98.5, "x": 94.0, "height": 197, "width": 188 }, "rotation": { "yaw": -27.97019577026367, "roll": 2.206029415130615, "pitch": 0.0}, "confidence": 3.053506851196289, "landmarks": { "1": { "y": 180.87722778320312, "x": 124.47326660156205}, "0": { "y": 178.60653686523438, "x": 183.41931152343795}, "2": { "y": 224.5936889648438, "x": 141.62365722656205 }}} </code></pre> <p>I only need the bounding box information. There are a few examples on how to write read_and_decode-functions, and I'm trying to transform these examples into a function for json files, but there are still a lot of questions...: </p> <pre class="prettyprint"><code>def read_and_decode(filename_queue): reader = tf.WhichKindOfReader() # ??? _, serialized_example = reader.read(filename_queue) features = tf.parse_single_example( serialized_example, features={ 'bounding_box':{ 'y': tf.VarLenFeature(<whatstheproperdatatype>) ??? 'x': 'height': 'width': # I only need the bounding box... - do I need to write # the format information for the other features...??? } }) y=tf.decode() # decoding necessary? x= height= width= return x,y,height,width </code></pre> <p>I've done research on the internet for hours, but can't find anything really detailled on how to read json in tensorflow... </p> <p>Maybe someone can give me a clue...</p>

<h3>Update</h3> <p>The solution below does get the job done but it is not very efficient, see comments for details.</p> <h3>Original answer</h3> <p>You can use standard python json parsing with TensorFlow if you wrap the functions with <code>tf.py_func</code>:</p> <pre class="prettyprint lang-py prettyprint-override"><code>import json import numpy as np import tensorflow as tf def get_bbox(str): obj = json.loads(str.decode('utf-8')) bbox = obj['bounding_box'] return np.array([bbox['x'], bbox['y'], bbox['height'], bbox['width']], dtype='f') def get_multiple_bboxes(str): return [[get_bbox(x) for x in str]] raw = tf.placeholder(tf.string, [None]) [parsed] = tf.py_func(get_multiple_bboxes, [raw], [tf.float32]) </code></pre> <p>Note that <code>tf.py_func</code> returns a <em>list of tensors</em> rather than just a single tensor, which is why we need to wrap <code>parsed</code> in a list <code>[parsed]</code>. If not, <code>parsed</code> would get the shape <code>[1, None, 4]</code> rather than the desired shape <code>[None, 4]</code> (where <code>None</code> is the batch size).</p> <p>Using your data you get the following results:</p> <pre class="prettyprint lang-py prettyprint-override"><code>json_string = """{ "bounding_box": { "y": 98.5, "x": 94.0, "height": 197, "width": 188 }, "rotation": { "yaw": -27.97019577026367, "roll": 2.206029415130615, "pitch": 0.0}, "confidence": 3.053506851196289, "landmarks": { "1": { "y": 180.87722778320312, "x": 124.47326660156205}, "0": { "y": 178.60653686523438, "x": 183.41931152343795}, "2": { "y": 224.5936889648438, "x": 141.62365722656205 }}}""" my_data = np.array([json_string, json_string, json_string]) init_op = tf.initialize_all_variables() with tf.Session() as sess: sess.run(init_op) print(sess.run(parsed, feed_dict={raw: my_data})) print(sess.run(tf.shape(parsed), feed_dict={raw: my_data})) </code></pre> <pre class="prettyprint lang-none prettyprint-override"><code>[[ 94. 98.5 197. 188. ] [ 94. 98.5 197. 188. ] [ 94. 98.5 197. 188. ]] [3 4] </code></pre>

How to read json files in Tensorflow?

Tags:

python

json

neural-network

tensorflow

I'm trying to write a function, that reads json files in tensorflow. The json files have the following structure:

Click to copy

{
    "bounding_box": {
        "y": 98.5, 
        "x": 94.0, 
        "height": 197, 
        "width": 188
     }, 
    "rotation": {
        "yaw": -27.97019577026367,
        "roll": 2.206029415130615, 
        "pitch": 0.0}, 
        "confidence": 3.053506851196289, 
        "landmarks": {
            "1": {
                "y": 180.87722778320312, 
                "x": 124.47326660156205}, 
            "0": {
                "y": 178.60653686523438, 
                "x": 183.41931152343795}, 
            "2": {
                "y": 224.5936889648438, 
                "x": 141.62365722656205
}}}

I only need the bounding box information. There are a few examples on how to write read_and_decode-functions, and I'm trying to transform these examples into a function for json files, but there are still a lot of questions...:

Click to copy

def read_and_decode(filename_queue):

  reader = tf.WhichKindOfReader() # ??? 
  _, serialized_example = reader.read(filename_queue)
  features = tf.parse_single_example( 
      serialized_example,

      features={

          'bounding_box':{ 

              'y': tf.VarLenFeature(<whatstheproperdatatype>) ???
              'x': 
              'height': 
              'width': 

          # I only need the bounding box... - do I need to write 
          # the format information for the other features...???

          }
      })

  y=tf.decode() # decoding necessary?
  x=
  height=
  width= 

  return x,y,height,width

I've done research on the internet for hours, but can't find anything really detailled on how to read json in tensorflow...

Maybe someone can give me a clue...

241

asked Jul 14 '16 18:07

meridius

2 Answers

Update

The solution below does get the job done but it is not very efficient, see comments for details.

Original answer

You can use standard python json parsing with TensorFlow if you wrap the functions with tf.py_func:

Click to copy

import json
import numpy as np
import tensorflow as tf

def get_bbox(str):
    obj = json.loads(str.decode('utf-8'))
    bbox = obj['bounding_box']
    return np.array([bbox['x'], bbox['y'], bbox['height'], bbox['width']], dtype='f')

def get_multiple_bboxes(str):
    return [[get_bbox(x) for x in str]]

raw = tf.placeholder(tf.string, [None])
[parsed] = tf.py_func(get_multiple_bboxes, [raw], [tf.float32])

Note that tf.py_func returns a list of tensors rather than just a single tensor, which is why we need to wrap parsed in a list [parsed]. If not, parsed would get the shape [1, None, 4] rather than the desired shape [None, 4] (where None is the batch size).

Using your data you get the following results:

Click to copy

json_string = """{
    "bounding_box": {
        "y": 98.5,
        "x": 94.0,
        "height": 197,
        "width": 188
     },
    "rotation": {
        "yaw": -27.97019577026367,
        "roll": 2.206029415130615,
        "pitch": 0.0},
        "confidence": 3.053506851196289,
        "landmarks": {
            "1": {
                "y": 180.87722778320312,
                "x": 124.47326660156205},
            "0": {
                "y": 178.60653686523438,
                "x": 183.41931152343795},
            "2": {
                "y": 224.5936889648438,
                "x": 141.62365722656205
}}}"""
my_data = np.array([json_string, json_string, json_string])

init_op = tf.initialize_all_variables()
with tf.Session() as sess:
    sess.run(init_op)
    print(sess.run(parsed, feed_dict={raw: my_data}))
    print(sess.run(tf.shape(parsed), feed_dict={raw: my_data}))

Click to copy

[[  94.    98.5  197.   188. ]
 [  94.    98.5  197.   188. ]
 [  94.    98.5  197.   188. ]]
[3 4]

145

answered Oct 10 '22 10:10

Backlin

This might be skirting the issue, but you could preprocess your data with a command line tool like https://stedolan.github.io/jq/tutorial/ into a line-based data format, like csv. Would possibly be more efficient also.

answered Oct 10 '22 08:10

Shan Carter

Related questions
                            
                                Insert data in AWS Redshift via AWS Lambda
                            
                                Pandas and Cassandra: numpy array format incompatibility
                            
                                python __main__ and __init__ proper usage
                            
                                SciPy interp2D for pairs of coordinates
                            
                                Create possible combinations of specific size
                            
                                How can I use values read from TFRecords as arguments to tf.reshape?
                            
                                how could I use complete penn treebank dataset inside python/nltk
                            
                                python read file non blocking on windows
                            
                                How to fit different inputs into an sklearn Pipeline?
                            
                                python numpy: Change the column type of a numpy matrix
                            
                                Undo L2 Normalization in sklearn python
                            
                                Error solving Matrix equation with numpy
                            
                                batch upload videos to youtube via command line python
                            
                                Python keras how to transform a dense layer into a convolutional layer
                            
                                How to get parameter arguments from a frozen spicy.stats distribution?
                            
                                How to inherit a python generator and overwrite __iter__
                            
                                Why is subprocess.run output different from shell output of same command?
                            
                                pytest -> How to use fixture return value in test method under a class
                            
                                How to search and replace text in an XML file using Python?
                            
                                Add constraints to scipy.optimize.curve_fit?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to read json files in Tensorflow?

Tags:

python

json

neural-network

tensorflow

meridius

People also ask

2 Answers

Update

Original answer

Backlin

Shan Carter

Recent Activity

Donate For Us