How to serve a TensorFlow Hub module, specifically the Universal Sentence Encoder?

I have spent several hours trying to set up TensorFlow Serving for the TensorFlow Hub module "Universal Sentence Encoder". There is a similar question here:

How to make the tensorflow hub embeddings servable using tensorflow serving?

I have been doing this on a Windows machine.

This is the code I used to build the model:

import tensorflow as tf
import tensorflow_hub as hub

MODEL_NAME = 'test'
VERSION = 1
SERVE_PATH = './models/{}/{}'.format(MODEL_NAME, VERSION)

with tf.Graph().as_default():
  module = hub.Module("https://tfhub.dev/google/universal-sentence-encoder/1")
  text = tf.placeholder(tf.string, [None])
  embedding = module(text)

  init_op = tf.group([tf.global_variables_initializer(),
                      tf.tables_initializer()])

  with tf.Session() as session:
    session.run(init_op)
    tf.saved_model.simple_save(
        session,
        SERVE_PATH,
        inputs={"text": text},
        outputs={"embedding": embedding},
        legacy_init_op=tf.tables_initializer()
    )

I have gotten to the point where running the following line:

saved_model_cli show --dir ${PWD}/models/test/1 --tag_set serve --signature_def serving_default

gives me the following result:

The given SavedModel SignatureDef contains the following input(s):
  inputs['text'] tensor_info:
      dtype: DT_STRING
      shape: (-1)
      name: Placeholder:0
The given SavedModel SignatureDef contains the following output(s):
  outputs['embedding'] tensor_info:
      dtype: DT_FLOAT
      shape: (-1, 512)
      name: module_apply_default/Encoder_en/hidden_layers/l2_normalize:0

I have then tried running:

saved_model_cli run --dir ${PWD}/models/test/1 --tag_set serve --signature_def serving_default --input_exprs 'text=["what this is"]'

which gives the error:

  File "<string>", line 1
[what this is]
         ^
SyntaxError: invalid syntax

I've tried changing the format of the part 'text=["what this is"]', but nothing worked for me.
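
For what it's worth, the export itself can also be sanity-checked directly from Python, sidestepping the CLI quoting entirely. A minimal sketch (assuming TensorFlow 1.x, and using the tensor names reported by saved_model_cli show above):

import tensorflow as tf

export_dir = './models/test/1'

with tf.Session(graph=tf.Graph()) as session:
  # load() restores the variables and runs the legacy_init_op,
  # so the module's lookup tables get initialized
  tf.saved_model.loader.load(session, ['serve'], export_dir)
  # tensor names taken from the serving_default signature above
  embeddings = session.run(
      'module_apply_default/Encoder_en/hidden_layers/l2_normalize:0',
      feed_dict={'Placeholder:0': ['what this is']})
  print(embeddings.shape)  # expected: (1, 512)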

Regardless of whether the CLI check works, the main goal is to set up the module for serving and create a callable API.

I've tried Docker with the following command:

docker run -p 8501:8501 --name tf-serve -v ${PWD}/models/:/models -t tensorflow/serving --model_base_path=/models/test

Things appear to be set up properly:

Building single TensorFlow model file config:  model_name: model model_base_path: /models/test
2018-10-09 07:05:08.692140: I tensorflow_serving/model_servers/server_core.cc:462] Adding/updating models.
2018-10-09 07:05:08.692301: I tensorflow_serving/model_servers/server_core.cc:517]  (Re-)adding model: model
2018-10-09 07:05:08.798733: I tensorflow_serving/core/basic_manager.cc:739] Successfully reserved resources to load servable {name: model version: 1}
2018-10-09 07:05:08.798841: I tensorflow_serving/core/loader_harness.cc:66] Approving load for servable version {name: model version: 1}
2018-10-09 07:05:08.798870: I tensorflow_serving/core/loader_harness.cc:74] Loading servable version {name: model version: 1}
2018-10-09 07:05:08.798904: I external/org_tensorflow/tensorflow/contrib/session_bundle/bundle_shim.cc:360] Attempting to load native SavedModelBundle in bundle-shim from: /models/test/1
2018-10-09 07:05:08.798947: I external/org_tensorflow/tensorflow/cc/saved_model/reader.cc:31] Reading SavedModel from: /models/test/1
2018-10-09 07:05:09.055822: I external/org_tensorflow/tensorflow/cc/saved_model/reader.cc:54] Reading meta graph with tags { serve }
2018-10-09 07:05:09.338142: I external/org_tensorflow/tensorflow/core/platform/cpu_feature_guard.cc:141] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2018-10-09 07:05:09.576751: I external/org_tensorflow/tensorflow/cc/saved_model/loader.cc:162] Restoring SavedModel bundle.
2018-10-09 07:05:28.975611: I external/org_tensorflow/tensorflow/cc/saved_model/loader.cc:138] Running MainOp with key saved_model_main_op on SavedModel bundle.
2018-10-09 07:06:30.941577: I external/org_tensorflow/tensorflow/cc/saved_model/loader.cc:259] SavedModel load for tags { serve }; Status: success. Took 82120946 microseconds.
2018-10-09 07:06:30.990252: I tensorflow_serving/servables/tensorflow/saved_model_warmup.cc:83] No warmup data file found at /models/test/1/assets.extra/tf_serving_warmup_requests
2018-10-09 07:06:31.046262: I tensorflow_serving/core/loader_harness.cc:86] Successfully loaded servable version {name: model version: 1}
2018-10-09 07:06:31.184541: I tensorflow_serving/model_servers/server.cc:285] Running gRPC ModelServer at 0.0.0.0:8500 ...
[warn] getaddrinfo: address family for nodename not supported
2018-10-09 07:06:31.221644: I tensorflow_serving/model_servers/server.cc:301] Exporting HTTP/REST API at:localhost:8501 ...
[evhttp_server.cc : 235] RAW: Entering the event loop ...

I've tried

curl http://localhost:8501/v1/models/test

which gives

{ "error": "Malformed request: GET /v1/models/test:predict" }

and

curl -d '{"text": "Hello"}' -X POST http://localhost:8501/v1/models/test:predict

which gives

{ "error": "JSON Parse error: Invalid value. at offset: 0" }

A similar question is here:

Tensorflow Serving: Rest API returns "Malformed request" error

Just looking for any way to get this module serving. Thanks.


1 Answer

I was finally able to figure things out. I'll post what I did here in case someone else is trying to do the same thing.

My issue with the saved_model_cli run command was the quoting (I was using the Windows command prompt): changing 'text=["what this is"]' to "text=['what this is']" fixed it.
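
With that change, the full command becomes:

saved_model_cli run --dir ${PWD}/models/test/1 --tag_set serve --signature_def serving_default --input_exprs "text=['what this is']"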

The issue with the POST request was two-fold. First, I noticed that the model is served under the default name model, so the URL should have been http://localhost:8501/v1/models/model:predict
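
(Alternatively, I believe the served name can be set explicitly by appending --model_name=test to the docker run command above, which would keep the original test URL working; I have not verified this:)

docker run -p 8501:8501 --name tf-serve -v ${PWD}/models/:/models -t tensorflow/serving --model_base_path=/models/test --model_name=test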

Second, the input format was not correct. I used Postman, and the body of the request looks like this: {"inputs": {"text": ["Hello"]}}
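
For completeness, here is a minimal sketch of the same request from Python (assuming the requests package; the model name model and the "inputs" body format follow from the two fixes above):

import requests

resp = requests.post(
    'http://localhost:8501/v1/models/model:predict',
    json={'inputs': {'text': ['Hello']}})
# with the columnar "inputs" request format, TensorFlow Serving returns the
# embeddings under an "outputs" key, one 512-dimensional vector per sentence
print(resp.json()['outputs'])

The equivalent curl call (with the same shell quoting caveats as above) is:

curl -d '{"inputs": {"text": ["Hello"]}}' -X POST http://localhost:8501/v1/models/model:predict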
