TensorFlow in production for real time predictions in high traffic app - how to use?

Tags:

What is the right way to use TensorFlow for real time predictions in a high traffic application.

Ideally I would have a server/cluster running tensorflow listening on a port(s) where I can connect from app servers and get predictions similar to the way databases are used. Training should be done by cron jobs feeding the training data through the network to the same server/cluster.

How does one actually use tensorflow in production? Should I build a setup where the python is running as a server and use the python scripts to get predictions? I'm still new to this but I feel that such script will need to open sessions etc.. which is not scalable. (I'm talking about 100s of predictions/sec).

Any pointer to relevant information will be highly appreciated. I could not find any.

442

asked Feb 15 '16 16:02

Nir

1 Answers

This morning, our colleagues released TensorFlow Serving on GitHub, which addresses some of the use cases that you mentioned. It is a distributed wrapper for TensorFlow that is designed to support high-performance serving of multiple models. It supports both bulk processing and interactive requests from app servers.

For more information, see the basic and advanced tutorials.

answered Sep 29 '22 16:09

mrry

Related questions
                            
                                Python/Scipy 2D Interpolation (Non-uniform Data)
                            
                                Django: why are Django model fields class attributes?
                            
                                What's your folder layout for a Flask app divided in modules?
                            
                                pickling error in python?
                            
                                mod_wsgi and multiple installations of python
                            
                                lxml not adding newlines when inserting a new element into existing xml
                            
                                RFCOMM without pairing using PyBluez on Debian?
                            
                                Multidimensional Scaling Fitting in Numpy, Pandas and Sklearn (ValueError)
                            
                                What part of speech does "s" stand for in WordNet synsets
                            
                                selenium.common.exceptions.WebDriverException: Message: 'Can not connect to GhostDriver'
                            
                                multiprocessing.Process.is_alive() returns True although process has finished, why?
                            
                                argparse argument dependency
                            
                                Multiprocessing of shared list
                            
                                How to Zoom with Axes3D in Matplotlib
                            
                                Why does python print version info to stderr?
                            
                                How to aggregate matching pairs into "connected components" in Python
                            
                                weights option for seaborn distplot?
                            
                                how to add href link in email content when sending email through smtplib
                            
                                PyCharm asks for python interpreter every time project is loaded
                            
                                Python Pandas inferring column datatypes

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

TensorFlow in production for real time predictions in high traffic app - how to use?

Tags:

python

machine-learning

tensorflow

tensorflow-serving

Nir

People also ask

1 Answers

mrry

Recent Activity

Donate For Us