Azure Machine Learning Request Response latency

Tags:

I have made an Azure Machine Learning Experiment which takes a small dataset (12x3 array) and some parameters and does some calculations using a few Python modules (a linear regression calculation and some more). This all works fine.

I have deployed the experiment and now want to throw data at it from the front-end of my application. The API-call goes in and comes back with correct results, but it takes up to 30 seconds to calculate a simple linear regression. Sometimes it is 20 seconds, sometimes only 1 second. I even got it down to 100 ms one time (which is what I'd like), but 90% of the time the request takes more than 20 seconds to complete, which is unacceptable.

I guess it has something to do with it still being an experiment, or it is still in a development slot, but I can't find the settings to get it to run on a faster machine.

Is there a way to speed up my execution?

Edit: To clarify: The varying timings are obtained with the same test data, simply by sending the same request multiple times. This made me conclude it must have something to do with my request being put in a queue, there is some start-up latency or I'm throttled in some other way.

778

asked Jan 25 '16 10:01

JrtPec

1 Answers

First, I am assuming you are doing your timing test on the published AML endpoint.

When a call is made to the AML the first call must warm up the container. By default a web service has 20 containers. Each container is cold, and a cold container can cause a large(30 sec) delay. In the string returned by the AML endpoint, only count requests that have the isWarm flag set to true. By smashing the service with MANY requests(relative to how many containers you have running) can get all your containers warmed.

If you are sending out dozens of requests a instance, the endpoint might be getting throttled. You can adjust the number of calls your endpoint can accept by going to manage.windowsazure.com/

manage.windowsazure.com/
Azure ML Section from left bar
select your workspace
go to web services tab
Select your web service from list
adjust the number of calls with slider

By enabling debugging onto your endpoint you can get logs about the execution time for each of your modules to complete. You can use this to determine if a module is not running as you intended which may add to the time.

Overall, there is an overhead when using the Execute python module, but I'd expect this request to complete in under 3 secs.

answered Sep 29 '22 14:09

Dan Ciborowski - MSFT

Related questions
                            
                                Determining pixel height of a string with newlines in Pillow
                            
                                How to set default value using SqlAlchemy_Utils ChoiceType
                            
                                Python Pandas sorting by multiindex and column
                            
                                count occurrences of arrays in multidimensional arrays in python
                            
                                How does @mock.patch know which parameter to use for each mock object?
                            
                                Is there a way to convert unicode to the nearest ASCII equivalent? [duplicate]
                            
                                importerror: no module named flask.ext.script
                            
                                Python Bag of Words clustering
                            
                                Generating gradient map of 2D array
                            
                                Using string labels in Tensorflow
                            
                                split list by certain repeated index value
                            
                                How to tell scikit-learn for which label the F-1/precision/recall score is given (in binary classification)?
                            
                                How can I train a simple, non-linear regression model with tensor flow?
                            
                                Why does .mro() on a metaclass have a different signature? `descriptor 'mro' of 'type' object needs an argument`
                            
                                How can I find the location of the source code of a built-in Python method?
                            
                                py2exe fails to create EXE due to missing DLL when opencv is imported
                            
                                NumPy array indexing a 4D array
                            
                                Install local dist package into virtualenv
                            
                                Iterating over queryset values
                            
                                Neural network backpropagation algorithm not working in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Azure Machine Learning Request Response latency

Tags:

python

azure

azure-machine-learning-studio

JrtPec

People also ask

1 Answers

Dan Ciborowski - MSFT

Recent Activity

Donate For Us