I deployed a large 3D model to AWS SageMaker. Inference takes two minutes or more, and calling the predictor from Python fails with the following error:
An error occurred (ModelError) when calling the InvokeEndpoint operation: Received server error (0) from model with message "Your invocation timed out while waiting for a response from container model. Review the latency metrics for each container in Amazon CloudWatch, resolve the issue, and try again."
In CloudWatch I also see ping timeouts while the container is processing:
2020-10-07T16:02:39.718+02:00 2020/10/07 14:02:39 [error] 106#106: *251 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.32.0.2, server: , request: "GET /ping HTTP/1.1", upstream: "http://unix:/tmp/gunicorn.sock/ping", host: "model.aws.local:8080"
How do I increase the invocation timeout?
Or is there a way to make asynchronous invocations to a SageMaker endpoint?
It’s currently not possible to increase the timeout; this is tracked in an open GitHub issue. Looking through that issue and similar questions on Stack Overflow, it seems a Batch Transform job is the usual workaround for long-running inference instead of a real-time endpoint.
Related answer: https://stackoverflow.com/a/55642675/806876
SageMaker Python SDK timeout issue: https://github.com/aws/sagemaker-python-sdk/issues/1119
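A rough sketch of the Batch Transform route with boto3 (bucket, model, job name, and instance type below are assumptions, not from your setup). Instead of holding an HTTP connection open, the job reads records from S3 and writes predictions back to S3, so the real-time invocation timeout never applies:

```python
def build_transform_job_args(job_name, model_name, input_s3, output_s3):
    """Assemble keyword arguments for the CreateTransformJob API call.

    All names passed in are placeholders; the model must already be
    registered in SageMaker (the same model behind your endpoint works).
    """
    return {
        "TransformJobName": job_name,
        "ModelName": model_name,
        "TransformInput": {
            "DataSource": {
                "S3DataSource": {"S3DataType": "S3Prefix", "S3Uri": input_s3}
            },
            "ContentType": "application/json",
            "SplitType": "Line",  # one JSON record per line of input
        },
        "TransformOutput": {"S3OutputPath": output_s3},
        "TransformResources": {"InstanceType": "ml.m5.xlarge", "InstanceCount": 1},
    }

args = build_transform_job_args(
    "my-3d-model-job",            # placeholder job name
    "my-3d-model",                # placeholder model name
    "s3://my-bucket/input/",      # placeholder input prefix
    "s3://my-bucket/output/",     # placeholder output path
)

# import boto3
# sm = boto3.client("sagemaker", region_name="eu-central-1")
# sm.create_transform_job(**args)
# Poll progress afterwards with sm.describe_transform_job(TransformJobName=...)
```

Results land as files under the output S3 path, one per input object, so the "async invocation" you asked about is effectively modeled as submit-then-poll.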