How to run a pre-trained model in AWS SageMaker?

I have a pre-trained model.pkl file, along with all the other files related to the ML model, and I want to deploy it on AWS SageMaker without training it there. As far as I can tell, the fit() method runs the training job and pushes model.tar.gz to an S3 location, and deploy() then uses that same S3 location to deploy the model. We don't create that location manually; SageMaker creates it and names it with a timestamp. How can I put my own model.tar.gz file in an S3 location and call deploy() against that location?

387

asked Oct 10 '19 16:10

tarun mittal

1 Answer

All you need is:

to have your model in an arbitrary S3 location, packaged as a model.tar.gz archive
to have an inference script in a SageMaker-compatible Docker image that can read your model.pkl, serve it and handle inference requests
to create an endpoint that associates your model artifact with your inference code

When you request an endpoint deployment, SageMaker takes care of downloading your model.tar.gz and extracting it to the appropriate location in the Docker image of the server, which is /opt/ml/model.
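As a minimal sketch of the packaging step, the helper below puts model.pkl at the root of a model.tar.gz archive using only the standard library. The function names `package_model` and `upload_archive`, and the bucket and prefix arguments, are hypothetical; the upload helper assumes the SageMaker Python SDK is installed and AWS credentials are configured.

```python
import tarfile
from pathlib import Path


def package_model(model_path: str, archive_path: str = "model.tar.gz") -> str:
    """Put a model file at the root of a gzip-compressed tar archive."""
    model_file = Path(model_path)
    with tarfile.open(archive_path, "w:gz") as tar:
        # arcname drops the local directories, so the file sits at the
        # archive root and ends up directly under /opt/ml/model.
        tar.add(str(model_file), arcname=model_file.name)
    return archive_path


def upload_archive(archive_path: str, bucket: str, key_prefix: str) -> str:
    """Upload the archive and return its s3:// URI (needs AWS credentials)."""
    import sagemaker  # imported lazily so packaging works without the SDK

    return sagemaker.Session().upload_data(
        archive_path, bucket=bucket, key_prefix=key_prefix
    )
```

Any other upload method (the AWS CLI, boto3, the console) should work equally well, since SageMaker only needs the resulting S3 URI.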

Depending on the framework you use, you may be able to use a pre-existing Docker image (available for Scikit-learn, TensorFlow, PyTorch, MXNet), or you may need to create your own.

Regarding custom image creation, see the SageMaker container specification and the two examples of custom containers for R and sklearn (the sklearn one is less relevant now that there is a pre-built Docker image along with a SageMaker sklearn SDK).
Regarding leveraging existing containers for Sklearn, PyTorch, MXNet and TF, check this example: Random Forest in SageMaker Sklearn container. In this example, nothing prevents you from deploying a model that was trained elsewhere. Note, though, that if the training and deployment environments don't match, you may run into errors caused by software version differences.
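For the pre-built Scikit-learn container, the inference script can be quite small. The sketch below assumes the model was saved with pickle as model.pkl and that the script is saved under a hypothetical name such as inference.py. The container calls `model_fn` with the directory where model.tar.gz was extracted, and `predict_fn` with the deserialized request and the loaded model.

```python
import os
import pickle


def model_fn(model_dir):
    """Load the model once, when the serving container starts."""
    # model_dir is the folder where SageMaker extracted model.tar.gz.
    with open(os.path.join(model_dir, "model.pkl"), "rb") as f:
        return pickle.load(f)


def predict_fn(input_data, model):
    """Run inference on input that the container has already deserialized."""
    return model.predict(input_data)
```

Unpickling generally needs the same library versions that were used to save the model, which is where the version mismatch mentioned above tends to show up.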

Regarding your following experience:

when deploy method is used it uses the same s3 location to deploy the model, we don't manual create the same location in s3 as it is created by the aws model and name it given by using some timestamp

I agree that the demos that use the SageMaker Python SDK (one of the many available SDKs for SageMaker) can sometimes be misleading, in the sense that they often leverage the fact that an Estimator that has just been trained can be deployed (Estimator.deploy(..)) in the same session, without having to instantiate the intermediary model concept that maps inference code to model artifact. This design is presumably chosen for the sake of code compactness, but in real life, training and deployment of a given model may well be done from different scripts running in different systems. It's perfectly possible to deploy a model without having trained it previously in the same session: you need to instantiate a sagemaker.model.Model object and then deploy it.
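A minimal sketch of that last step, assuming a scikit-learn model and the SageMaker Python SDK: `SKLearnModel` is a framework-specific subclass of `sagemaker.model.Model`. The function name `deploy_pretrained`, the entry point name, the framework version and the instance type are illustrative assumptions, not values from the question.

```python
def deploy_pretrained(
    model_data,
    role,
    entry_point="inference.py",    # hypothetical inference script name
    framework_version="1.2-1",     # assumed; match your training environment
    instance_type="ml.m5.large",   # assumed instance type
):
    """Deploy an existing model.tar.gz from S3 without calling fit()."""
    # Imported lazily so this module can be loaded without the SDK installed.
    from sagemaker.sklearn.model import SKLearnModel

    model = SKLearnModel(
        model_data=model_data,     # s3:// URI of your own model.tar.gz
        role=role,                 # IAM role ARN that SageMaker assumes
        entry_point=entry_point,
        framework_version=framework_version,
        py_version="py3",
    )
    # Creates the model, the endpoint configuration and the endpoint.
    return model.deploy(initial_instance_count=1, instance_type=instance_type)
```

The returned predictor can then be used to send requests, and the endpoint should be deleted when it is no longer needed, since it is billed while running.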

186

answered Nov 15 '22 06:11

Olivier Cruchant

Tags:

amazon-web-services

machine-learning

model

amazon-sagemaker
