I just found that with Amazon's Elastic MapReduce, I can give a step one of three ActionOnFailure choices: <ul> <li>TERMINATE_JOB_FLOW</li> <li>CANCEL_AND_WAIT</li> <li>CONTINUE</li> </ul> TERMINATE_JOB_FLOW is the default and its meaning is obvious: it shuts down the entire cluster when the step fails. What is the difference between CANCEL_AND_WAIT and CONTINUE? It appears to me that both will keep the cluster running and simply move on to the next step when it is added.

Elastic Map Reduce: difference between CANCEL_AND_WAIT and CONTINUE?

1 Answer

Say you have launched a cluster and added the following three steps to it:

Step1
Step2
Step3

If Step1 has ActionOnFailure set to CANCEL_AND_WAIT, then when Step1 fails, all the remaining steps are cancelled and the cluster goes into the Waiting state. I guess this is also the default behaviour if you launch your cluster with the --stay-alive option.

If Step1 has ActionOnFailure set to CONTINUE, then when Step1 fails, execution continues with Step2.

If Step1 has ActionOnFailure set to TERMINATE_JOB_FLOW, then when Step1 fails, the cluster shuts down, as you mentioned.
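
To make the three cases concrete, here is a minimal Python sketch. It is a toy model of the behaviour described above, not EMR itself. The step dictionaries follow the shape that boto3's `add_job_flow_steps` accepts, but `make_step` and `simulate` are hypothetical helpers invented for this illustration, the `command-runner.jar` arguments are placeholders, and the model assumes a long-running cluster that returns to a waiting state once its steps are done.

```python
from typing import Dict, List, Set, Tuple

ALLOWED_ACTIONS = {"TERMINATE_JOB_FLOW", "CANCEL_AND_WAIT", "CONTINUE"}


def make_step(name: str, action_on_failure: str) -> Dict:
    """Build a step definition (hypothetical helper, placeholder command)."""
    if action_on_failure not in ALLOWED_ACTIONS:
        raise ValueError(f"unknown ActionOnFailure: {action_on_failure}")
    return {
        "Name": name,
        "ActionOnFailure": action_on_failure,
        "HadoopJarStep": {
            "Jar": "command-runner.jar",  # placeholder; use your own jar
            "Args": ["echo", name],       # placeholder arguments
        },
    }


def simulate(steps: List[Dict], failing: Set[str]) -> Tuple[Dict[str, str], str]:
    """Toy model: return (state of each step, final cluster state).

    `failing` names the steps we pretend will fail. Assumes a
    long-running cluster, so it ends up WAITING unless terminated.
    """
    states: Dict[str, str] = {}
    cluster = "WAITING"
    halted = False  # True once a failure has stopped the queue
    for step in steps:
        name = step["Name"]
        if halted:
            states[name] = "CANCELLED"
            continue
        if name not in failing:
            states[name] = "COMPLETED"
            continue
        states[name] = "FAILED"
        action = step["ActionOnFailure"]
        if action == "TERMINATE_JOB_FLOW":
            cluster = "TERMINATED"
            halted = True
        elif action == "CANCEL_AND_WAIT":
            halted = True
        # CONTINUE: nothing to do, the loop moves on to the next step


    return states, cluster


if __name__ == "__main__":
    for action in ("CANCEL_AND_WAIT", "CONTINUE", "TERMINATE_JOB_FLOW"):
        steps = [
            make_step("Step1", action),
            make_step("Step2", "CONTINUE"),
            make_step("Step3", "CONTINUE"),
        ]
        print(action, simulate(steps, failing={"Step1"}))

    # Submitting such steps to a real cluster would look roughly like this
    # (the cluster id is a placeholder):
    # import boto3
    # boto3.client("emr").add_job_flow_steps(JobFlowId="j-XXXXXXXXXXXXX", Steps=steps)
```

In this model the only difference between CANCEL_AND_WAIT and CONTINUE is what happens to the steps already queued behind the failed one: they are cancelled in the first case and still run in the second. The cluster stays up either way.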

165

answered Nov 02 '22 14:11

Amar


Tags: boto, amazon-emr, elastic-map-reduce

Suman
