Splittling SQS Lambda batch into partial success/partial failure

Tags:

The AWS SQS -> Lambda integration allows you to process incoming messages in a batch, where you configure the maximum number you can receive in a single batch. If you throw an exception during processing, to indicate failure, all the messages are not deleted from the incoming queue and can be picked up by another lambda for processing once the visibility timeout has passed.

Is there any way to keep the batch processing, for performance reasons, but allow some messages from the batch to succeed (and be deleted from the inbound queue) and only leave some of the batch un-deleted?

880

asked May 21 '19 08:05

matt freake

1 Answers

The problem with manually re-enqueueing the failed messages to the queue is that you can get into an infinite loop where those items perpetually fail and get re-enqueued and fail again. Since they are being resent to the queue their retry count gets reset every time which means they'll never fail out into a dead letter queue. You also lose the benefits of the visibility timeout. This is also bad for monitoring purposes since you'll never be able to know if you're in a bad state unless you go manually check your logs.

A better approach would be to manually delete the successful items and then throw an exception to fail the rest of the batch. The successful items will be removed from the queue, all the items that actually failed will hit their normal visibility timeout periods and retain their receive count values, and you'll be able to actually use and monitor a dead letter queue. This is also overall less work than the other approach.

Considerations

Only override the default behavior if there has been a partial batch failure. If all the items succeeded, let the default behavior take its course
Since you're tracking the failures of each queue item, you'll need to catch and log each exception as they come in so that you can see what's going on later

139

answered Sep 21 '22 21:09

cdzar

Related questions
                            
                                When to use terraform vs serverless framework to deploy AWS lambdas and surrounding resources? [closed]
                            
                                Does AWS Lambda charge for the time spent initializing code?
                            
                                Unzipped size must be smaller than 262144000 bytes - AWS Lambda Error
                            
                                SQS fifo queues not ensuring single time delivery when used as lambda trigger
                            
                                Pass cookie to CloudFront origin but prevent from caching
                            
                                AWS / Python Lambda function checking if a query string is present
                            
                                AWS Lambda not importing Asyncio
                            
                                Lambda processing same SNS event multiple times?
                            
                                AWS API Gateway error response generates 502 "Bad Gateway"
                            
                                Does AWS charge for lambda in sleep state
                            
                                Nodejs API call returning undefined to lambda function
                            
                                Export the Lambda ARN
                            
                                AWS Lambda: Clarification on retrieving data from event object
                            
                                AWS DynamoDB trigger using Lambda in JAVA
                            
                                How can I access a local API using Amazon Alexa
                            
                                AWS MediaConvert on media that has no audio
                            
                                Split S3 file into smaller files of 1000 lines
                            
                                django-zappa: Error loading psycopg2 module: libpq.so.5: cannot open shared object file: No such file or directory
                            
                                aws cdk appsync Schema Creation Status is FAILED with details: Internal Failure while saving the schema
                            
                                Adding parameters to aws lambda events using templates

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Splittling SQS Lambda batch into partial success/partial failure

Tags:

amazon-sqs

aws-lambda

matt freake

People also ask

1 Answers

Considerations

cdzar

Recent Activity

Donate For Us