How to fix intermittent 503 Service Unavailable after idling/redeployments on AWS HTTP API Gateway & Fargate/ECS?

Problem:

We receive intermittent HTTP 503 Service Unavailable for some of our requests. A new deployment (with task redeployment) increases the rate, but even after 10-15 minutes they still occur intermittently.

In Cloud Watch we see the failing 503 Requests

2020-06-05T14:19:01.810+02:00 xx.117.163.xx - - [05/Jun/2020:12:19:01 +0000] "GET ANY /api/{proxy+} HTTP/1.1" 503 33 Np24bwmwsiasJDQ=

but it seems like they do not reach a living backend instance.

We enabled VPC Flow Logs and it seems that HTTP API Gateway wants to route some requests to stopped tasks even after they've gone long for good (far exceeding 60s).

More puzzling: If we keep the system busy, the rate drops to nearly zero. Otherwise after a longer period of idling the intermittent errors seem to reoccur.

Questions

How can we fix this issue?
Are there options to further pinpoint the root issue?

231

asked Jun 05 '20 14:06

bentolor

1 Answers

I was facing this issues and solved it by configuring my ALB being internal, instead of internet-facing(regarding the scheme). Hope it may help someone with the same issue.

Context: The environment is API Gateway + ALB(ECS)

Update The first ALB I configured was to manage my backend services. Recently I also did another ALB(to deal with my front-end instances), in this case, I exposed a public IP(instead of just a private one). This could be achieved by changing the scheme to internet-facing, at first I thought this would bring the same problem as I had before, then I figured that it was something pretty simple. I just needed to add a policy to allow traffic from the internet to the ALB I created.

answered Nov 15 '22 19:11

xalves

Related questions
                            
                                How do I access the current user in a cloudformation template?
                            
                                How to make browser cache identical image with different aws s3 presigned url?
                            
                                Can AWS step function executes more than 25000 times?
                            
                                SNS Mobile push notifications extremely confused
                            
                                AWS ECS service Tasks getting replaced with (reason Request timed out)
                            
                                How do I remove a "grantee" user from S3 permissions tab?
                            
                                AWS CloudSearch: different documents in 1 domain?
                            
                                DynamoDB API: How can I build an "add JSON attribute if not present" update request?
                            
                                SES error missing final '@domain'
                            
                                Upload file to Amazon S3 from Android slow
                            
                                Inform browser clients when Lambda function is done using Amazon SQS
                            
                                using aws-sdk to upload images to s3 using nodejs
                            
                                Access AWS S3 bucket from another account using roles
                            
                                Running a simple HTTPS Node JS Server on Amazon EC2
                            
                                How to encrypt AWS Lambda environment variables using CloudFormation
                            
                                Refresh AWS Quicksight automatically [closed]
                            
                                How to debug "Missing Authentication Token" in AWS API Gateway?
                            
                                Auto Delete SQS queue
                            
                                AWS Java SDK: AbortedException on call to AmazonSQSClient.receiveMessage
                            
                                Netflix Zuul/Ribbon/Eureka vs AWS ELB/ALB & ECS

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to fix intermittent 503 Service Unavailable after idling/redeployments on AWS HTTP API Gateway & Fargate/ECS?

Tags:

amazon-web-services

aws-api-gateway

aws-fargate