Lambda throttling below concurrency limit

Tags:

We use Lambda to power APIs (via API Gateway) accessed via news media websites, receiving a fluctuating but high load of traffic. We began experiencing throttles, so we raised our concurrency limit to 2000. However, we still experience throttles multiple times per day.

Oddly in CloudWatch metrics, the concurrent requests peak at around 600 or lower when we're throttled. See this CloudWatch chart as an example:

Lambda Throttling in CloudWatch Metrics

Has anyone experienced this before? Why do you think this is happening? What can we do about it?

More Information

This chart is across all Lambdas for our entire region.
When throttling occurs, it happens across all Lambda instances.
We primarily trigger Lambdas via API Gateway, but there's a few that are triggered via SNS (fairly high rate of data).
We have CloudFront in front of all APIs, and with some of them we have a 5 second cache time (for the super frequently requested APIs - saves us $$$)

Additionally, here's an image that also shows total invocation count and average duration over the same time period. It's hard to know what's causal (duration up because of throttling, or vice versa, because some of the lambdas do call other lambdas). Please see the appropriate axis because the scales are quite different.

enter image description here

867

asked Jun 21 '18 04:06

Matthew Blackford

1 Answers

I think this has to do with Lambda concurrency burst limits.

Basically, there's a limit on how many instances of your Lambda function you can run concurrently under sudden load and this limit is different to the overall per-region Lambda concurrency limit.

You can find more information about it here:

https://docs.aws.amazon.com/lambda/latest/dg/scaling.html

The relevant part:

AWS Lambda dynamically scales function execution in response to increased traffic, up to your concurrency limit. Under sustained load, your function's concurrency bursts to an initial level between 500 and 3000 concurrent executions that varies per region. After the initial burst, the function's capacity increases by an additional 500 concurrent executions each minute until either the load is accommodated, or the total concurrency of all functions in the region hits the limit.

197

answered Oct 09 '22 21:10

Commit

Related questions
                            
                                AWS RDS with Postgres : Is OOM killer configured
                            
                                Storing secrets and credentials inside of an Android App
                            
                                How to delete a Sagemaker Ground Truth Labeling Job?
                            
                                AWS API Gateway: why post request body is encoded base64?
                            
                                Is it possible to upload to S3 by just providing a URL? [closed]
                            
                                Amazon AWS: How Do I Programmatically Calculate My Spending?
                            
                                Can I use signed and unsigned urls on the same Cloudfront distribution?
                            
                                Laravel Queue with Amazon SQS
                            
                                Amazon S3 CORS issue with SVG on All major browser
                            
                                Uploading multiple files at same time from local to s3 bucket through node js
                            
                                How do I limit access to S3 Bucket for particular IAM Role?
                            
                                How to query third party JSON API from AWS Lambda function
                            
                                Handling https requests without API Gateway
                            
                                botocore.exceptions.ProfileNotFound when code run on AWS elastic beanstalk, but locally it's OK
                            
                                How to host multiple domains and subdomains on single AWS EC2 instance
                            
                                Replacement for AWS Lambda invokeAsync (deprecated)
                            
                                AWS Cognito not sending verification SMS
                            
                                What is the smallest subnet one can create on AWS in the VPC?
                            
                                Mock AWS services for testing
                            
                                AWS Glue output file name

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Lambda throttling below concurrency limit

Tags:

amazon-web-services

aws-lambda

aws-api-gateway

throttling

Matthew Blackford

People also ask

1 Answers

Commit

Recent Activity

Donate For Us