
AWS ECS Task Memory Hard and Soft Limits

I'm confused about the purpose of having both hard and soft memory limits for ECS task definitions.

IIRC the soft limit is how much memory the scheduler reserves on an instance for the task to run, and the hard limit is how much memory a container can use before it is murdered.

My issue is that if the ECS scheduler allocates tasks to instances based on the soft limit, you could have a situation where a task that is using memory above the soft limit but below the hard limit could cause the instance to exceed its max memory (assuming all other tasks are using memory slightly below or equal to their soft limit).
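
(A quick numeric illustration of what I mean: suppose an instance has 4 GiB of memory and the scheduler places four tasks on it, each with a 1 GiB soft limit and a 2 GiB hard limit. Placement looks fine at 4 × 1 GiB reserved, but if each task actually uses 1.5 GiB, above its soft limit yet below its hard limit, total demand is 6 GiB on a 4 GiB instance.)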

Is this correct?

Thanks

asked Jun 26 '17 by maambmb



2 Answers

If you expect to run a compute workload that is primarily memory bound rather than CPU bound, then you should use only the hard limit, not the soft limit. From the docs:

You must specify a non-zero integer for one or both of memory or memoryReservation in container definitions. If you specify both, memory must be greater than memoryReservation. If you specify memoryReservation, then that value is subtracted from the available memory resources for the container instance on which the container is placed; otherwise, the value of memory is used.

http://docs.aws.amazon.com/AmazonECS/latest/developerguide/task_definition_parameters.html
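
For instance, a minimal container definition that sets both values might look like this (the task family, container name, image, and sizes here are illustrative, not taken from the docs):

    {
      "family": "example-task",
      "containerDefinitions": [
        {
          "name": "web",
          "image": "nginx:latest",
          "essential": true,
          "memory": 512,
          "memoryReservation": 256
        }
      ]
    }

Here the 256 MiB memoryReservation is what the scheduler subtracts from the instance's available memory at placement time, while the 512 MiB memory value is the hard limit Docker enforces.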

By specifying only a hard memory limit for your tasks you avoid running out of memory, because ECS stops placing tasks on the instance once its memory is fully allocated, and Docker kills any container that tries to exceed its hard limit.

The soft memory limit is designed for CPU bound applications where you want to reserve a small minimum of memory (the soft limit) but allow occasional bursts up to the hard limit. In this type of CPU heavy workload you don't care much about the exact memory usage of each container, because the containers will run out of CPU long before they exhaust the memory of the instance, so you can place tasks based on CPU reservation and the soft memory limit. In this setup the hard limit is just a failsafe in case something goes out of control or there is a memory leak.

So in summary, you should evaluate your workload using load tests and see whether it tends to run out of CPU first or out of memory first. If you are CPU bound, you can use the soft memory limit with an optional hard limit just as a failsafe (see the sketch below). If you are memory bound, you will need to use just the hard limit with no soft limit.
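
As a sketch of the CPU bound setup described above (the family, image, and numbers are hypothetical):

    {
      "family": "cpu-bound-worker",
      "containerDefinitions": [
        {
          "name": "worker",
          "image": "my-org/worker:latest",
          "essential": true,
          "cpu": 512,
          "memoryReservation": 128,
          "memory": 1024
        }
      ]
    }

Placement is driven by the 512 CPU units (half a vCPU) and the small 128 MiB reservation; the 1024 MiB hard limit exists only as a failsafe against runaway memory use.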

answered Oct 10 '22 by nathanpeck


@nathanpeck is the authority here, but I just wanted to address a specific scenario that you brought up:

My issue is that if the ECS scheduler allocates tasks to instances based on the soft limit, you could have a situation where a task that is using memory above the soft limit but below the hard limit could cause the instance to exceed its max memory (assuming all other tasks are using memory slightly below or equal to their soft limit).

This post from AWS explains what occurs in such a scenario:

If containers try to consume memory between these two values (or between the soft limit and the host capacity if a hard limit is not set), they may compete with each other. In this case, what happens depends on the heuristics used by the Linux kernel's OOM (Out of Memory) killer. ECS and Docker are both uninvolved here; it's the Linux kernel reacting to memory pressure. If something is above its soft limit, it's more likely to be killed than something below its soft limit, but figuring out which process gets killed requires knowing all the other processes on the system and what they are doing with their memory as well. Again, the new memory feature we announced can come to the rescue here. While the OOM behavior isn't changing, now containers can be configured to swap out to disk in a memory pressure scenario. This can potentially alleviate the need for the OOM killer to kick in (if containers are configured to swap).
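
The swap behavior referenced at the end of that quote is configured per container through the linuxParameters block of the container definition (EC2 launch type only; the family, image, and values below are illustrative):

    {
      "family": "swap-example",
      "containerDefinitions": [
        {
          "name": "app",
          "image": "my-org/app:latest",
          "essential": true,
          "memory": 512,
          "memoryReservation": 256,
          "linuxParameters": {
            "maxSwap": 512,
            "swappiness": 60
          }
        }
      ]
    }

With maxSwap set, a container under memory pressure can swap up to 512 MiB to disk instead of immediately triggering the OOM killer; swappiness (0 to 100) tunes how aggressively the kernel swaps the container's pages.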

answered Oct 10 '22 by pavelv