Operating the Celery Worker in the ECS Fargate

Tags:

I am working on a project using AWS ECS. I want to use Celery as a distributed task queue. Celery Worker can be build up as EC2 type, but because of the large amount of time that the instance is in the idle state, I think it would be cost-effective for AWS Fargate to run the job and quit immediately.

Do you have suggestions on how to use the Celery Worker efficiently in the AWS cloud?

903

asked Nov 09 '18 01:11

youngminz

2 Answers

Fargate launch type is going to take longer to spin up than EC2 launch type, because AWS is doing all the "host things" for you when you start the task, including the notoriously slow attaching of an ENI, and likely downloading the image from a Docker repo. Right now there's no contest, EC2 launch type is faster every time.

So it really depends on the type of work you want the workers to do. You can expect a new Fargate task to take a few minutes to enter a RUNNING state for the aforementioned reasons. EC2 launch, on the other hand, because the ENI is already in place on your host and the image is already downloaded (at best) or mostly downloaded (likely worst), will move from PENDING to RUNNING very quickly.

Edit: As @Rocket04 points out in a comment below, it appears AWS has improved Fargate startup times for scaling applications. Hooray!

Use EC2 launch type for steady workloads, use Fargate launch type for burst capacity

This is the current prevailing wisdom, often discussed as a cost factor because Fargate can't take advantage of the typical EC2 cost savings mechanisms like reserved instances and spot pricing. It's expensive to run Fargate all the time, compared to EC2.

To be clear, it's perfectly fine to run 100% in Fargate (we do), but you have to be willing to accept the downsides of doing that - slower scaling and cost.

Note you can run both launch types in the same cluster. Clusters are logical anyway, just a way to organize your resources.

Example cluster

This example shows a static EC2 launch type service running 4 celery tasks. The number of tasks, specs, instance size and all doesn't really matter, do it up however you like. The important thing is - EC2 launch type service doesn't need to scale; the Fargate launch type service is able to scale from nothing running (during periods where there's little or no work to do) to as many workers as you can handle, based on your scaling rules.

EC2 launch type Celery service

Running 1 EC2 launch type t3.medium (2vcpu/4GB).

Min tasks: 2, Desired: 4, Max tasks: 4

Running 4 celery tasks at 512/1024 in this EC2 launch type.

No scaling policies

Fargate launch type Celery service

Min tasks: 0, Desired: (x), Max tasks: 32

Running (x) celery tasks (same task def as EC2 launch type) at 512/1024

Add scaling policies to this service

answered Sep 22 '22 19:09

bluescores

Your Idea is great! but you missed something,

celery is a worker, not a task it should run 24/7.

celery doesn't stop when task completes. It will still runs and waits for other tasks so ECS only look at celery and it runs 24/7. So ECS never knows about celery task startings and endings.

If celery down who will bring up celery when a task assigned? there is no connection between your messaging broker and ECS to start celery.

Actually celery has a capability to run task on-demand as per messaging queue if it runs 24/7. Otherwise, nobody knows that new task was assigned.

Solution 1 : replace celery and rewrite ur all logics to support ECS tasks and create trigger mechanism for ECS tasks as per ur needs.

FYI: the above solution needs lot of efforts and not a practical

answered Sep 23 '22 19:09

Jameel Grand

Related questions
                            
                                How can I include the relative path to a module in a Python logging statement?
                            
                                Custom Colormap
                            
                                What is a "stateful object" in tensorflow?
                            
                                How to get coefficients and feature importances from MultiOutputRegressor?
                            
                                pyqt add rectangle in Qgraphicsscene
                            
                                Scipy Optimize is only returning x0, only completing one iteration
                            
                                Run Azure Databricks without Spark cluster
                            
                                TFDV Tensorflow Data Validation: how can I save/load the protobuf schema to/from a file
                            
                                Convert DatetimeIndex to datetime
                            
                                How can I rename "levelname" to "level" in Python log messages?
                            
                                Pytest assertion doesn't show differences on AssertionError
                            
                                Read a JSON and convert the keys to int
                            
                                Airflow DAG in functions?
                            
                                how to install win32clipboard
                            
                                What does single(not double) asterisk * means when unpacking dictionary in Python?
                            
                                Add multiple docs in yaml file | PyYAML
                            
                                Django datetime not validating right
                            
                                How to get quarter beginning date in python
                            
                                Insert a pandas dataframe into a SQLite table
                            
                                SQLAlchemy: filtering by a key in a JSON column

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Operating the Celery Worker in the ECS Fargate

Tags:

python

amazon-web-services

celery

amazon-ecs

aws-fargate