Prioritizing queues among multiple queues in celery?

Tags:

celery-task

We are using celery for our asynchronous background tasks and we have 2 queues for different priority tasks. We have 2 cluster of nodes serving them separately. Things are working well as expected.

Question:

We get mostly low priority tasks. For optimized resource utilization, I am wondering is there a way to configure workers(listening to high priority queue) to listen to both queues. But take jobs from the higher priority queue as long as some job is there? and fallback to low priority queue otherwise.

I have gone through the priority based task scheduling discussed @ Celery Task Priority.

But my questions is prioritize queues not just tasks within a queue.

600

asked Sep 13 '17 18:09

arunk2

2 Answers

You can partially achieve this by defining multiple queues for the worker, when starting it.

You can do it with the following command: Also, refer here for more details.

celery -A proj worker -l info -Q Q1,Q2

Though this approach has a problem. It doesn't do it with fallback kind of approach. Since, workers listening to multiple queue evenly distribute the resources among them.

Hence, your requirement of processing only from 'high priority queue' even when there is something in 'normal priority queue' cannot be achieved. This can be minimized by allocating more Workers (may be 75%) for 'high priority queue' and 25% for 'normal priority queue'. or different share based on you work load.

176

answered Oct 13 '22 21:10

Suresh

This is now possible with Celery >= 4.1.1 + Redis transport (probably earlier version too). You just need to set a broker transport option in your celeryconfig.py module. This setting was implemented with Kombu 4.0.0.

broker_transport_options = {
  visibility_timeout: 1200,  # this doesn't affect priority, but it's part of redis config
  queue_order_strategy: 'priority'
}

It's also possible to specify with an environment variable.

For a worker started with $ celery -A proj worker -l info -Q Q1,Q2 the idle worker will check Q1 first and execute Q1 tasks if available before checking Q2.

source

Bonus off topic help, this also works with Airflow 1.10.2 workers, except it seems like the queue order is not preserved from the command line. Using 'queue_order_strategy'='sorted' and naming your queues appropriately works (Q1, Q2 would work perfectly). Airflow pool-based priority is not preserved between dags so this really helps!

answered Oct 13 '22 20:10

c-wilson

Related questions
                            
                                Django run tasks (possibly) in the far future
                            
                                Celery chain not working with batches
                            
                                Flask with Celery - Application context not available
                            
                                Celery and signals
                            
                                how to serialize binary files to use with a celery task
                            
                                First steps with Celery using a virtualenv
                            
                                Unknown queue names show on Rabbitmq mgmt. when using Celery
                            
                                celerybeat - multiple instances & monitoring
                            
                                Running RabbitMQ+Celery in the same server as production environment
                            
                                Lightweight notification technique
                            
                                How to track the progress of individual tasks inside a group which forms the header to a chord in celery?
                            
                                Flask, blueprints uses celery task and got cycle import
                            
                                Is it possible to skip delegating a celery task if the params and the task name is already queued in the server?
                            
                                Celery + SQS - pycurl error
                            
                                Django Celery ConnectionError: Too many heartbeats missed
                            
                                Django, Celery, Redis, RabbitMQ: Chained Tasks for Fanout-On-Writes
                            
                                Celery - minimize memory consumption
                            
                                celery task clean-up with DB backend
                            
                                celery tutorial: NotRegistered error
                            
                                Typical memory usage for Django applications

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With