First of all, please don't consider this question a duplicate of this question.
I have set up an environment which uses celery with redis as both the broker and the result_backend. My question is: how can I make sure that when the celery workers crash, all the scheduled tasks are retried once the celery worker is back up?
I have seen advice on using CELERY_ACKS_LATE = True, so that the broker will re-drive the tasks until it gets an ACK, but in my case it's not working. Whenever I schedule a task, it immediately goes to the worker, which holds on to it until the scheduled time of execution. Let me give an example:
I am scheduling a task like this: res = test_task.apply_async(countdown=600), but immediately in the celery worker logs I can see something like: Got task from broker: test_task[a137c44e-b08e-4569-8677-f84070873fc0] eta:[2013-01-...]. Now when I kill the celery worker, these scheduled tasks are lost. My settings:
BROKER_URL = "redis://localhost:6379/0"
CELERY_ALWAYS_EAGER = False
CELERY_RESULT_BACKEND = "redis://localhost:6379/0"
CELERY_ACKS_LATE = True
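For context, a minimal reproduction along these lines might look like the following (the module name tasks.py and the trivial task body are illustrative, not from the original setup):

```python
# tasks.py -- minimal sketch reproducing the observed behaviour
from celery import Celery

app = Celery("tasks")
app.conf.update(
    BROKER_URL="redis://localhost:6379/0",
    CELERY_RESULT_BACKEND="redis://localhost:6379/0",
    CELERY_ALWAYS_EAGER=False,
    CELERY_ACKS_LATE=True,
)

@app.task
def test_task():
    return "done"  # placeholder body

if __name__ == "__main__":
    # Schedule 10 minutes ahead. The worker reserves the message
    # immediately and keeps it in memory until the ETA, which is why
    # "Got task from broker: ... eta:[...]" appears in the log right away.
    res = test_task.apply_async(countdown=600)
    print(res.id)
```

Run the worker with `celery -A tasks worker --loglevel=info`, then kill it (e.g. with kill -9) before the ETA; the reserved task does not come back when the worker restarts, which is the behaviour described above.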
Apparently this is how celery behaves: when a worker is abruptly killed (but the dispatching process isn't), the message will be considered 'failed' even though you have acks_late=True.
The motivation (to my understanding) is that if the consumer was killed by the OS due to an out-of-memory condition, there is no point in redelivering the same task.
You can see the exact issue here: https://github.com/celery/celery/issues/1628
I actually disagree with this behaviour. IMO it would make more sense not to acknowledge.
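For what it's worth, later Celery releases (4.0+) added a reject_on_worker_lost option aimed at exactly this case: combined with acks_late, the message is requeued instead of acknowledged when the worker process is killed. A minimal sketch using the Celery 4.x lowercase setting names (check your version's docs; note the Celery docs warn this can cause redelivery loops if a task reliably crashes its worker):

```python
from celery import Celery

app = Celery("tasks", broker="redis://localhost:6379/0")

# Celery 4.x setting names; only meaningful together with acks_late.
app.conf.task_acks_late = True
app.conf.task_reject_on_worker_lost = True

# The same pair can also be set per task:
@app.task(acks_late=True, reject_on_worker_lost=True)
def test_task():
    return "done"  # placeholder body
```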