

How to dynamically add / remove periodic tasks to Celery (celerybeat)

Tags: python, celery, celerybeat

If I have a function defined as follows:

def add(x, y):
    return x + y

Is there a way to dynamically add this function as a celery PeriodicTask and kick it off at runtime? I'd like to be able to do something like (pseudocode):

some_unique_task_id = celery.beat.schedule_task(add, run_every=crontab(minute="*/30"))
celery.beat.start(some_unique_task_id)

I would also want to stop or remove that task dynamically with something like (pseudocode):

celery.beat.remove_task(some_unique_task_id)

or

celery.beat.stop(some_unique_task_id)

FYI I am not using djcelery, which lets you manage periodic tasks via the django admin.


asked Apr 17 '12 16:04

Jamie Forrest

2 Answers

This question was answered on Google Groups.

I am not the author; all credit goes to Jean Mark.

Here's a proper solution for this, confirmed working. In my scenario, I subclassed PeriodicTask and created a model out of it, since that lets me add other fields to the model as I need and also add the "terminate" method. You have to set the periodic task's enabled property to False and save it before you delete it. The subclassing is not a must; the schedule_every method is the one that really does the work. When you're ready to terminate your task (if you didn't subclass it), you can just use PeriodicTask.objects.filter(name=...) to find your task, disable it, then delete it.

Hope this helps!

import json
from datetime import datetime

from django.db import models
from djcelery.models import PeriodicTask, IntervalSchedule


class TaskScheduler(models.Model):

    periodic_task = models.ForeignKey(PeriodicTask)

    @staticmethod
    def schedule_every(task_name, period, every, args=None, kwargs=None):
        """Schedules a task by name every "every" "period". So an example call would be:

            TaskScheduler.schedule_every('mycustomtask', 'seconds', 30, [1, 2, 3])

        that would schedule your custom task to run every 30 seconds with the
        arguments 1, 2 and 3 passed to the actual task.
        """
        permissible_periods = ['days', 'hours', 'minutes', 'seconds']
        if period not in permissible_periods:
            raise Exception('Invalid period specified')
        # create the periodic task and the interval
        ptask_name = "%s_%s" % (task_name, datetime.now())  # create some name for the periodic task
        interval_schedules = IntervalSchedule.objects.filter(period=period, every=every)
        if interval_schedules:  # just check if interval schedules exist like that already and reuse them
            interval_schedule = interval_schedules[0]
        else:  # create a brand new interval schedule
            interval_schedule = IntervalSchedule()
            interval_schedule.every = every  # should check to make sure this is a positive int
            interval_schedule.period = period
            interval_schedule.save()
        ptask = PeriodicTask(name=ptask_name, task=task_name, interval=interval_schedule)
        # PeriodicTask stores args and kwargs as JSON-encoded text
        if args:
            ptask.args = json.dumps(args)
        if kwargs:
            ptask.kwargs = json.dumps(kwargs)
        ptask.save()
        return TaskScheduler.objects.create(periodic_task=ptask)

    def stop(self):
        """pauses the task"""
        ptask = self.periodic_task
        ptask.enabled = False
        ptask.save()

    def start(self):
        """starts the task"""
        ptask = self.periodic_task
        ptask.enabled = True
        ptask.save()

    def terminate(self):
        """disables the task, then deletes both this record and the periodic task"""
        self.stop()
        ptask = self.periodic_task
        self.delete()
        ptask.delete()


answered Sep 30 '22 09:09

McP

This was finally made possible by a fix included in celery v4.1.0. Now, you just need to change the schedule entries in the database backend, and celery-beat will act according to the new schedule.

The docs describe how this works only briefly. The default scheduler for celery-beat, PersistentScheduler, uses a shelve file as its schedule database. Any changes to the beat_schedule dictionary in the PersistentScheduler instance are synced with this database (by default, every 3 minutes), and vice versa. The docs describe how to add new entries to the beat_schedule using app.add_periodic_task. To modify an existing entry, just add a new entry with the same name. Delete an entry as you would from a dictionary: del app.conf.beat_schedule['name'].
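As a rough illustration of the add / modify / delete operations described above, here is a minimal sketch that uses a plain dict as a stand-in for app.conf.beat_schedule, so it runs without a Celery app. The entry name "add-every-30-min" and the task name "tasks.add" are hypothetical; the "task", "schedule" and "args" keys follow the documented beat_schedule entry format, where "schedule" may be a timedelta, a number of seconds, or a crontab.

```python
from datetime import timedelta

# A plain dict standing in for app.conf.beat_schedule
# (assumption: no real Celery app is created in this sketch).
beat_schedule = {}

# Add: "tasks.add" is a hypothetical registered task name.
beat_schedule["add-every-30-min"] = {
    "task": "tasks.add",
    "schedule": timedelta(minutes=30),
    "args": (2, 3),
}

# Modify: assigning to the same name replaces the existing entry.
beat_schedule["add-every-30-min"] = {
    "task": "tasks.add",
    "schedule": timedelta(minutes=10),
    "args": (2, 3),
}
current_interval = beat_schedule["add-every-30-min"]["schedule"]

# Delete: remove the entry as you would from any dictionary.
del beat_schedule["add-every-30-min"]
```

With a real app, the same dictionary operations would be applied to app.conf.beat_schedule instead of the stand-in.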

Suppose you want to monitor and modify your celery beat schedule using an external app. Then you have several options:

1. You can open the shelve database file and read its contents like a dictionary. Write back to this file for modifications.
2. You can run another instance of the Celery app, and use that one to modify the shelve file as described above.
3. You can use the custom scheduler class from django-celery-beat to store the schedule in a Django-managed database, and access the entries there.
4. You can use the scheduler from celerybeat-mongo to store the schedule in a MongoDB backend, and access the entries there.
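The first option can be sketched with the standard-library shelve module. This is only a sketch under assumptions: it assumes the schedule is kept under an "entries" key, which matches how PersistentScheduler lays out its file as far as I know, but check this against your Celery version. The helper names list_entries and remove_entry are made up for illustration, and the demo writes plain dicts to a throwaway file rather than touching a real celerybeat-schedule file. Reading a real file would also need Celery importable, since the stored entries are pickled Celery objects.

```python
import os
import shelve
import tempfile


def list_entries(path):
    """Return the sorted entry names stored under the 'entries' key."""
    with shelve.open(path, flag="r") as db:
        return sorted(db.get("entries", {}))


def remove_entry(path, name):
    """Delete one entry and write the mapping back; return True if it existed."""
    with shelve.open(path, flag="c") as db:
        entries = db.get("entries", {})
        existed = name in entries
        entries.pop(name, None)
        # Reassign the key so shelve persists the modified mapping.
        db["entries"] = entries
        return existed


# Demo on a throwaway shelve file (hypothetical path and entry contents).
demo_path = os.path.join(tempfile.mkdtemp(), "demo-schedule")
with shelve.open(demo_path) as db:
    db["entries"] = {"add-every-30-min": {"task": "tasks.add", "args": (2, 3)}}

print(list_entries(demo_path))
```

Note that shelve does not support concurrent read/write access, so editing the file while beat is running is something to approach with care.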

answered Sep 30 '22 07:09

Tristan Brown
