APScheduler is executing a job multiple times

Tags: django, uwsgi, apscheduler

I have a Django application running under uWSGI (10 workers) behind nginx, and I use APScheduler for scheduling. Whenever I schedule a job, it is executed multiple times. From these answers (ans1, ans2) I learned that this happens because the scheduler is started in every uWSGI worker. I tried to initialize the scheduler conditionally, both by binding it to a socket as suggested in this answer and by keeping a status flag in the database, so that only one scheduler instance is started. The problem persists, and sometimes when I create a job the scheduler turns out not to be running, so the job stays pending and is never executed.

I initialize APScheduler in the urls.py of the Django application with the following code, so the scheduler starts when the application starts.

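from apscheduler.executors.pool import ProcessPoolExecutor, ThreadPoolExecutor
from apscheduler.jobstores.mongodb import MongoDBJobStore
from apscheduler.schedulers.background import BackgroundScheduler

# TIME_ZONE, client (the MongoDB client) and scheduler_db_conn (a MongoDB
# collection) are assumed to be defined elsewhere in the project.


def job_listener(ev):
    print('event', ev)


job_defaults = {
    'coalesce': True,
    'max_instances': 1
}

scheduler = BackgroundScheduler(job_defaults=job_defaults, timezone=TIME_ZONE, daemon=False)
scheduler.add_jobstore(MongoDBJobStore(client=client), 'default')
scheduler.add_executor(ThreadPoolExecutor(), 'default')
scheduler.add_executor(ProcessPoolExecutor(), 'processpool')
scheduler.add_listener(job_listener)


def initialize_scheduler():
    try:
        # A status document in the DB marks the scheduler as already started.
        if scheduler_db_conn.find_one():
            print('scheduler already running')
            return True
        scheduler.start()
        scheduler_db_conn.save({'status': True})
        print('---------------scheduler started --------------->')
        return True
    except Exception:
        return False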

I use the following code to create a job.

from scheduler_conf import scheduler


def create_job(arg_list):
    # arg_list is a dict of keyword arguments passed on to add_job().
    try:
        print('scheduler status-->', scheduler.running)
        job = scheduler.add_job(**arg_list)
        return True
    except Exception:
        print('error in creating Job')
        return False

I am not able to configure and run the scheduler properly. I have read all the related APScheduler threads but still haven't found a solution.

If I don't prevent each worker from running its own scheduler, the job is executed multiple times.
But if I limit it to a single scheduler running inside one worker, some jobs stay pending and are never executed.

What's the solution for this?

asked Aug 31 '16 15:08

daemon24

1 Answer

Let's consider the following facts:

(1) uWSGI, by default, pre-loads your Django app into the uWSGI master process's memory BEFORE forking its workers.

(2) uWSGI "forks" workers from the master, meaning the master's memory is essentially copied into each worker. Because of how fork() is implemented, a child process (i.e. a worker) does not inherit the threads of its parent; only the thread that called fork() exists in the child.

(3) When you call BackgroundScheduler.start(), a thread is created that is responsible for executing jobs in whichever process (worker or master) calls this function.
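Fact (2) can be checked without uWSGI or APScheduler at all. The following is a minimal stdlib-only sketch, assuming a POSIX system where os.fork() is available; background_loop and count_threads_after_fork are hypothetical names, and the background thread merely stands in for the scheduler's thread.

```python
import os
import threading
import time


def background_loop():
    # Stand-in for the scheduler's job-running thread (illustrative only).
    while True:
        time.sleep(0.1)


def count_threads_after_fork():
    """Return (thread count in parent, thread count in child) around a fork()."""
    worker = threading.Thread(target=background_loop, daemon=True)
    worker.start()
    parent_count = threading.active_count()  # includes the background thread

    read_fd, write_fd = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Child: only the thread that called fork() exists here.
        os.close(read_fd)
        os.write(write_fd, str(threading.active_count()).encode())
        os._exit(0)

    # Parent: read the child's thread count, then reap the child.
    os.close(write_fd)
    child_count = int(os.read(read_fd, 16).decode())
    os.close(read_fd)
    os.waitpid(pid, 0)
    return parent_count, child_count


if __name__ == "__main__":
    print(count_threads_after_fork())
```

The child reports a single live thread even though the parent had the background thread running, which is the behaviour the reasoning here relies on.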

All you must do is call BackgroundScheduler.start() in the master process, before any workers are created. When the workers are then forked, they WILL NOT INHERIT the BackgroundScheduler thread (#2 above), and thus will not execute any jobs (but they can still schedule/modify/delete jobs by communicating with the jobstore!).

To do this, make sure you call BackgroundScheduler.start() in whatever function/module instantiates your app. For instance, in the following Django project structure, we'd (likely) want to execute this code in wsgi.py, which is the entry point for the uWSGI server:

mysite/
    manage.py
    mysite/
        __init__.py
        settings.py
        urls.py
        wsgi.py
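As a rough sketch of what that could look like in wsgi.py: the helper below starts a scheduler only if it does not already report itself as running. start_scheduler_once is a hypothetical name, and the commented wiring assumes the scheduler_conf module from the question and the usual Django wsgi.py layout. It also assumes that threads are enabled in uWSGI (the enable-threads option), since the scheduler runs in a Python thread.

```python
def start_scheduler_once(scheduler):
    """Start `scheduler` unless it already reports itself as running.

    Returns True if this call started it, False if it was already running.
    Works with any object exposing a `running` attribute and a `start()`
    method, which APScheduler's BackgroundScheduler does.
    """
    if getattr(scheduler, "running", False):
        return False
    scheduler.start()
    return True


# Assumed wiring in mysite/wsgi.py (not verified against the asker's project):
#
#   import os
#   from django.core.wsgi import get_wsgi_application
#   from scheduler_conf import scheduler
#
#   os.environ.setdefault("DJANGO_SETTINGS_MODULE", "mysite.settings")
#   application = get_wsgi_application()
#   start_scheduler_once(scheduler)  # runs once, in the master, before forking
```

Because wsgi.py is imported by the master before the workers are forked (unless lazy loading is enabled), the call runs once and the forked workers do not carry the scheduler thread.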

Pitfalls:

Don't initialize APScheduler in the urls of the Django application (as the question does, so that it "will start the scheduler when application starts"). That module may be loaded by each worker, and thus start() is executed multiple times.

Don't start the uWSGI server in "lazy-apps" mode; it loads the app in each of the workers after they are created, rather than once in the master.

Don't run the BackgroundScheduler with the default (memory) jobstore. This creates split-brain syndrome between the workers. You want to enforce a single point of truth, as you are doing with MongoDB, for all CRUD operations performed on jobs.

This post may give you more detail, though it covers a Gunicorn (WSGI server) environment.

answered Sep 27 '22 23:09

The Aelfinn
