Python / rq - monitoring worker status

If this is an idiotic question, I apologize and will go hide my head in shame, but:

I'm using rq to queue jobs in Python. I want it to work like this:

  1. Job A starts. Job A grabs data via a web API and stores it.
  2. Job A runs.
  3. Job A completes.
  4. Upon completion of job A, job B starts. Job B checks each record stored by job A and adds some additional response data.
  5. Upon completion of job B, the user gets a happy e-mail saying their report's ready.

My code so far:

from redis import Redis
from rq import Queue, Worker, use_connection

import getlinksmod  # my own module containing lsGet

redis_conn = Redis()
use_connection(redis_conn)
q = Queue('normal', connection=redis_conn)  # this is terrible, I know - fixing later
w = Worker(q)
job = q.enqueue(getlinksmod.lsGet, theURL, total, domainid)  # theURL, total, domainid defined elsewhere
w.work()

I assumed my best solution was to have two workers, one for job A and one for job B. The job B worker could monitor job A and, when job A was done, get started on job B.

What I can't figure out to save my life is how to get one worker to monitor the status of another. I can grab the job ID from job A with job.id. I can grab the worker name with w.name. But I haven't the foggiest idea how to pass any of that information to the other worker.

Or, is there a much simpler way to do this that I'm totally missing?

asked Aug 23 '12 by user1066609

People also ask

What is RQ worker?

RQ: Workers. A worker is a Python process that typically runs in the background and exists solely as a work horse to perform lengthy or blocking tasks that you don't want to perform inside web processes.
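
A minimal sketch of starting such a worker from Python (the 'normal' queue name is taken from the question; in practice you would usually run the worker in its own process):

from redis import Redis
from rq import Queue, Worker

redis_conn = Redis()
queue = Queue('normal', connection=redis_conn)
worker = Worker([queue], connection=redis_conn)
worker.work()  # blocks and processes jobs from 'normal' until stopped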

How does Django RQ work?

Django-RQ allows you to easily put jobs into any of the queues defined in settings.py. enqueue() returns a job object that provides a variety of information about the job's status, parameters, etc. enqueue() takes the function to be enqueued as the first parameter, followed by the arguments to pass to it.
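
A hedged sketch of that pattern (send_report is a hypothetical task function; it assumes a 'default' queue is configured in RQ_QUEUES in settings.py):

import django_rq

def send_report(report_id):
    print("building report", report_id)  # hypothetical task body

# Put the job on the 'default' queue defined in settings.py
job = django_rq.enqueue(send_report, 42)
print(job.id, job.get_status())  # the returned job object exposes id, status, args, etc.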

What is RQ dashboard?

rq-dashboard is a general-purpose, lightweight, Flask-based web front-end to monitor your RQ queues, jobs, and workers in real time.


2 Answers

Update January 2015: this pull request has now been merged, and the parameter has been renamed to depends_on, i.e.:

second_job = q.enqueue(email_customer, depends_on=first_job)
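
Applied to the workflow in the question, a sketch might look like this (getlinksmod.lsGet and its arguments come from the question; process_records and email_customer are hypothetical stand-ins for job B and the notification step):

job_a = q.enqueue(getlinksmod.lsGet, theURL, total, domainid)
job_b = q.enqueue(process_records, depends_on=job_a)  # runs only after job_a finishes
job_c = q.enqueue(email_customer, depends_on=job_b)   # the happy e-mail goes out last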

The original post left intact for people running older versions and such:

I have submitted a pull request (https://github.com/nvie/rq/pull/207) to handle job dependencies in RQ. When this pull request gets merged in, you'll be able to do:

def generate_report():
    pass

def email_customer():
    pass

first_job = q.enqueue(generate_report)
second_job = q.enqueue(email_customer, after=first_job)
# In the second enqueue call, the job is created immediately,
# but it is only moved into the queue after first_job finishes

For now, I suggest writing a wrapper function to sequentially run your jobs. For example:

def generate_report():
    pass

def email_customer():
    pass

def generate_report_and_email():
    generate_report()
    email_customer() # You can also enqueue this function, if you really want to

# Somewhere else
q.enqueue(generate_report_and_email)
answered Sep 28 '22 by Selwin Ong

From this page on the rq docs, it looks like each job object has a result attribute, accessible as job.result, which you can check. If the job hasn't finished, it'll be None. But if you make sure your job returns some value (even just "Done"), you can have your other worker check the result of the first job and begin working only when job.result has a value, meaning the first worker has completed.
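
A minimal sketch of that polling approach (wait_for_result is a hypothetical helper; it assumes the first job's ID has been handed to the second worker and that the first job returns a non-None value when it finishes):

from time import sleep
from redis import Redis
from rq.job import Job

redis_conn = Redis()

def wait_for_result(job_id, poll_interval=5):
    # Re-fetch the job from Redis each time and poll until it has a result
    while True:
        job = Job.fetch(job_id, connection=redis_conn)
        if job.result is not None:
            return job.result
        sleep(poll_interval)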

answered Sep 28 '22 by jdotjdot