I want to use Celery to run jobs on a GPU server with four Tesla cards. I run the Celery worker with a pool of four workers such that each card always runs one job.
My problem is how to instruct the workers to each claim one GPU. Currently I rely on the assumption that the worker processes should all have contiguous process IDs:
device_id = os.getpid() % self.ndevices
However, this is not guaranteed to always work, e.g. the process IDs change when worker processes get restarted over time. So ideally, I would like to get the ID of each worker directly. Can someone tell me if it is possible to inspect the worker from within a task, or suggest a different solution to distribute the jobs across the GPUs?
You can access the identifier of the executing task via app.Task.request.id. To get at the currently executing task, import current_task with from celery import current_task and read current_task.request.
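As a minimal sketch of this (the app name and broker URL here are assumptions, not from the original post):

from celery import Celery, current_task

app = Celery('demo', broker='redis://localhost:6379/0')  # assumed app name and broker

@app.task
def whoami():
    # current_task is a proxy to the task being executed; its request
    # attribute carries context such as the unique task id
    print(current_task.request.id)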
When you run a Celery worker, it creates one parent process to manage the running tasks. This process handles bookkeeping features like sending/receiving queue messages, registering tasks, killing hung tasks, tracking status, etc.
The "shared_task" decorator allows creation of Celery tasks for reusable apps as it doesn't need the instance of the Celery app. It is also easier way to define a task as you don't need to import the Celery app instance.
If you are using CELERYD_POOL = 'processes', the worker pool is handled by billiard, which does happen to expose its 0-based process index:
from billiard import current_process
from celery import shared_task

@shared_task
def print_info():
    # index is an int in [0, concurrency), identifying this pool process
    print(current_process().index)
The index is 0-based, and if a worker process happens to be restarted it will keep its index. I couldn't find any documentation regarding the index value, though.
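Putting this together with the original question, here is a hedged sketch of one way to pin each pool process to a GPU. The task name, job_args parameter, NDEVICES constant, and the use of CUDA_VISIBLE_DEVICES are illustrative assumptions, and the environment variable only takes effect if no GPU library has initialized in the process yet:

import os

from billiard import current_process
from celery import shared_task

NDEVICES = 4  # four Tesla cards, as in the question (assumed constant)

@shared_task
def run_on_gpu(job_args):
    # current_process().index is 0-based and stable across process
    # restarts, unlike os.getpid(), so it maps cleanly onto the GPUs.
    device_id = current_process().index % NDEVICES
    # Pin the CUDA runtime to that single card; this must happen
    # before any GPU library initializes in this process.
    os.environ['CUDA_VISIBLE_DEVICES'] = str(device_id)
    return device_id

With the worker started with --concurrency=4, as in the question's setup, each of the four pool processes gets a stable index in 0-3, so each card runs at most one job at a time.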