Unit testing celery tasks directly

Tags:

I know this will be seen as a duplicate, but I have looked around before asking this question, however all of the questions seem to be either outdated or don't help at all with my problem. This is where I've looked before writing this question:

Official Docs
How do you unit test a Celery task? (5 years old, all dead links)
How to unit test code that runs celery tasks? (2 years old)
How do I capture Celery tasks during unit testing? (3 years old)

I'm currently working on a project that heavily uses Celery to handle asynchronous tasks; to make the entire code-base stable I'm writing unit tests for the entire project however I haven't been able to write a single working test for Celery so far.

Most of my code needs to keep track of the tasks that were run in order to determine wether or not all results are ready to be queried. This is implemented in my code as follows:

@app.task(bind=True)
def some_task(self, record_id):
    associate(self.request.id, record_id)  # Not the actual DB code, but you get the idea

# Somewhere else in my code, eg: Flask endpoint
record = some_db_record()
some_task.apply_async(args=[record.id])

Since I don't have a *nix based machine to run my code on, I tried solving this by setting the always eager option to true, however this causes issues whenever any sub-task tries to query the result:

@app.task(bind=True)
def foo(self): 
    task = bar.apply_async()
    foo_poll.apply_async(args=[task.id]) 

@app.task(bind=True, max_retries=None):
def foo_poll(self, celery_id)
    task =  AsyncResult(celery_id)
    if not task.ready():  # RuntimeError: Cannot retrieve result with task_always_eager enabled
        return self.retry(countdown=5)
    else:
        pass  # Do something with the result

@app.task
def bar():
    time.sleep(10)

I tried fixing this by patching the AsyncResult methods, however this caused issues as self.request.id would be None:

with patch.object(AsyncResult, "_get_task_meta", side_effect=lambda: {"status": SUCCESS, "result": None}) as method:
    foo()

@app.task(bind=True)
def foo(self):
    pass   # self.request.id is now None, which I need to track sub-tasks

Does anyone know how I could do this? Or if Celery is even worth using anymore? I'm at the point where I find the documentation and any questions related to testing so overwhelmingly complex I just feel like ditching it all together and just go back to multithreading.

422

asked Jul 10 '17 15:07

Paradoxis

2 Answers

I had about the same issue and came up with two possible approaches:

Call tasks in tests directly and wrap all inner celery interactions with if self.request.called_directly and run task directly if True or with apply_async if False.
Wrap task.ready() and other statuses check with functions where I check for ALWAYS_EAGER and task readiness.

Eventually I came up with kinda mix of both with the rule to avoid nested tasks as much as I can. And also put as little code in @app.task as I can in order to be able to test task functions in as much isolation as possible.

It might look quite frustrating and awful, but in fact it's not.

Also you can check how big guys like Sentry do this (spoiler: mocks and some nifty helpers).

So it's definitely possible, it's just not an easy way to find some best practices around.

101

answered Sep 27 '22 19:09

valignatev

It is possible to test the function without the celery task binding by calling it directly and by using a mock to replace the task object.

The inner function is hidden behind some_task.__wrapped__.__func__.

Here is an example of how to use it in a test case:

def test_some_task(self):
    mock_task = Mock()
    mock_task.request.id = 5  # your test data here
    record_id = 5  # more test data
    some_task_inner = some_task.__wrapped__.__func__
    some_task_inner(mock_task, record_id)
    # ...

answered Sep 27 '22 19:09

Erik Kalkoken

Related questions
                            
                                Why is python pandas dataframe rounding my values?
                            
                                Python pty.spawn stdin not echoed but redirected to master's stdout
                            
                                Python/SciPy: How to get cubic spline equations from CubicSpline
                            
                                Anaconda Python 3.6 -- pythonw and python supposed to be equivalent?
                            
                                Keras does not utilize 100% cpu
                            
                                Python matplotlib ValueError for logit scale axis label
                            
                                How can I optimize the calculation over this function in numpy?
                            
                                TooManyRequests Overpass Error
                            
                                PyYAML: load and dump yaml file and preserve tags ( !CustomTag )
                            
                                use jupyter widgets to save clicks on a pandas dataframe
                            
                                Panda's info() to HTML
                            
                                Python Gensim how to make WMD similarity run faster with multiprocessing
                            
                                OpenCV Python Error: error: (-215) (mtype == CV_8U || mtype == CV_8S) && _mask.sameSize(*psrc1) in function cv::binary_op
                            
                                Python 2.7 and Pandas Boxplot connecting median values
                            
                                Django: Update Page Information Without Refreshing
                            
                                Show group on every record in groupby
                            
                                Using the Django ORM, How can you create a unique hash for all possible combinations
                            
                                url_for with _external=True on heroku doesn't append the server name on the URL
                            
                                Why does the call method gets called at build time in Keras layers
                            
                                Colorbar for each row in ImageGrid

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Unit testing celery tasks directly

Tags:

python

unit-testing

celery

Paradoxis

People also ask

2 Answers

valignatev

Erik Kalkoken

Recent Activity

Donate For Us