In my /etc/defaults/celeryd config file, I've set:

CELERYD_NODES="agent1 agent2 agent3 agent4 agent5 agent6 agent7 agent8"
CELERYD_OPTS="--autoscale=10,3 --concurrency=5"
I understand that the daemon spawns 8 Celery workers, but I'm not fully sure what autoscale and concurrency do together. I thought that concurrency was a way to specify the maximum number of threads a worker can use, and that autoscale was a way for the worker to scale its child workers up and down as necessary.
The tasks have a largish payload (roughly 20-50 kB each) and there are around 2-3 million of them, but each task runs in less than a second. I'm seeing memory usage spike because the broker distributes the tasks to every worker, replicating the payload many times over.

I think the issue lies in the config: the combination of workers + concurrency + autoscaling is excessive, and I would like to get a better understanding of what these three options do.
As for --concurrency: Celery by default uses multiprocessing to execute tasks concurrently. The number of worker processes/threads can be changed with the --concurrency argument; it defaults to the number of available CPUs if not set.
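For illustration, here is a minimal sketch (the module name proj and the broker URL are assumptions, not part of the original setup) showing the Python-side equivalent of passing --concurrency=5:

# proj.py -- hypothetical minimal Celery app
from celery import Celery

# Assumes a RabbitMQ broker on localhost; adjust the URL to your environment.
app = Celery("proj", broker="amqp://guest@localhost//")

# Equivalent to passing --concurrency=5 on the command line:
# the prefork pool starts 5 child processes instead of one per CPU core.
app.conf.worker_concurrency = 5

@app.task
def add(x, y):
    return x + y

# Start the worker with:
#   celery -A proj worker --loglevel=info
# or override the pool size on the command line instead:
#   celery -A proj worker --concurrency=5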
"Celery is an asynchronous task queue/job queue based on distributed message passing. It is focused on real-time operation, but supports scheduling as well. The execution units, called tasks, are executed concurrently on a single or more worker servers using multiprocessing, Eventlet, or gevent.
Asynchronous task queues are tools that allow pieces of a program to run in a separate machine or process. Celery is a task-queuing app: it communicates via messages, usually using a broker (e.g. RabbitMQ) to mediate between clients and workers.
Celery itself uses billiard (a fork of the multiprocessing library) to run your tasks in separate processes.
Let's distinguish between workers and worker processes. You spawn a Celery worker; this then spawns a number of worker processes (depending on options such as --concurrency and --autoscale; the default is to spawn as many processes as there are cores on the machine). There is no point in running more than one worker on a particular machine unless you want to do routing.
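With the options from the question, --autoscale=10,3 means each worker's pool may grow to at most 10 processes under load and shrink back to 3 when idle (the format is max,min). Combined with 8 nodes, that is potentially up to 80 processes on one machine, each holding task payloads in memory, which matches the memory spikes described above.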
I would suggest running only 1 worker per machine with the default number of processes. This will reduce memory usage by eliminating the duplication of data between workers.
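Under that suggestion, the /etc/defaults/celeryd entries from the question might look something like this (a sketch only; dropping --concurrency and --autoscale lets the pool default to one process per core):

CELERYD_NODES="agent1"
CELERYD_OPTS=""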
If you still have memory issues, save the data to a store and pass only an id to the workers.
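For example, here is a minimal sketch of that pattern (the Redis instance used as a payload store, the key naming, and the task/function names are all illustrative assumptions, not part of the original setup):

import json
import uuid

import redis
from celery import Celery

app = Celery("proj", broker="amqp://guest@localhost//")
# Hypothetical side store for the large payloads; any shared store would do.
store = redis.Redis(host="localhost", port=6379, db=1)

@app.task
def process_payload(payload_id):
    # The worker fetches the large payload by id, so the broker message
    # only ever carries a short key instead of the 20-50 kB body.
    payload = json.loads(store.get(payload_id))
    # ... do the actual work on payload ...
    store.delete(payload_id)  # clean up once the task is done

def enqueue(payload):
    # The client stores the payload once and sends only its id to the queue.
    payload_id = f"payload:{uuid.uuid4()}"
    store.set(payload_id, json.dumps(payload))
    process_payload.delay(payload_id)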