How to define max_queue_size, workers and use_multiprocessing in keras fit_generator()?

Tags:

I am applying transfer-learning on a pre-trained network using the GPU version of keras. I don't understand how to define the parameters max_queue_size, workers, and use_multiprocessing. If I change these parameters (primarily to speed-up learning), I am unsure whether all data is still seen per epoch.

max_queue_size:

maximum size of the internal training queue which is used to "precache" samples from the generator
Question: Does this refer to how many batches are prepared on CPU? How is it related to workers? How to define it optimally?

workers:

number of threads generating batches in parallel. Batches are computed in parallel on the CPU and passed on the fly onto the GPU for neural network computations
Question: How do I find out how many batches my CPU can/should generate in parallel?

use_multiprocessing:

whether to use process-based threading
Question: Do I have to set this parameter to true if I change workers? Does it relate to CPU usage?

Related questions can be found here:

Detailed explanation of model.fit_generator() parameters: queue size, workers and use_multiprocessing
What does worker mean in fit_generator in Keras?
What is the parameter “max_q_size” used for in “model.fit_generator”?
A detailed example of how to use data generators with Keras.

I am using fit_generator() as follows:

    history = model.fit_generator(generator=trainGenerator,                                   steps_per_epoch=trainGenerator.samples//nBatches,     # total number of steps (batches of samples)                                   epochs=nEpochs,                   # number of epochs to train the model                                   verbose=2,                        # verbosity mode. 0 = silent, 1 = progress bar, 2 = one line per epoch                                   callbacks=callback,               # keras.callbacks.Callback instances to apply during training                                   validation_data=valGenerator,     # generator or tuple on which to evaluate the loss and any model metrics at the end of each epoch                                   validation_steps=                                   valGenerator.samples//nBatches,   # number of steps (batches of samples) to yield from validation_data generator before stopping at the end of every epoch                                   class_weight=classWeights,                # optional dictionary mapping class indices (integers) to a weight (float) value, used for weighting the loss function                                   max_queue_size=10,                # maximum size for the generator queue                                   workers=1,                        # maximum number of processes to spin up when using process-based threading                                   use_multiprocessing=False,        # whether to use process-based threading                                   shuffle=True,                     # whether to shuffle the order of the batches at the beginning of each epoch                                   initial_epoch=0)

The specs of my machine are:

CPU : 2xXeon E5-2260 2.6 GHz Cores: 10 Graphic card: Titan X, Maxwell, GM200 RAM: 128 GB HDD: 4TB SSD: 512 GB

810

asked Apr 05 '19 08:04

Sophie Crommelinck

1 Answers

Q_0:

Question: Does this refer to how many batches are prepared on CPU? How is it related to workers? How to define it optimally?

From the link you posted, you can learn that your CPU keeps creating batches until the queue is at the maximum queue size or reaches the stop. You want to have batches ready for your GPU to "take" so that the GPU doesn't have to wait for the CPU. An ideal value for the queue size would be to make it large enough that your GPU is always running near the maximum and never has to wait for the CPU to prepare new batches.

Q_1:

Question: How do I find out how many batches my CPU can/should generate in parallel?

If you see that your GPU is idling and waiting for batches, try to increase the amount of workers and perhaps also the queue size.

Q_2:

Do I have to set this parameter to true if I change workers? Does it relate to CPU usage?

Here is a practical analysis of what happens when you set it to True or False. Here is a recommendation to set it to False to prevent freezing (in my setup True works fine without freezing). Perhaps someone else can increase our understanding of the topic.

In summary:

Try not to have a sequential setup, try to enable the CPU to provide enough data for the GPU.

Also: You could (should?) create several questions the next time, so that it is easier to answer them.

172

answered Oct 07 '22 00:10

a-doering

Related questions
                            
                                Open web in new tab Selenium + Python
                            
                                requests: how to disable / bypass proxy
                            
                                Hidden features of PyCharm [closed]
                            
                                Can I use Django F() objects with string concatenation?
                            
                                PyQt4.QtCore.pyqtSignal object has no attribute 'connect'
                            
                                What's an efficient way to find if a point lies in the convex hull of a point cloud?
                            
                                Python: SWIG vs ctypes
                            
                                How to split a string of space separated numbers into integers?
                            
                                Django, template context processors
                            
                                How to format elapsed time from seconds to hours, minutes, seconds and milliseconds in Python?
                            
                                ubuntu /usr/bin/env: python: No such file or directory
                            
                                How to remove any URL within a string in Python
                            
                                Django: Converting an entire set of a Model's objects into a single dictionary
                            
                                Pandas: Knowing when an operation affects the original dataframe
                            
                                What's the difference between Model.query and session.query(Model) in SQLAlchemy?
                            
                                How to set up a staging environment on Google App Engine
                            
                                Which Eclipse package should I download for PyDev?
                            
                                How to use Python's "easy_install" on Windows ... it's not so easy
                            
                                WARNING:tensorflow:sample_weight modes were coerced from ... to ['...']
                            
                                __new__ and __init__ in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to define max_queue_size, workers and use_multiprocessing in keras fit_generator()?

Tags:

python

machine-learning

tensorflow

gpu

keras

Sophie Crommelinck

People also ask

1 Answers

In summary:

a-doering

Recent Activity

Donate For Us