I have read the documentation for both <code>CentralStorageStrategy</code> and <code>MirroredStrategy</code>, but I cannot understand the essential difference between them. In <code>MirroredStrategy</code>: <blockquote> Each variable in the model is mirrored across all the replicas. </blockquote> In <code>CentralStorageStrategy</code>: <blockquote> Variables are not mirrored, instead they are placed on the CPU and operations are replicated across all local GPUs. </blockquote> Source: https://www.tensorflow.org/guide/distributed_training What does this mean in practice? What are the use cases for <code>CentralStorageStrategy</code>, and how does training work if the variables are placed on the CPU in this strategy?


Difference between MirroredStrategy and CentralStorageStrategy

1 Answer

Consider one particular variable (call it "my_var") in your usual, single-GPU, non-distributed use case (e.g. a weight matrix of a convolutional layer).

If you use 4 GPUs, MirroredStrategy will create 4 variables in place of the single "my_var", one on each GPU. However, every copy will hold the same value, because all copies are always updated in the same way. The variable updates therefore happen in sync on all the GPUs.

With CentralStorageStrategy, only one variable is created for "my_var", in host (CPU) memory. The computation is still replicated on every GPU, but the update happens in only one place.
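To make the contrast concrete, here is a toy, framework-free sketch of the two update patterns described above. It is not TensorFlow code: the helper names (`local_gradient`, `mirrored_step`, `central_step`) and the one-weight model are made up for illustration, and real strategies handle device placement and communication that this sketch ignores.

```python
# Toy simulation of the two variable-placement schemes (no TensorFlow involved).
# All names here are hypothetical and exist only for illustration.

def local_gradient(weight, batch):
    """Gradient of 0.5 * (w*x - y)**2, averaged over one replica's batch."""
    return sum((weight * x - y) * x for x, y in batch) / len(batch)

def mirrored_step(replica_weights, batches, lr):
    """One copy of the weight per replica.

    Each replica computes a gradient from its own copy, the gradients are
    averaged across replicas, and every copy applies the same averaged update.
    """
    grads = [local_gradient(w, b) for w, b in zip(replica_weights, batches)]
    avg_grad = sum(grads) / len(grads)  # stands in for the cross-GPU reduction
    return [w - lr * avg_grad for w in replica_weights]

def central_step(host_weight, batches, lr):
    """A single copy of the weight, kept on the host.

    Every replica reads that one copy, and the averaged gradient is applied
    to it exactly once.
    """
    grads = [local_gradient(host_weight, b) for b in batches]
    avg_grad = sum(grads) / len(grads)  # stands in for aggregation on the host
    return host_weight - lr * avg_grad

# Four "GPUs", each with its own batch drawn from y = 2x.
batches = [[(1.0, 2.0)], [(2.0, 4.0)], [(3.0, 6.0)], [(4.0, 8.0)]]
mirrored = [0.0] * 4   # four synchronized copies of "my_var"
central = 0.0          # one copy of "my_var" in host memory

for _ in range(50):
    mirrored = mirrored_step(mirrored, batches, lr=0.05)
    central = central_step(central, batches, lr=0.05)

print(mirrored, central)
```

Both schemes compute the same sequence of values in this sketch; what differs is where the variable lives and therefore which link (GPU-GPU or CPU-GPU) carries the traffic.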

Which one is better probably depends on the computer's topology and on how fast CPU-GPU communication is compared with GPU-GPU communication. If the GPUs can communicate quickly with each other, MirroredStrategy may be more efficient. But I'd benchmark it to be sure.
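A benchmark along those lines could be sketched as follows. The helper names `make_strategy` and `time_steps` are hypothetical; the two constructors are the ones TensorFlow 2.x exposes as `tf.distribute.MirroredStrategy` and `tf.distribute.experimental.CentralStorageStrategy`. TensorFlow is imported lazily, so the timing helper itself has no dependencies.

```python
import time

def make_strategy(name):
    """Return a tf.distribute strategy for the given (hypothetical) short name."""
    if name not in ("mirrored", "central_storage"):
        raise ValueError(f"unknown strategy: {name!r}")
    import tensorflow as tf  # assumed to be installed where this is called
    if name == "mirrored":
        return tf.distribute.MirroredStrategy()
    return tf.distribute.experimental.CentralStorageStrategy()

def time_steps(step_fn, warmup=3, steps=20):
    """Return the mean seconds per call of `step_fn`, excluding warm-up calls."""
    for _ in range(warmup):
        step_fn()  # warm-up: graph tracing, memory allocation, caches
    start = time.perf_counter()
    for _ in range(steps):
        step_fn()
    return (time.perf_counter() - start) / steps
```

In practice `step_fn` would run one training step of a model built inside `make_strategy(name).scope()`, once per strategy, on the same data and batch size. Because GPU work can be queued asynchronously, it is safer for `step_fn` to fetch a result (for example the loss value) so that the timing covers the whole step.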

107

answered Oct 16 '22 11:10

isarandi


Tags:

python

tensorflow2.0

Victor
