Suppose in PyTorch I have model1 and model2, which have the same architecture. They were further trained on the same data, or one model is an earlier version of the other, but that is not technically relevant to the question. Now I want to set the weights of model to be the average of the weights of model1 and model2. How would I do that in PyTorch?
A state_dict is integral to saving and loading models in PyTorch. Because state_dict objects are Python dictionaries, they can easily be saved, updated, altered, and restored, which adds a great deal of modularity to PyTorch models and optimizers.
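As a quick illustration of that modularity (the file name here is just a placeholder), a state_dict can be serialized to disk and restored later:

import torch

torch.save(model1.state_dict(), "model1_weights.pt")          # serialize parameters and buffers
model1.load_state_dict(torch.load("model1_weights.pt"))       # restore them later

With that in mind, one way to average the two models is to interpolate each parameter tensor directly, as in the snippet below.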
beta = 0.5  # interpolation parameter; 0.5 gives a plain average
params1 = model1.named_parameters()
params2 = model2.named_parameters()
dict_params2 = dict(params2)
for name1, param1 in params1:
    if name1 in dict_params2:
        # blend the two tensors; note this overwrites model2's parameters in place
        dict_params2[name1].data.copy_(beta * param1.data + (1 - beta) * dict_params2[name1].data)
# caveat: named_parameters() skips buffers (e.g. BatchNorm running stats); see the state_dict variant below
model.load_state_dict(dict_params2)
Taken from the PyTorch forums. You can grab the parameters, transform them, and load them back, but make sure the dimensions match.
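If the models contain buffers (for example BatchNorm running statistics), named_parameters() will not cover them. Below is a minimal sketch of an alternative that works directly on the state_dicts, assuming both models were built from the same architecture so their keys match, and that model is a third instance of that architecture as in the question:

beta = 0.5
sd1 = model1.state_dict()
sd2 = model2.state_dict()
avg = {}
for key in sd1:
    if sd1[key].is_floating_point():
        # interpolate weights, biases, and floating-point buffers
        avg[key] = beta * sd1[key] + (1 - beta) * sd2[key]
    else:
        # integer buffers (e.g. num_batches_tracked) cannot be meaningfully averaged
        avg[key] = sd1[key]
model.load_state_dict(avg)

Unlike the in-place version above, this leaves model1 and model2 untouched.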
I would also be really interested to hear about your findings with this.