I have been looking at autoencoders and have been wondering whether to use tied weights or not. I intend to stack them as a pretraining step and then use their hidden representations to feed an NN.
Using untied weights it would look like:
f(x)=σ2(b2+W2*σ1(b1+W1*x))
Using tied weights it would look like:
f(x)=σ2(b2+W1^T*σ1(b1+W1*x))
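For concreteness, here is a minimal PyTorch sketch of the two parameterizations (the class names, layer sizes, and sigmoid activations are my own illustrative assumptions, not something fixed by the question):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class UntiedAutoencoder(nn.Module):
    """f(x) = sigma2(b2 + W2 * sigma1(b1 + W1 * x)) with independent W1 and W2."""
    def __init__(self, n_in, n_hidden):
        super().__init__()
        self.enc = nn.Linear(n_in, n_hidden)   # W1, b1
        self.dec = nn.Linear(n_hidden, n_in)   # W2, b2

    def forward(self, x):
        h = torch.sigmoid(self.enc(x))         # sigma1(b1 + W1 x)
        return torch.sigmoid(self.dec(h))      # sigma2(b2 + W2 h)

class TiedAutoencoder(nn.Module):
    """f(x) = sigma2(b2 + W1^T * sigma1(b1 + W1 * x)) with a single shared W1."""
    def __init__(self, n_in, n_hidden):
        super().__init__()
        self.W1 = nn.Parameter(torch.randn(n_hidden, n_in) * 0.01)  # shared weight
        self.b1 = nn.Parameter(torch.zeros(n_hidden))               # encoder bias
        self.b2 = nn.Parameter(torch.zeros(n_in))                   # decoder bias

    def encode(self, x):
        return torch.sigmoid(F.linear(x, self.W1, self.b1))         # sigma1(b1 + W1 x)

    def forward(self, x):
        h = self.encode(x)
        # the decoder reuses the encoder weight, transposed: W1^T
        return torch.sigmoid(F.linear(h, self.W1.t(), self.b2))
```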
From a very simplistic view, could one say that tying the weights ensures the encoder is producing the best representation the architecture allows, whereas with independent weights the decoder could effectively take a non-optimal representation and still decode it?
I ask because if the decoder is where the "magic" occurs, and I intend to use only the encoder to drive my NN, wouldn't that be problematic?
Bottleneck: the lower-dimensional hidden layer where the encoding is produced. The bottleneck layer has fewer nodes than the input layer, and its number of nodes gives the dimension of the encoding of the input.
The weight matrix of the decoding stage is the transpose of the weight matrix of the encoding stage, in order to reduce the number of parameters to learn. We want to optimize W, b, and b′ (the shared weight matrix, the encoder bias, and the decoder bias) so that the reconstruction is as similar to the original input as possible with respect to some loss function.
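A rough sketch of that optimization, reusing the hypothetical TiedAutoencoder above (the layer sizes, the MSE loss, and the random mini-batch are placeholder assumptions):

```python
import torch
import torch.nn.functional as F

n_in, n_hidden = 784, 32                  # bottleneck: 32-dimensional encoding of a 784-d input
model = TiedAutoencoder(n_in, n_hidden)   # from the sketch above
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x = torch.rand(256, n_in)                 # stand-in for a real mini-batch

for step in range(1000):
    opt.zero_grad()
    x_hat = model(x)                      # reconstruction of the input
    loss = F.mse_loss(x_hat, x)           # reconstruction error
    loss.backward()                       # gradients w.r.t. W1, b1, b2
    opt.step()

codes = model.encode(x)                   # 32-d representations to feed the downstream NN
```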
Autoencoders can improve learning accuracy with regularization, which can be a sparsity regularizer, a contractive regularizer [5], or a denoising form of regularization [6]. Recent work [7] has shown that dropout training can be used as a regularizer to prevent feature co-adaptation.
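As an illustration of the denoising form [6], one common recipe (continuing the sketch above; the Gaussian corruption and its 0.3 scale are assumptions) is to corrupt the input but score the reconstruction against the clean input:

```python
# Denoising sketch: feed a corrupted input, compute the loss against the clean input.
noise_std = 0.3                               # corruption level is a hyperparameter
x_noisy = x + noise_std * torch.randn_like(x)
x_hat = model(x_noisy)
loss = F.mse_loss(x_hat, x)                   # reconstruct the *clean* x
```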
A Sparse Autoencoder is a type of autoencoder that employs sparsity to achieve an information bottleneck. Specifically, the loss function is constructed so that activations within a layer are penalized.
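A minimal version of such a penalty (continuing the sketch above; the L1 penalty and its 1e-3 weight are assumptions, and KL-based sparsity penalties are an equally common choice):

```python
# Sparse-autoencoder sketch: add an L1 penalty on the hidden activations
# to the reconstruction loss.
sparsity_weight = 1e-3
h = model.encode(x)                                          # hidden activations
x_hat = torch.sigmoid(F.linear(h, model.W1.t(), model.b2))   # decode h
loss = F.mse_loss(x_hat, x) + sparsity_weight * h.abs().mean()
```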
Autoencoders with tied weights have some important advantages: the decoder reuses the encoder's weights, so there are roughly half as many parameters to learn, which acts as a form of regularization.
But of course they're not perfect: they may not be optimal when your data comes from a highly nonlinear manifold. Depending on the size of your data, I would try both approaches, with tied weights and without, if possible.
UPDATE:
You also asked why a representation that comes from an autoencoder with tied weights might be better than one without. Of course such a representation is not always better, but if the reconstruction error is sensible, then the different units in the coding layer represent something that can be thought of as generators of roughly orthogonal features explaining most of the variance in the data (much like PCA does). That is why such a representation can be quite useful in the subsequent phase of learning.