Neural Networks - Difference between deep autoencoder and stacked autoencoder [closed]

Disclaimer: I also posted this question on CrossValidated but it is not receiving any attention. If this is not the place for it I will gladly remove it.

As I understand it, the only difference between them is how the two networks are trained. Deep autoencoders are trained in the same way as a single-layer neural network, while stacked autoencoders are trained with a greedy, layer-wise approach. Hugo Larochelle confirms this in a comment on this video. Is this the ONLY difference? Any pointers?

asked Mar 15 '18 10:03

RiccB

1 Answer

The terminology in the field isn't fixed or clearly defined, and different researchers can mean different things by the same term or attach different aspects to it. Example discussions:

What is the difference between Deep Learning and traditional Artificial Neural Network machine learning? (Some people think that 2 layers is deep enough; others mean 10+ or 100+ layers.)
Multi-layer perceptron vs deep neural network (mostly synonyms, but some researchers prefer one term over the other).

As for autoencoders, according to various sources, "deep autoencoder" and "stacked autoencoder" are exact synonyms. For example, here's a quote from "Hands-On Machine Learning with Scikit-Learn and TensorFlow":

Just like other neural networks we have discussed, autoencoders can have multiple hidden layers. In this case they are called stacked autoencoders (or deep autoencoders).

Later on, the author discusses two methods of training an autoencoder and uses both terms interchangeably.
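To make the two training methods concrete, here is a minimal NumPy sketch (not taken from the book; the toy data, layer sizes, learning rate, and the helper names `train_jointly` and `train_greedy` are all assumptions made for illustration). It builds the same tanh-encoder / linear-decoder architecture twice: once trained end to end with backpropagation, and once trained greedily, one shallow autoencoder at a time.

```python
import numpy as np

rng = np.random.default_rng(0)
# Toy data (assumption): 256 samples in 8 dimensions with 3 underlying factors.
X = rng.normal(size=(256, 3)) @ rng.normal(size=(3, 8)) / np.sqrt(3)

def init(n_in, n_out):
    return rng.normal(scale=1 / np.sqrt(n_in), size=(n_in, n_out))

def reconstruct(X, encoders, decoders):
    H = X
    for W in encoders:          # tanh encoder layers
        H = np.tanh(H @ W)
    for W in decoders:          # linear decoder layers
        H = H @ W
    return H

def mse(X, encoders, decoders):
    return float(np.mean((reconstruct(X, encoders, decoders) - X) ** 2))

def train_jointly(X, sizes, epochs=300, lr=0.01):
    """Train all layers together with plain gradient descent on squared error."""
    pairs = list(zip(sizes[:-1], sizes[1:]))
    enc = [init(a, b) for a, b in pairs]
    dec = [init(b, a) for a, b in pairs][::-1]
    for _ in range(epochs):
        acts = [X]                              # encoder activations
        for W in enc:
            acts.append(np.tanh(acts[-1] @ W))
        outs = [acts[-1]]                       # decoder outputs
        for W in dec:
            outs.append(outs[-1] @ W)
        delta = (outs[-1] - X) / len(X)         # gradient w.r.t. the reconstruction
        for i in reversed(range(len(dec))):     # backprop through linear decoders
            grad = outs[i].T @ delta
            delta = delta @ dec[i].T
            dec[i] -= lr * grad
        for i in reversed(range(len(enc))):     # backprop through tanh encoders
            delta = delta * (1 - acts[i + 1] ** 2)
            grad = acts[i].T @ delta
            delta = delta @ enc[i].T
            enc[i] -= lr * grad
    return enc, dec

def train_greedy(X, sizes, epochs=300, lr=0.01):
    """Train one shallow autoencoder per layer on the previous layer's codes."""
    enc, dec, H = [], [], X
    for n_hidden in sizes[1:]:
        (W_enc,), (W_dec,) = train_jointly(H, [H.shape[1], n_hidden], epochs, lr)
        enc.append(W_enc)
        dec.insert(0, W_dec)    # decoders are applied in reverse order
        H = np.tanh(H @ W_enc)  # codes become the next layer's training data
    return enc, dec

sizes = [8, 6, 3]
joint = train_jointly(X, sizes)
greedy = train_greedy(X, sizes)
print("joint  layer shapes:", [W.shape for W in joint[0] + joint[1]])
print("greedy layer shapes:", [W.shape for W in greedy[0] + greedy[1]])
print("reconstruction MSE (joint, greedy):", mse(X, *joint), mse(X, *greedy))
```

Both calls return weight lists with identical shapes, which is the point: the resulting network is the same kind of object either way, and only the procedure that produced the weights differs.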

I would agree that the term "stacked" suggests that an autoencoder can be extended with new layers without retraining, but this is actually true regardless of how the existing layers have been trained (jointly or separately). Also, regardless of the training method, researchers may or may not consider the result deep enough. So I wouldn't focus too much on terminology. It may stabilize some day, but it hasn't yet.

answered Oct 30 '22 09:10

Maxim

Tags:

machine-learning

neural-network

deep-learning

autoencoder