I have seen description about the dropout in different parts of the neural network:
dropout in the weight matrix,
dropout in the hidden layer after the matrix multiplication and before relu,
dropout in the hidden layer after the relu,
and dropout in the output score prior to the softmax function
I am a little confused about where I should perform the dropout. Could someone help elaborate about that? Thanks!
So:
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With