Using large input values with Auto Encoders

Tags:

matlab

autoencoder

I have created an autoencoder neural network in MATLAB. The inputs at the first layer are quite large, and I have to reconstruct them at the network's output layer. I cannot use the large inputs as they are, so I map them to the range [0, 1] using MATLAB's sigmf function. It returns 1.000000 for all the large values. I have tried changing the display format, but that does not help.

Is there a workaround to using large values with my auto encoder?

412

asked Jul 14 '14 08:07

Sasha

2 Answers

The process of converting your inputs to the range [0,1] is called normalization. However, as you noticed, the sigmf function is not adequate for this task. This link may be useful to you.
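To see why the sigmoid is a poor fit here, a minimal sketch may help. It writes the logistic function out by hand (the curve that sigmf produces with parameters [1 0]) and uses made-up input values, since the actual data is not shown in the question:

```matlab
% Logistic sigmoid written out by hand, so no toolbox is required.
logistic = @(x) 1 ./ (1 + exp(-x));

% Hypothetical "large" inputs (assumed values, not the asker's data).
x = [50; 1000; 1e6];
y = logistic(x);

% exp(-x) is below eps for these inputs, so 1 + exp(-x) rounds to exactly 1.
disp(exp(-x) < eps)   % true for every element
disp(y == 1)          % true for every element: the inputs become indistinguishable
```

Because the stored values are exactly 1 in double precision, changing the display format cannot bring the differences back.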

Suppose that your inputs are given by a matrix of N rows and M columns, where each row represents an input pattern and each column is a feature. If your first column is:

vec =

   -0.1941
   -2.1384
   -0.8396
    1.3546
   -1.0722

Then you can convert it to the range [0,1] using:

%# get max and min
maxVec = max(vec);
minVec = min(vec);

%# normalize to 0...1
vecNormalized = ((vec-minVec)./(maxVec-minVec))

vecNormalized =

    0.5566
         0
    0.3718
    1.0000
    0.3052
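
The same idea can be applied to the whole N-by-M matrix at once. The following is only a sketch under assumptions: the matrix X and its values are invented for illustration, and bsxfun is used so that it also runs on older MATLAB releases. It also shows how to map values back to the original scale, which is what you would do with the autoencoder's reconstruction:

```matlab
% Hypothetical data: 4 patterns (rows) x 2 features (columns), large values.
X = [1200 0.5; 3400 0.7; 560 0.1; 9800 0.9];

% Column-wise statistics; in practice compute them on the training data only.
minX = min(X, [], 1);
maxX = max(X, [], 1);
rangeX = maxX - minX;
rangeX(rangeX == 0) = 1;   % avoid division by zero for constant columns

% Scale every column to [0, 1].
Xn = bsxfun(@rdivide, bsxfun(@minus, X, minX), rangeX);

% Map normalized values back to the original scale
% (Xn is reused here in place of a real network reconstruction).
Xrec = bsxfun(@plus, bsxfun(@times, Xn, rangeX), minX);
```

Keeping minX and rangeX around matters: new inputs should be scaled with the same values, and the network's outputs are only meaningful after the inverse mapping.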

As @Dan indicates in the comments, another option is to standardize the data. The goal of this process is to scale the inputs to have a mean of 0 and a variance of 1. In this case, you need to subtract the mean value of the column and divide by its standard deviation:

meanVec = mean(vec);
stdVec = std(vec);

vecStandarized = (vec-meanVec)./ stdVec

vecStandarized =

    0.2981
   -1.2121
   -0.2032
    1.5011
   -0.3839
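
For a full matrix, a column-wise version could look like the sketch below. The data in X is made up for illustration, and the guard for constant columns is an addition of this sketch rather than part of the recipe above:

```matlab
% Hypothetical data: 4 patterns (rows) x 2 features (columns).
X = [1200 0.5; 3400 0.7; 560 0.1; 9800 0.9];

mu = mean(X, 1);          % per-column mean
sigma = std(X, 0, 1);     % per-column sample standard deviation
sigma(sigma == 0) = 1;    % leave constant columns unscaled

% Subtract the mean and divide by the standard deviation, column by column.
Xs = bsxfun(@rdivide, bsxfun(@minus, X, mu), sigma);

disp(mean(Xs, 1))    % approximately 0 for every column
disp(std(Xs, 0, 1))  % approximately 1 for every column
```

Note that standardized values are not confined to [0,1] and can be negative, so a sigmoid output layer cannot reproduce them exactly; the min-max version is probably the better match if the output layer stays a sigmoid.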

105

answered Nov 15 '22 03:11

Pablo EM

Before I give you my answer, let's think a bit about the rationale behind an auto-encoder (AE):
The purpose of an auto-encoder is to learn, in an unsupervised manner, something about the underlying structure of the input data. How does an AE achieve this goal? If it manages to reconstruct the input signal from its output signal (which is usually of lower dimension), it means that it did not lose information and that it effectively managed to learn a more compact representation.

In most examples it is assumed, for simplicity, that both the input signal and the output signal range over [0..1]. Therefore, the same non-linearity (sigmf) is applied both for computing the output signal and for reconstructing the inputs from the outputs.
Something like:

output = sigmf( W*input + b, [1 0] ); % compute output signal; [1 0] selects the standard logistic curve
reconstruct = sigmf( W'*output + b_prime, [1 0] ); % notice the different constant b_prime

Then the AE learning stage tries to minimize the reconstruction error || input - reconstruct ||.

However, who said the reconstruction non-linearity must be identical to the one used for computing the output?

In your case, the assumption that the inputs range over [0..1] does not hold. Therefore, it seems that you need to use a different non-linearity for the reconstruction. You should pick one that agrees with the actual range of your inputs.

If, for example, your inputs range over (0..inf), you may consider using exp or ().^2 as the reconstruction non-linearity. You may also use polynomials of various degrees, log, or whatever function you think fits the spread of your input data.
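
As a rough sketch of what this could look like for strictly positive inputs: the encoder keeps the sigmoid, while the reconstruction uses exp. Everything here is assumed for illustration (layer sizes, the random untrained weights, the made-up input vector x, and the hand-written logistic function); it only shows the forward pass, not the training:

```matlab
rng(0);                                  % fixed seed so the sketch is repeatable
logistic = @(z) 1 ./ (1 + exp(-z));      % hand-written sigmoid, no toolbox needed

nIn = 5;  nHidden = 3;                   % hypothetical layer sizes
x = 100 * rand(nIn, 1) + 1;              % made-up strictly positive, large inputs
W = 0.01 * randn(nHidden, nIn);          % untrained weights, for illustration only
b = zeros(nHidden, 1);
b_prime = zeros(nIn, 1);

code = logistic(W * x + b);              % compact representation, values in (0, 1)
reconstruct = exp(W' * code + b_prime);  % exp maps to (0, inf), matching the input range

err = norm(x - reconstruct);             % reconstruction error that training would minimize
```

With untrained weights the error is of course large; the point is only that the reconstruction is no longer forced into [0..1].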

Disclaimer: I have never actually encountered such a case and have not seen this type of solution in the literature. However, I believe it makes sense and is at least worth trying.

answered Nov 15 '22 05:11

Shai
