Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What is difference between PCA , TruncatedSVD and ICA in details?

Can anybody tell me the differences between PCA(Principal component analysis ) , TruncatedSVD(Truncated singular value decomposition) and ICA(Independent component analysis) in detail?

like image 774
Ashok Lathwal Avatar asked Aug 05 '17 12:08

Ashok Lathwal


People also ask

What is the difference between PCA and ICA?

Specifically, PCA is often used to compress information i.e. dimensionality reduction. While ICA aims to separate information by transforming the input space into a maximally independent basis.

What is difference between PCA and SVD?

What is the difference between SVD and PCA? SVD gives you the whole nine-yard of diagonalizing a matrix into special matrices that are easy to manipulate and to analyze. It lay down the foundation to untangle data into independent components. PCA skips less significant components.

What is TruncatedSVD?

Truncated SVD generates the matrices with the specified number of columns, whereas SVD outputs n columns of matrices. It decreases the number of output and better works on the sparse matrices for features output.

What is the difference between truncated SVD and PCA?

TruncatedSVD is very similar to PCA , but differs in that the matrix does not need to be centered. When the columnwise (per-feature) means of are subtracted from the feature values, truncated SVD on the resulting matrix is equivalent to PCA.


1 Answers

Doing it in detail will require long pages PDF document :-).

But the idea is simple:

  1. Principal Component Analysis (PCA) - analyze the data native coordinates. Namely the coordinated which along the data has its most energy (Variance). For n Samples of dimension d there will be $ d $ orthogonal directions. Namely the data projected on them has no correlation. If we look on the data as Random Variables, it means we found a coordinate system where the Cross Correlation (First Moment) of any pair from the projected data is vanished.
    This is a very efficient way to approximate the data in lower dimensionality by keeping most of its energy.
  2. Truncated SVD - One could show that one of the ways of calculating those system of coordinate is using the SVD. Hence this is method to apply the ideas behind PCA.
  3. Independent Component Analysis (ICA) - This is one step farther from PCA. While in PCA we dealt with only First Order Moments of the data (Correlation) in ICA we're looking into higher moments and try to find a projection of data which vanishes higher moments (Think of Lack of Correlation vs. Probability Independence).
like image 186
Royi Avatar answered Sep 21 '22 17:09

Royi