Canonical Correlation Analysis in Python with sklearn

Tags:

I'm trying to use sklearn to carry out Canonical Correlation Analysis (CCA). I'm starting with the simple example that is included in the manual:

from sklearn.cross_decomposition import CCA
X = [[0., 0., 1.], [1.,0.,0.], [2.,2.,2.], [3.,5.,4.]]
Y = [[0.1, -0.2], [0.9, 1.1], [6.2, 5.9], [11.9, 12.3]]
cca = CCA(n_components=1)
cca.fit(X, Y)

X_c, Y_c = cca.transform(X, Y)

I understand that in cca.x_weights_ I get the "canonical coefficents", i.e., the linear combinations of the original X variables (the columns of matrices "A" and "B" returned by MATLAB). However, where are the the "canonical correlations", i.e, the maximum correlation reached when applying the transformation given by the canonical coeficients (i.e., vector "r" returned by MATLAB). Is it possible to also get that in Python?

451

asked Oct 10 '14 11:10

manu

1 Answers

You can calculate the correlations using the outputs of .transfrom. This can be done with either numpy or scipy. I prefer scipy's stats module:

X_c, Y_c = cca.transform(X, Y)
import scipy.stats
corrcoef,p_value = scipy.stats.pearsonr(X_c,Y_c)

Clearly, since in your case you don't have enough samples (i.e., n < p+q), you're correlation is 1.

119

answered Sep 18 '22 14:09

idnavid

Related questions
                            
                                Why is .sum() faster than .any() or .max()?
                            
                                How to make dynamic content display across a grid using Django with Bootstrap?
                            
                                How can I access Firefox's internal indexedDB files using Python?
                            
                                Google OAuth API - Python client import error
                            
                                Statistic estimation of total nodes in a tree where edge traversal is expensive
                            
                                django reg extend - current transaction is aborted, commands ignored until end of transaction block
                            
                                Mongo connections never released - Django and Mongoengine running on gunicorn with gevent
                            
                                Doing "group by" in django but still retaining complete object
                            
                                gevent profiler for long running code
                            
                                How to monitor the active window on a remote PC
                            
                                Always treat a ForeignKey field like it was in raw_id_fields in Django Admin
                            
                                cannot add inline to django site admin framework
                            
                                Reset sorl Thumbnail for a Django site
                            
                                Programming function containing cut in negative imaginary axis
                            
                                How to prevent Scrapy from URL encoding request URLs
                            
                                PyDev + Django - undefined variables from import
                            
                                Flask-login not working as expected
                            
                                Multiple Levels of Toctree's in Python-Sphinx
                            
                                WebSocket closes after 1000 messages

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Canonical Correlation Analysis in Python with sklearn

Tags:

python

matlab

scikit-learn

correlation

manu

People also ask

1 Answers

idnavid

Recent Activity

Donate For Us