Scikit-learn (Python): what does f_regression() compute?

Tags:

I'm trying to understand what f_regression() in the feature selection package does. (http://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.f_regression.html#sklearn.feature_selection.f_regression)

According to the documentation, the first step in f_regression is as follows:

"1. the regressor of interest and the data are orthogonalized wrt constant regressors."

What does this line mean, exactly? What are these constant regressors?

Thanks!

724

asked Jul 18 '14 17:07

monkeybiz7

1 Answers

It means that the mean is subtracted on both variables.

A constant regressor is a vector full of ones. What this vector can explain in your data is then subtracted out. This leads to a vector with zero sum, i.e. a centered variable.

What f1_regression essentially calculates is correlation, a scalar product between centered and appropriately rescaled variables.

The resulting score is a function of this value and the degrees of freedom, i.e. the dimensionality of the vectors. The higher the score, the more probably the variables are associated.

answered Oct 16 '22 05:10

eickenberg

Related questions
                            
                                Making figure transparent with colored background
                            
                                Pyinstaller QtCore Module import error
                            
                                Subclass of numpy ndarray doesn't work as expected
                            
                                Get indices of intersecting rows of Numpy 2d Array
                            
                                How to use encrypted RSA private key with PyCrypto?
                            
                                Handling escaped quotes with Python's csv.reader
                            
                                Speeding up a numpy loop in python?
                            
                                Django mod_wsgi: Exception occurred processing wsgi script
                            
                                Pandas multi-index slices for level names
                            
                                how to add cookies to tornado httpclient
                            
                                Is there a way to use an AWS signed URL to perform a multipart upload?
                            
                                Python process hanging due to open Paramiko ssh connections
                            
                                Make HelloWorld python script executable
                            
                                How to write on new string uses byte 'wb' mode?
                            
                                How can I cleanly exit a Pyro Daemon by client request?
                            
                                Sample a truncated integer power law in Python?
                            
                                What is the best way to return a string to the SWIG python interface?
                            
                                Writing a synchronous test suite for an async tornado web socket server
                            
                                Python: display a matrix with negative and positive values [duplicate]
                            
                                Pycharm Type-Hinting of Class Fields / Instance Variables

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Scikit-learn (Python): what does f_regression() compute?

Tags:

python

scikit-learn

monkeybiz7

People also ask

1 Answers

eickenberg

Recent Activity

Donate For Us