sklearn: how to get coefficients of polynomial features

Tags:

python

scikit-learn

I know it is possible to obtain the polynomial features as numbers by using polynomial_features.transform(X). According to the manual, for a degree of two the features are [1, a, b, a^2, ab, b^2]. But how do I obtain a description of the features for higher orders? .get_params() does not show any list of features.

629

asked Jul 08 '15 11:07

Moritz

2 Answers

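By the way, there is a more appropriate function now: PolynomialFeatures.get_feature_names. (In scikit-learn 1.0 it was renamed to get_feature_names_out, and the old name was removed in 1.2.)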

from sklearn.preprocessing import PolynomialFeatures
import pandas as pd
import numpy as np

data = pd.DataFrame.from_dict({
    'x': np.random.randint(low=1, high=10, size=5),
    'y': np.random.randint(low=-1, high=1, size=5),
})

p = PolynomialFeatures(degree=2).fit(data)
# On scikit-learn >= 1.0 this method is called get_feature_names_out.
print(p.get_feature_names(data.columns))

This outputs:

['1', 'x', 'y', 'x^2', 'x y', 'y^2']

N.B. You have to fit your PolynomialFeatures object before you can call get_feature_names(), because the number of input features is only known after fitting.

If you are a Pandas lover (as I am), you can easily build a DataFrame with all the new features like this:

features = pd.DataFrame(p.transform(data), columns=p.get_feature_names(data.columns))
print(features)

The result will look like this:

     1    x    y   x^2  x y  y^2
0  1.0  8.0 -1.0  64.0 -8.0  1.0
1  1.0  9.0 -1.0  81.0 -9.0  1.0
2  1.0  1.0  0.0   1.0  0.0  0.0
3  1.0  6.0  0.0  36.0  0.0  0.0
4  1.0  5.0 -1.0  25.0 -5.0  1.0

103

answered Sep 25 '22 08:09

prez

import numpy as np
from sklearn.preprocessing import PolynomialFeatures

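# fit_transform expects a 2-D array: one row (sample) with two features a=2, b=3
X = np.array([[2, 3]])

poly = PolynomialFeatures(3)
Y = poly.fit_transform(X)
print(Y)
# the single row holds 1, 2, 3, 4, 6, 9, 8, 12, 18, 27
# (recent scikit-learn versions show them as floats)
print(poly.powers_)

The last print statement outputs: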

[[0 0]
 [1 0]
 [0 1]
 [2 0]
 [1 1]
 [0 2]
 [3 0]
 [2 1]
 [1 2]
 [0 3]]

So if the i-th row of powers_ is (x, y), the i-th output feature is Y[0][i] = (a**x)*(b**y), where a and b are the two input features. For instance, in the code example the row [2 1] corresponds to (2**2)*(3**1) = 12.
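
The rule above can be turned into readable descriptions by hand. This is only a sketch: the helper describe and the names a and b are hypothetical, chosen here for illustration, and are not part of scikit-learn.

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures

X = np.array([[2, 3]])  # one sample: a=2, b=3
poly = PolynomialFeatures(3).fit(X)

def describe(powers, names):
    """Turn one row of powers_ into a term such as 'a^2*b' (hypothetical helper)."""
    parts = []
    for name, p in zip(names, powers):
        if p == 1:
            parts.append(name)
        elif p > 1:
            parts.append(f"{name}^{p}")
    # A row of all zeros is the constant (bias) column.
    return "*".join(parts) if parts else "1"

terms = [describe(row, ["a", "b"]) for row in poly.powers_]

# Recompute every output column directly from the exponents:
# for each row of powers_, multiply a**x by b**y.
manual = np.prod(X[:, np.newaxis, :] ** poly.powers_[np.newaxis, :, :], axis=2)

for term, value in zip(terms, manual[0]):
    print(term, "=", value)
```

The manual array should match poly.transform(X), which is a quick way to check that the description of each column is right.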

answered Sep 23 '22 08:09

omerbp
