
How to obtain features' weights

I am dealing with a highly imbalanced data set, and my idea is to obtain the feature weights from my libSVM model. For now I am fine with the linear kernel, where I can obtain feature weights, but with rbf or poly I cannot do the same.

I am using sklearn for my model, and it is easy to obtain feature weights for the linear kernel using .coef_. Can anyone help me do the same thing for rbf or poly? What I've tried so far is given below:

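from sklearn.svm import SVC

# class_weight='balanced' replaces the old 'auto' option; degree must be an integer
svr = SVC(C=10, cache_size=200, class_weight='balanced', coef0=0.0, degree=3, gamma=0.12, kernel='rbf', max_iter=-1, probability=True, random_state=0, shrinking=True, tol=0.001, verbose=False)
clf = svr.fit(data_train, target_train)
print(clf.coef_)  # fails here: coef_ is only available for the linear kernel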

asked Jan 21 '14 14:01

Paul85

2 Answers

This is not only impossible, as stated in the documentation:

Weights assigned to the features (coefficients in the primal problem). This is only available in the case of linear kernel.

but also it doesn't make sense. In linear SVM the resulting separating plane is in the same space as your input features. Therefore its coefficients can be viewed as weights of the input's "dimensions".

With other kernels, the separating plane exists in another space, the result of the kernel transformation of the original space. Its coefficients are not directly related to the input space. In fact, for the rbf kernel the transformed space is infinite-dimensional (Wikipedia is a good starting point on this).
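
To make the point above concrete, here is a minimal sketch of what happens in practice. The toy data and the variable names (linear_clf, rbf_clf, rbf_has_coef) are assumptions made up for illustration; only SVC and its coef_ attribute come from scikit-learn.

```python
import numpy as np
from sklearn.svm import SVC

# Toy, linearly separable data (assumed for illustration only).
X = np.array([[3, 4], [1, 4], [2, 3], [6, -1], [7, -1], [5, -3]])
y = np.array([-1, -1, -1, 1, 1, 1])

linear_clf = SVC(kernel='linear').fit(X, y)
rbf_clf = SVC(kernel='rbf', gamma=0.12).fit(X, y)

# Linear kernel: one weight per input feature, shape (1, n_features)
# for a binary problem.
print('linear coef_ shape:', linear_clf.coef_.shape)

# Non-linear kernel: accessing coef_ raises AttributeError.
try:
    rbf_clf.coef_
    rbf_has_coef = True
except AttributeError as err:
    rbf_has_coef = False
    print('rbf:', err)
```

So the attribute is not merely empty for rbf; the library refuses to expose it at all.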

answered Sep 17 '22 19:09

BartoszKP

I was stuck with a similar problem but for a different reason. My objective was to compute the inference without using the built-in SVC.predict. Assuming that:

import numpy as np
from sklearn.svm import SVC

X = np.array([[3, 4], [1, 4], [2, 3], [6, -1], [7, -1], [5, -3]])
y = np.array([-1, -1, -1, 1, 1, 1])

clf = SVC(C=1e5, kernel='linear')
clf.fit(X, y)

I would like to compute predictions for a trained model using only algebra. The formula for linear inference is easy:

$$\hat{y}(x) = \operatorname{sign}\Big(\sum_j \alpha_j y_j \, (x_j \cdot x) + b\Big)$$

where the terms $\alpha_j y_j x_j$, summed over the support vectors, are collectively called the weights. What makes matters super easy is that clf.coef_ gets you the weights. So:

w = clf.coef_
b = clf.intercept_

assert np.sign(w.dot(X[0]) + b)[0] == clf.predict(X[0].reshape((1, 2)))

Side note: the sum of multiplications is exactly what dot does on two vectors, and the reshape of the input vector is needed to conform to the input shape that predict expects.

But of course, for other kernels it is more complicated than that. From the formula

$$\hat{y}(x) = \operatorname{sign}\Big(\sum_j \alpha_j y_j \, K(x, x_j) + b\Big)$$

and the previous answers, we cannot pre-compute the weights, since the terms $\alpha_j y_j K(x, x_j)$ are all tied together.

This is where I was stuck until I got some help from a friend, who discovered this documentation page. It says that $\alpha_j y_j$ is clf.dual_coef_ in scikit-learn terms. Once you know that, this equation becomes easy as well.
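
As a sanity check of that claim in the linear case, the weights can be rebuilt from the dual coefficients and compared with clf.coef_. This is a minimal sketch on the same toy data; the names w_manual, scores and manual_labels are mine, introduced only for illustration.

```python
import numpy as np
from sklearn.svm import SVC

X = np.array([[3, 4], [1, 4], [2, 3], [6, -1], [7, -1], [5, -3]])
y = np.array([-1, -1, -1, 1, 1, 1])

clf = SVC(C=1e5, kernel='linear').fit(X, y)

# dual_coef_ holds alpha_j * y_j for each support vector, shape (1, n_SV).
# Multiplying by the support vectors sums alpha_j * y_j * x_j over j,
# which is the weight vector of the linear model.
w_manual = clf.dual_coef_.dot(clf.support_vectors_)

# Decision values and labels for every sample, using only algebra.
scores = X.dot(w_manual.ravel()) + clf.intercept_[0]
manual_labels = np.where(scores > 0, 1, -1)

print(w_manual, clf.coef_)
print(manual_labels, clf.predict(X))
```

If the two printed weight arrays agree, dual_coef_ really is the $\alpha_j y_j$ term, and the same attribute can be reused with any kernel.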

We now know the value of $\alpha_j y_j$. One thing left to do is to calculate the kernel function, which depends on the type of kernel. For a polynomial kernel of 3rd degree (the default degree for poly SVM in scikit-learn), $K(x, x_j)$ roughly translates to np.power(clf.support_vectors_.dot(x), clf.degree) for a single sample x. **

Now let's combine everything we've learned into this code snippet:

import numpy as np
from sklearn.svm import SVC

X = np.array([[3, 4], [1, 4], [2, 3], [6, -1], [7, -1], [5, -3]])
y = np.array([-1, -1, -1, 1, 1, 1])

clf = SVC(kernel='poly', gamma=1)
clf.fit(X, y)

print('b = ', clf.intercept_)
print('Indices of support vectors = ', clf.support_)
print('Support vectors = ', clf.support_vectors_)
print('Number of support vectors for each class = ', clf.n_support_)
print('Coefficients of the support vector in the decision function = ', np.abs(clf.dual_coef_))

negative_prediction = clf.dual_coef_.dot(np.power(clf.gamma * clf.support_vectors_.dot(X[0]), clf.degree)) + clf.intercept_
positive_prediction = clf.dual_coef_.dot(np.power(clf.gamma * clf.support_vectors_.dot(X[4]), clf.degree)) + clf.intercept_

print('Compare both results')
print(negative_prediction, clf.decision_function(X[0].reshape((1, 2))))
print(positive_prediction, clf.decision_function(X[4].reshape((1, 2))))

assert np.sign(negative_prediction) == clf.predict(X[0].reshape((1, 2)))
assert np.sign(positive_prediction) == clf.predict(X[4].reshape((1, 2)))

If you run it you'll see that the assertions pass, WOO HOO! We can now predict the results without using predict, and I hope this helps with the question asked, since you can now adjust the dual coefficients the same way you wanted to adjust the weights.
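
The snippet above scores one sample at a time and leaves coef0 at zero. A possible vectorized variant, sketched under the assumption that gamma and coef0 are passed as explicit numbers, uses polynomial_kernel from sklearn.metrics.pairwise to build the whole kernel matrix at once. The values gamma=1.0 and coef0=1.0 and the names K, scores and manual_labels are illustrative choices, not part of the original answer.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics.pairwise import polynomial_kernel

X = np.array([[3, 4], [1, 4], [2, 3], [6, -1], [7, -1], [5, -3]])
y = np.array([-1, -1, -1, 1, 1, 1])

# gamma and coef0 are set explicitly so that clf.gamma and clf.coef0
# hold the numeric values actually used by the kernel.
clf = SVC(kernel='poly', degree=3, gamma=1.0, coef0=1.0).fit(X, y)

# K[i, j] = (gamma * <X[i], sv_j> + coef0) ** degree
K = polynomial_kernel(X, clf.support_vectors_,
                      degree=clf.degree, gamma=clf.gamma, coef0=clf.coef0)

# Sum of alpha_j * y_j * K(x, x_j) over the support vectors, plus b.
scores = K.dot(clf.dual_coef_.ravel()) + clf.intercept_[0]
manual_labels = np.where(scores > 0, clf.classes_[1], clf.classes_[0])

print(scores)
print(clf.decision_function(X))
```

Comparing against decision_function rather than only the sign of the prediction is a stricter check, because it catches a wrong gamma or coef0 even when the labels happen to agree.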

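** Please note that the manual calculation multiplies by clf.gamma, so it relies on gamma being passed as an explicit number, as in SVC(kernel='poly', gamma=1). If you do not set gamma explicitly, do not use clf.gamma in the manual calculation as-is, since it will just break otherwise. Also, this is an example of inference for the polynomial kernel with coef0 left at its default of 0; for other kernels the inference function should be adjusted accordingly. See the documentation: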

Source for formulas snapshots and much more info about SVM.
Relevant scikit learn documentation
The code snippet is based on something I've seen on Stack Overflow, but I've lost the source link, so I would like to thank and credit the original author (once I find him).
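
For the rbf kernel asked about in the question, the same recipe can be adapted by swapping in the Gaussian kernel, $K(x, x_j) = \exp(-\gamma \lVert x - x_j \rVert^2)$. The following is a minimal sketch, not a definitive implementation: it assumes a numeric gamma (0.12, borrowed from the question) and a binary problem, and the names diff, sq_dist, K and scores are mine.

```python
import numpy as np
from sklearn.svm import SVC

X = np.array([[3, 4], [1, 4], [2, 3], [6, -1], [7, -1], [5, -3]])
y = np.array([-1, -1, -1, 1, 1, 1])

# gamma is given as a number so that clf.gamma can be reused below.
clf = SVC(kernel='rbf', gamma=0.12).fit(X, y)

# Squared Euclidean distance between every sample and every support vector,
# shape (n_samples, n_SV).
diff = X[:, None, :] - clf.support_vectors_[None, :, :]
sq_dist = np.sum(diff ** 2, axis=2)

# K[i, j] = exp(-gamma * ||X[i] - sv_j||^2)
K = np.exp(-clf.gamma * sq_dist)

# Sum of alpha_j * y_j * K(x, x_j) over the support vectors, plus b.
scores = K.dot(clf.dual_coef_.ravel()) + clf.intercept_[0]
manual_labels = np.where(scores > 0, clf.classes_[1], clf.classes_[0])

print(scores)
print(clf.decision_function(X))
```

Note that this still yields one coefficient per support vector, not one per input feature, which is consistent with the first answer.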

answered Sep 20 '22 19:09

sleepyhead
