 

Why do support vectors in SVM have alpha (Lagrange multiplier) greater than zero?

I understand the overall SVM algorithm, including Lagrangian duality, but I am not able to understand why the Lagrange multiplier is greater than zero specifically for the support vectors.

Thank you.

asked Mar 07 '16 by Neel Shah


1 Answer

This might be a late answer but I am putting my understanding here for other visitors.

The Lagrange multiplier, usually denoted by α, is a vector of the weights of all the training points as support vectors.

Suppose there are m training examples. Then α is a vector of size m. Now focus on the ith element of α: αi. It captures the weight of the ith training example as a support vector. A higher value of αi means the ith training example matters more as a support vector; that is, when a prediction is made, the ith training example contributes more to the decision.
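This weighting shows up directly in the dual decision function f(x) = Σi αi yi K(xi, x) + b: only points with αi > 0 contribute at all. A minimal sketch of this, assuming scikit-learn is available; the toy dataset is made up for illustration, and a very large C is used to approximate the hard-margin case:

```python
import numpy as np
from sklearn.svm import SVC

# Hypothetical linearly separable toy data
X = np.array([[0., 0.], [0., 1.], [1., 0.],
              [3., 3.], [3., 4.], [4., 3.]])
y = np.array([-1, -1, -1, 1, 1, 1])

clf = SVC(kernel="linear", C=1e6).fit(X, y)  # large C ~ hard margin

# dual_coef_ stores alpha_i * y_i for the support vectors only;
# every stored coefficient is non-zero by construction
print("support vector indices:", clf.support_)
print("alpha_i * y_i:", clf.dual_coef_[0])

# Points not listed in clf.support_ have alpha_i = 0; they carry no
# weight in the decision function and could be deleted without
# changing the fitted boundary
assert (clf.dual_coef_ != 0).all()
assert len(clf.support_) < len(X)
```

Note that scikit-learn only keeps the non-zero entries of α (paired with their support vectors), which is exactly the "αi = 0 means not a support vector" convention discussed below.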

Now coming to the OP's concern:

I am not able to understand why particularly the Lagrangian multiplier is greater than zero for support vectors.

It is just a construct. When αi=0, the ith training example has zero weight as a support vector. Equivalently, you can say that the ith example is not a support vector.

Side note: One of the KKT conditions is complementary slackness: αigi(w)=0 for all i. A support vector must lie on the margin, which implies gi(w)=0. Then αi may or may not be zero; either way the complementary slackness condition is satisfied. For αi=0, you can choose whether or not to call such a point a support vector, per the discussion above. But for a non-support vector, gi(w) is not zero, so αi must be zero to satisfy complementary slackness.
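The complementary slackness condition can be checked numerically. A minimal sketch, assuming scikit-learn and reusing a made-up linearly separable toy set; here gi(w) = yi(w·xi + b) − 1, and a very large C approximates the hard-margin case:

```python
import numpy as np
from sklearn.svm import SVC

# Hypothetical linearly separable toy data
X = np.array([[0., 0.], [0., 1.], [1., 0.],
              [3., 3.], [3., 4.], [4., 3.]])
y = np.array([-1, -1, -1, 1, 1, 1])

clf = SVC(kernel="linear", C=1e6).fit(X, y)  # large C ~ hard margin
w, b = clf.coef_[0], clf.intercept_[0]

# g_i(w) = y_i (w . x_i + b) - 1, which is >= 0 for a separating hyperplane
g = y * (X @ w + b) - 1

# Recover the full alpha vector: zero everywhere except the support indices
alpha = np.zeros(len(X))
alpha[clf.support_] = np.abs(clf.dual_coef_[0])

# Complementary slackness: alpha_i * g_i(w) = 0 for every i --
# support vectors have g_i ~ 0, non-support vectors have alpha_i = 0
assert np.allclose(alpha * g, 0, atol=1e-3)
```

Either factor being (numerically) zero for each i is exactly the dichotomy above: a point is on the margin, or it has zero dual weight.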

answered Oct 17 '22 by Ankit Shubham