Firstly, I am a beginner with Support Vector Machines, so I'm sorry if I am going about this problem in the wrong way. I am trying to implement a very simple SVM from scratch that uses the identity (i.e. linear) kernel function to classify linearly separable data into one of two classes. As an example of the sort of data I will be using, consider the plot in this document.
Using the points (1, 0), (3, 1) and (3, -1) as support vectors, we know that the following holds when calculating the decision plane (screenshotted from the same document):
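(In case the screenshot does not render: under the usual augmented-vector formulation, which is consistent with the multipliers quoted below, each support vector gets a bias entry of 1 appended, i.e. s1 = (1, 0, 1), s2 = (3, 1, 1), s3 = (3, -1, 1), and the multipliers α1, α2, α3 must satisfy

$$
\begin{aligned}
\alpha_1\,\tilde{s}_1\cdot\tilde{s}_1 + \alpha_2\,\tilde{s}_2\cdot\tilde{s}_1 + \alpha_3\,\tilde{s}_3\cdot\tilde{s}_1 &= -1\\
\alpha_1\,\tilde{s}_1\cdot\tilde{s}_2 + \alpha_2\,\tilde{s}_2\cdot\tilde{s}_2 + \alpha_3\,\tilde{s}_3\cdot\tilde{s}_2 &= +1\\
\alpha_1\,\tilde{s}_1\cdot\tilde{s}_3 + \alpha_2\,\tilde{s}_2\cdot\tilde{s}_3 + \alpha_3\,\tilde{s}_3\cdot\tilde{s}_3 &= +1
\end{aligned}
$$

where (1, 0) is the negative-class point and (3, 1), (3, -1) are the positive-class points. Plugging in the dot products gives the linear system 2α1 + 4α2 + 4α3 = -1, 4α1 + 11α2 + 9α3 = 1 and 4α1 + 9α2 + 11α3 = 1.)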
Fiddling with and rearranging these equations gives Lagrange multipliers of -3.5, 0.75 and 0.75 respectively.
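For this particular three-support-vector example, the multipliers can be recovered simply by solving that 3×3 linear system. A minimal sketch in Java (plain Gaussian elimination, no external library; the class name and the hard-coded Gram matrix are just for illustration):

```java
public class TinySvmExample {

    /** Solve A x = b with naive Gaussian elimination and partial pivoting. */
    static double[] solve(double[][] a, double[] b) {
        int n = b.length;
        for (int col = 0; col < n; col++) {
            // Pivot: swap in the row with the largest entry in this column.
            int pivot = col;
            for (int r = col + 1; r < n; r++) {
                if (Math.abs(a[r][col]) > Math.abs(a[pivot][col])) pivot = r;
            }
            double[] tmpRow = a[col]; a[col] = a[pivot]; a[pivot] = tmpRow;
            double tmpB = b[col]; b[col] = b[pivot]; b[pivot] = tmpB;

            // Eliminate the entries below the pivot.
            for (int r = col + 1; r < n; r++) {
                double factor = a[r][col] / a[col][col];
                for (int c = col; c < n; c++) a[r][c] -= factor * a[col][c];
                b[r] -= factor * b[col];
            }
        }
        // Back-substitution.
        double[] x = new double[n];
        for (int r = n - 1; r >= 0; r--) {
            double sum = b[r];
            for (int c = r + 1; c < n; c++) sum -= a[r][c] * x[c];
            x[r] = sum / a[r][r];
        }
        return x;
    }

    public static void main(String[] args) {
        // Gram matrix of the augmented support vectors (1,0,1), (3,1,1), (3,-1,1).
        double[][] gram = {
            {2,  4,  4},
            {4, 11,  9},
            {4,  9, 11}
        };
        // Right-hand side: the class labels (-1 for (1,0), +1 for the other two).
        double[] labels = {-1, 1, 1};

        double[] alphas = solve(gram, labels);
        // Prints roughly: [-3.5, 0.75, 0.75]
        System.out.println(java.util.Arrays.toString(alphas));
    }
}
```

For a system this small, any linear-algebra routine (or a library such as Apache Commons Math) would do equally well; the catch is that solving a square system like this only works once you already know which points are the support vectors, which is the hard part (see the edit below).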
I understand how this algebra works on paper, but I am unsure of the best approach when it comes to implementation. So my question is as follows: how are an SVM's Lagrange multipliers calculated in practice? Is there an algorithm I am missing that can determine these values for arbitrary linearly separable support vectors? Should I use a standard maths library to solve the linear equations (I am implementing the SVM in Java)? Would such a maths library be slow for large-scale learning? Note that this is a learning exercise, so I'm not just looking for a ready-made SVM library.
Any other advice would be much appreciated!
EDIT 1: LutzL made a good point that half the problem is actually determining which points are to be used as the support vectors, so to keep things simple assume for the purpose of this question that they have already been computed.
The support vector machine is designed to discriminate between data points belonging to two different classes. One set of points is labelled +1 and is called the positive class; the other set is labelled -1 and is called the negative class.
Lagrange's multiplier method: let (x0, y0, z0) ∈ S := {(x, y, z) : g(x, y, z) = 0} with ∇g(x0, y0, z0) ≠ 0. To find the local extrema of a function f on S, solve the system ∇f(x, y, z) = λ∇g(x, y, z) together with g(x, y, z) = 0. Solving these equations gives the values of the unknown variables x, y, z and λ, and the local extremum points are found among those solutions.
For the SVM we want to maximize the Lagrangian with respect to the multipliers α. Each α acts as a penalty: whenever a constraint is not satisfied, the corresponding term increases the Lagrangian, so maximizing L over the α's while minimizing it over w and b drives the solution towards the constrained optimum.
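To make that concrete, here is the standard hard-margin Lagrangian and the dual problem that the multipliers solve (standard textbook form, written out for reference rather than taken from the post above):

$$
L(\mathbf{w}, b, \boldsymbol{\alpha}) = \tfrac{1}{2}\lVert\mathbf{w}\rVert^2 - \sum_{i=1}^{m} \alpha_i\bigl[y_i(\mathbf{w}\cdot\mathbf{x}_i + b) - 1\bigr], \qquad \alpha_i \ge 0.
$$

Setting the derivatives with respect to w and b to zero gives w = Σᵢ αᵢ yᵢ xᵢ and Σᵢ αᵢ yᵢ = 0; substituting back leaves the dual problem in the α's alone:

$$
\max_{\boldsymbol{\alpha}}\; \sum_{i=1}^{m} \alpha_i - \tfrac{1}{2}\sum_{i=1}^{m}\sum_{j=1}^{m} \alpha_i\alpha_j\, y_i y_j\, \mathbf{x}_i\cdot\mathbf{x}_j
\quad \text{subject to} \quad \alpha_i \ge 0,\ \ \sum_{i=1}^{m} \alpha_i y_i = 0.
$$

Note that in this parameterization the αᵢ are non-negative and the label yᵢ carries the sign, so the -3.5 in the worked example above corresponds to α = 3.5 paired with y = -1.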
Independent of the kernel function, determining the coefficients leads to a quadratic optimization problem with linear positivity constraints. This has horrendous complexity if implemented naively by testing all boundary components, so you cannot avoid more advanced optimization algorithms such as barrier or trust-region methods.
There are also heuristic approaches that try to keep the optimization problem in low dimension by searching for point sets close to the separation line and eliminating points that are most probably far away from it.
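The working-set idea above is essentially what Sequential Minimal Optimization (SMO) does: it repeatedly picks a pair of multipliers and optimizes the dual over just that pair in closed form, which is why it (or a variant of it) sits inside most practical SVM implementations such as LIBSVM. Below is a heavily simplified SMO sketch in Java, modelled on the "simplified SMO" commonly used for teaching; it is meant to show the shape of the computation rather than be production quality, and the random partner selection and stopping rule are deliberately naive:

```java
import java.util.Random;

/** Simplified SMO for a linear (identity) kernel: a teaching sketch, not production code. */
public class SimpleSmo {
    final double[][] x;   // training points (assumes at least two of them)
    final double[] y;     // labels, +1 or -1
    final double[] alpha; // Lagrange multipliers, one per training point
    double b = 0.0;       // bias term
    final double c;       // box constraint (use a large value for a nearly hard margin)
    final double tol;     // KKT violation tolerance
    final Random rng = new Random(42);

    SimpleSmo(double[][] x, double[] y, double c, double tol) {
        this.x = x; this.y = y; this.c = c; this.tol = tol;
        this.alpha = new double[x.length];
    }

    double kernel(double[] u, double[] v) {        // linear kernel: plain dot product
        double s = 0;
        for (int k = 0; k < u.length; k++) s += u[k] * v[k];
        return s;
    }

    double decision(double[] p) {                  // f(p) = sum_i alpha_i * y_i * K(x_i, p) + b
        double s = b;
        for (int i = 0; i < x.length; i++) s += alpha[i] * y[i] * kernel(x[i], p);
        return s;
    }

    void train(int maxPasses) {
        int passes = 0, m = x.length;
        while (passes < maxPasses) {
            int changed = 0;
            for (int i = 0; i < m; i++) {
                double ei = decision(x[i]) - y[i];
                // Only touch alpha_i if it violates its KKT condition by more than tol.
                if ((y[i] * ei < -tol && alpha[i] < c) || (y[i] * ei > tol && alpha[i] > 0)) {
                    int j = rng.nextInt(m - 1); if (j >= i) j++;   // random partner j != i
                    double ej = decision(x[j]) - y[j];
                    double ai = alpha[i], aj = alpha[j];
                    // Bounds keeping alpha_i, alpha_j feasible for sum_i alpha_i * y_i = 0.
                    double lo = (y[i] != y[j]) ? Math.max(0, aj - ai) : Math.max(0, ai + aj - c);
                    double hi = (y[i] != y[j]) ? Math.min(c, c + aj - ai) : Math.min(c, ai + aj);
                    if (lo == hi) continue;
                    double eta = 2 * kernel(x[i], x[j]) - kernel(x[i], x[i]) - kernel(x[j], x[j]);
                    if (eta >= 0) continue;
                    // Closed-form optimum for alpha_j along the constraint line, then clip.
                    alpha[j] = aj - y[j] * (ei - ej) / eta;
                    alpha[j] = Math.max(lo, Math.min(hi, alpha[j]));
                    if (Math.abs(alpha[j] - aj) < 1e-5) continue;
                    alpha[i] = ai + y[i] * y[j] * (aj - alpha[j]);
                    // Update the bias so a free support vector sits exactly on its margin.
                    double b1 = b - ei - y[i] * (alpha[i] - ai) * kernel(x[i], x[i])
                                       - y[j] * (alpha[j] - aj) * kernel(x[i], x[j]);
                    double b2 = b - ej - y[i] * (alpha[i] - ai) * kernel(x[i], x[j])
                                       - y[j] * (alpha[j] - aj) * kernel(x[j], x[j]);
                    if (alpha[i] > 0 && alpha[i] < c) b = b1;
                    else if (alpha[j] > 0 && alpha[j] < c) b = b2;
                    else b = (b1 + b2) / 2;
                    changed++;
                }
            }
            passes = (changed == 0) ? passes + 1 : 0;
        }
    }
}
```

After train() finishes, the points with alpha[i] > 0 are the support vectors, the weight vector can be recovered as w = Σᵢ αᵢ yᵢ xᵢ, and new points are classified by the sign of decision(p). For linearly separable data like the example in the question, choose a large C so the box constraint effectively never binds.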