I am following the example shown in http://scikit-learn.org/stable/auto_examples/svm/plot_oneclass.html#example-svm-plot-oneclass-py, where a one-class SVM is used for anomaly detection. This may be notation unique to scikit-learn, but I couldn't find an explanation of how to use the parameter nu passed to the OneClassSVM constructor.
In http://scikit-learn.org/stable/modules/svm.html#nusvc, it is stated that the parameter nu is a reparameterization of the parameter C (the regularization parameter I am familiar with), but it doesn't state how to perform that reparameterization.
Both a formula and an intuition will be much appreciated.
Thanks!
According to the documentation, nu specifies the nu parameter of the one-class SVM model. It is both a lower bound on the fraction of samples that are support vectors and an upper bound on the fraction of samples that lie on the wrong side of the hyperplane. In scikit-learn the default is 0.5 (the example linked in the question uses nu=0.1).
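As a quick empirical check, here is a minimal sketch (my own, not from the linked example) that fits a OneClassSVM on Gaussian data and verifies both bounds:

```python
import numpy as np
from sklearn.svm import OneClassSVM

rng = np.random.RandomState(42)
X = rng.randn(1000, 2)  # training data drawn from a standard normal

nu = 0.1
clf = OneClassSVM(nu=nu, kernel="rbf", gamma="scale").fit(X)

pred = clf.predict(X)  # +1 for inliers, -1 for outliers
frac_outliers = np.mean(pred == -1)        # should be at most ~nu
frac_support = len(clf.support_) / len(X)  # should be at least ~nu

print(f"nu = {nu}")
print(f"fraction flagged as outliers: {frac_outliers:.3f} (<= nu, approximately)")
print(f"fraction of support vectors:  {frac_support:.3f} (>= nu, approximately)")
```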
The nu-support vector classifier (NuSVC) is similar to SVC; the only difference is that NuSVC takes a nu parameter to control the number of support vectors.
Nu Support Vector Regression (NuSVR) applies the same idea to regression: the nu parameter replaces the epsilon parameter of the SVR method.
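The following sketch (again my own illustration, using standard scikit-learn estimators) shows the analogous effect of nu on NuSVC and NuSVR: a larger nu forces more support vectors.

```python
from sklearn.datasets import make_classification, make_regression
from sklearn.svm import NuSVC, NuSVR

# Classification: nu lower-bounds the fraction of support vectors
Xc, yc = make_classification(n_samples=500, random_state=0)
for nu in (0.05, 0.5):
    clf = NuSVC(nu=nu, gamma="scale").fit(Xc, yc)
    print(f"NuSVC nu={nu}: {len(clf.support_)} support vectors out of {len(Xc)}")

# Regression: nu plays the role that epsilon plays in plain SVR
Xr, yr = make_regression(n_samples=500, noise=10.0, random_state=0)
for nu in (0.05, 0.5):
    reg = NuSVR(nu=nu, gamma="scale").fit(Xr, yr)
    print(f"NuSVR nu={nu}: {len(reg.support_)} support vectors out of {len(Xr)}")
```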
The C parameter tells the SVM optimization how much you want to avoid misclassifying each training example. For large values of C, the optimization will choose a smaller-margin hyperplane if that hyperplane does a better job of getting all the training points classified correctly.
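To make this concrete, here is a small sketch (an illustration of mine, not part of the original answer) comparing a small and a large C on a linear SVC; the geometric margin width is 2/||w||:

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

X, y = make_blobs(n_samples=200, centers=2, cluster_std=2.0, random_state=0)

for C in (0.01, 100.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    margin = 2.0 / np.linalg.norm(clf.coef_)  # geometric margin width
    errors = np.mean(clf.predict(X) != y)     # fraction of training errors
    print(f"C={C}: margin width {margin:.3f}, training error {errors:.3f}")
```

With the small C you should see a wider margin (possibly at the cost of a few training errors); with the large C the margin shrinks.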
The problem with the parameter C is that it can take any positive value and has no direct interpretation in terms of the data. It is therefore hard to choose correctly, and one has to resort to cross-validation or direct experimentation to find a suitable value.
In response, Schölkopf et al. reformulated the SVM to take a new regularization parameter, nu:
The parameter nu is an upper bound on the fraction of margin errors and a lower bound on the fraction of support vectors, both relative to the total number of training examples. For example, if you set it to 0.05, you are guaranteed that at most 5% of your training examples are margin errors, i.e. misclassified or inside the margin (at the cost of a small margin, though), and that at least 5% of your training examples are support vectors.
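You can check both bounds empirically with NuSVC; the sketch below (my own, with misclassifications used as a conservative stand-in for margin errors) counts support vectors and training errors:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.svm import NuSVC

X, y = make_classification(n_samples=400, flip_y=0.1, random_state=1)

nu = 0.25
clf = NuSVC(nu=nu, kernel="rbf", gamma="scale").fit(X, y)

frac_sv = len(clf.support_) / len(X)  # should be >= nu
# Misclassified training points are a subset of margin errors,
# so their fraction is also bounded above by nu.
frac_err = np.mean(clf.predict(X) != y)

print(f"nu={nu}: support-vector fraction {frac_sv:.3f} (>= nu), "
      f"training-error fraction {frac_err:.3f} (<= nu)")
```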
The relation between C and nu is governed by the following formula:
nu = A + B/C
A and B are constants which are unfortunately not that easy to calculate.
The takeaway message is that the C-SVM and the nu-SVM are equivalent in terms of their classification power. The regularization in terms of nu is easier to interpret than C, but the nu-SVM is usually harder to optimize, and its runtime doesn't scale as well with the number of input samples as the C variant's.
More details (including formulas for A and B) can be found in: Chang, C.-C. and Lin, C.-J., "Training nu-Support Vector Classifiers: Theory and Algorithms", Neural Computation, 13(9), 2001.