 

Active Learning (e.g. Pool Sampling) for SVM in python [closed]

I'm working on a problem that would greatly benefit from an active learning protocol (i.e. given a set of unlabeled data and an existing model, the algorithm requests that a subset of the unlabeled data be labeled by an 'oracle').

Does anyone have any examples of active learning (pool-based sampling, query by committee, or otherwise) being implemented with an SVM, preferably in Python?

asked May 03 '16 by DrTchocky


People also ask

What is pool based active learning?

Active learning is a machine learning approach for reducing the data labeling effort. Given a pool of unlabeled samples, it tries to select the most useful ones to label, so that a model built from them can achieve the best possible performance.

What is stream Based selective sampling?

Stream-based selective sampling is a sequential strategy: data points arrive one at a time from the underlying distribution, and the learner decides for each instance individually whether or not to query the oracle for its label.
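The per-instance decision described above can be sketched with a scikit-learn SVM: examples "arrive" one at a time, and the model only asks for a label when the point falls close to its decision boundary. The dataset, seed size, and 0.5 uncertainty threshold are all illustrative assumptions, and the "oracle" here is just the true label array.

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)   # toy linearly separable labels

# seed the model with five examples of each class so the first fit is valid
seed = np.concatenate([np.where(y == 0)[0][:5], np.where(y == 1)[0][:5]])
labeled_X = [X[i] for i in seed]
labeled_y = [y[i] for i in seed]
clf = SVC(kernel="linear").fit(labeled_X, labeled_y)

queried = 0
stream = [i for i in range(300) if i not in set(seed)]
for i in stream:                                     # examples arrive one by one
    if abs(clf.decision_function([X[i]])[0]) < 0.5:  # uncertain -> query oracle
        labeled_X.append(X[i])
        labeled_y.append(y[i])                       # the "oracle" is just y here
        queried += 1
        clf.fit(labeled_X, labeled_y)                # retrain with the new label
```

Unlike pool-based sampling, no global ranking over the pool is computed; each query decision is made as the instance streams past.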

What is active learning AI?

Active learning is a subset of machine learning in which a learning algorithm can interactively query a user to label data with the desired outputs. In active learning, the algorithm proactively selects which examples from the pool of unlabeled data should be labeled next.


2 Answers

Implementing active learning in Python is quite straightforward. In the simplest case you just query the sample with the smallest absolute value of decision_function under your trained SVM (simple uncertainty sampling), which is basically a single line. Assuming a binary classification problem, with a trained SVM in clf and some unlabeled examples in X, you simply select

sample = X[np.argmin(np.abs(clf.decision_function(X)))] 
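The one-liner above can be expanded into a full pool-based query loop. The following is a minimal sketch: the dataset, the balanced seed set, and the number of query rounds are made-up assumptions, and the oracle is simulated by the true label array y.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, n_features=5, random_state=0)

# seed with five labeled examples per class so the first fit is valid
labeled = [int(i) for i in np.where(y == 0)[0][:5]] \
        + [int(i) for i in np.where(y == 1)[0][:5]]
pool = [i for i in range(200) if i not in labeled]

clf = SVC(kernel="linear")
for _ in range(20):                           # 20 query rounds
    clf.fit(X[labeled], y[labeled])
    # distance to the hyperplane: smallest absolute value = most uncertain
    scores = np.abs(clf.decision_function(X[pool]))
    query = pool.pop(int(np.argmin(scores)))  # remove from the pool...
    labeled.append(query)                     # ...and have the oracle label it

print(len(labeled))  # 30 labeled samples after 20 rounds
```

Each round retrains on the labeled set and queries the single most uncertain pool point; batch variants simply take the k smallest scores instead of one.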

You can find many different implementations on GitHub too, such as this one accompanying an active learning paper from last year's ECML: https://github.com/gmum/mlls2015

answered Sep 30 '22 by lejlot


Two popular query strategies for pool-based sampling are uncertainty sampling and query by committee (the active learning survey literature reviews both extensively). The following library implements three common uncertainty strategies (least confident, max margin, and entropy) as well as two committee strategies (vote entropy and average KL divergence): https://github.com/davefernig/alp

The library is compatible with scikit-learn and can be used with any classifier. It uses random subsampling as a baseline for measuring the benefit of active learning.
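As a rough illustration of the vote-entropy committee strategy mentioned above (this is not the alp library's API, just a hand-rolled sketch), a committee can be built from SVMs trained on bootstrap resamples of the labeled set, and the pool point the members disagree on most is queried. All sizes and names are illustrative.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.svm import SVC

X, y = make_classification(n_samples=100, random_state=0)
labeled = np.arange(20)      # pretend the first 20 points are labeled
pool = np.arange(20, 100)

# train 5 committee members on bootstrap resamples of the labeled set
rng = np.random.default_rng(0)
committee = []
for _ in range(5):
    idx = rng.choice(labeled, size=len(labeled), replace=True)
    committee.append(SVC(kernel="linear").fit(X[idx], y[idx]))

votes = np.array([m.predict(X[pool]) for m in committee])  # shape (5, 80)

def vote_entropy(col):
    # entropy of the label vote distribution for one pool point
    _, counts = np.unique(col, return_counts=True)
    p = counts / counts.sum()
    return -(p * np.log(p)).sum()

entropy = np.array([vote_entropy(votes[:, j]) for j in range(votes.shape[1])])
query = pool[int(np.argmax(entropy))]   # the most-disagreed-upon sample
```

Random subsampling (querying a uniformly random pool point) is the natural baseline to compare against, which is what the library above does.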

answered Sep 30 '22 by Vadim Smolyakov