 

How much time does it take to train an SVM classifier?

I wrote the following code and tested it on small data:

from sklearn import svm
from sklearn.multiclass import OneVsRestClassifier
classif = OneVsRestClassifier(svm.SVC(kernel='rbf'))
classif.fit(X, y)

where X and y are numpy arrays (X is a 30000x784 matrix, y is 30000x1). On small data the algorithm works well and gives me correct results.

But I started my program about 10 hours ago, and it is still running.

I want to know how long it will take, or whether it is stuck somehow. (Laptop specs: 4 GB memory, Core i5-480M.)

Il'ya Zhenin asked Aug 10 '13 18:08

People also ask

How to speed up SVM training time?

Even the prediction time is polynomial in the number of test vectors. If you really must use an SVM, I'd recommend a GPU speed-up or reducing the training dataset size. Try a sample of the data first (10,000 rows, maybe) to rule out an issue with the data format or distribution.
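
A minimal sketch of that subsampling idea, assuming X and y are the numpy arrays from the question (the 10,000-row sample size is just the figure suggested above):

import numpy as np
from sklearn import svm
from sklearn.multiclass import OneVsRestClassifier

rng = np.random.RandomState(0)  # fixed seed so the trial run is repeatable
idx = rng.choice(X.shape[0], 10000, replace=False)  # random 10,000-row sample

classif = OneVsRestClassifier(svm.SVC(kernel='rbf'))
classif.fit(X[idx], y[idx].ravel())  # trial fit on the sample only

If the trial fit behaves sensibly, the slowness on the full data is most likely just the SVM's training complexity, not a bug.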

Why train SVM with a faster framework?

Train SVM models up to 143 times faster. Do inference up to 600 times faster. Get the same quality of predictions as other tested frameworks. You get all of this without having to change your code or hardware.
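
The snippet above does not name the framework it means. As one illustration of the "no code changes" pattern (an assumption, not something stated in the source), Intel's scikit-learn-intelex patches scikit-learn in place:

from sklearnex import patch_sklearn
patch_sklearn()  # must run before importing the sklearn estimators it accelerates

from sklearn.svm import SVC
classif = SVC(kernel='rbf')  # same scikit-learn API, accelerated backend
classif.fit(X, y.ravel())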

What is an SVM classifier?

The SVM classifier is a supervised classification method. It is well suited for segmented raster input but can also handle standard imagery. It is a classification method commonly used in the research community.

What is the implementation of SVM based on?

The implementation is based on libsvm. The fit time complexity is more than quadratic in the number of samples, which makes it hard to scale to datasets with more than a couple of 10,000 samples. If you do not want to use kernels and a linear SVM suffices, there is LinearSVC.
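
As a concrete example of that last point, swapping in LinearSVC is a small change (a sketch; X and y are the arrays from the question):

from sklearn.svm import LinearSVC

# liblinear scales far better with n_samples than the libsvm-based SVC,
# and LinearSVC handles multiclass one-vs-rest natively, so no wrapper is needed
classif = LinearSVC()
classif.fit(X, y.ravel())  # LinearSVC expects a 1-D label array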


1 Answer

SVM training can take arbitrarily long; it depends on dozens of parameters:

  • C parameter - the greater the misclassification penalty, the slower the process
  • kernel - the more complicated the kernel, the slower the process (rbf is the most complex of the predefined ones)
  • data size/dimensionality - again, the same rule

In general, the basic SMO algorithm is O(n^3), so in the case of 30 000 data points it has to run a number of operations proportional to 30 000^3 = 27 000 000 000 000, which is a really huge number. What are your options?

  • change the kernel to a linear one - with 784 features, rbf may be redundant (see the sketch after this list)
  • reduce the features' dimensionality (PCA?)
  • lower the C parameter
  • train the model on a subset of your data to find good parameters, then train on the whole set on some cluster/supercomputer
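
A minimal sketch combining the first three options (the 50 PCA components and C=0.1 are illustrative guesses, not tuned values; X and y are the arrays from the question):

from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# reduce 784 features to 50 dimensions, then fit a linear SVM with a mild penalty
model = make_pipeline(PCA(n_components=50), LinearSVC(C=0.1))
model.fit(X, y.ravel())

Each of these changes cuts the effective problem size or per-iteration cost, so together they can turn a 10-hour fit into one that finishes in minutes.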
lejlot answered Oct 31 '22 23:10