Normalizing feature values for SVM

Tags:

I've been playing with some SVM implementations and I am wondering - what is the best way to normalize feature values to fit into one range? (from 0 to 1)

Let's suppose I have 3 features with values in ranges of:

3 - 5.
0.02 - 0.05
10-15.

How do I convert all of those values into range of [0,1]?

What If, during training, the highest value of feature number 1 that I will encounter is 5 and after I begin to use my model on much bigger datasets, I will stumble upon values as high as 7? Then in the converted range, it would exceed 1...

How do I normalize values during training to account for the possibility of "values in the wild" exceeding the highest(or lowest) values the model "seen" during training? How will the model react to that and how I make it work properly when that happens?

502

asked Dec 10 '13 22:12

user3010273

1 Answers

Besides scaling to unit length method provided by Tim, standardization is most often used in machine learning field. Please note that when your test data comes, it makes more sense to use the mean value and standard deviation from your training samples to do this scaling. If you have a very large amount of training data, it is safe to assume they obey the normal distribution, so the possibility that new test data is out-of-range won't be that high. Refer to this post for more details.

answered Nov 13 '22 06:11

lennon310

Related questions
                            
                                Why is logistic regression called regression? [closed]
                            
                                Manual split versus Scikit Grid Search
                            
                                Weights in Convolutional network?
                            
                                DBSCAN for clustering data by location and density
                            
                                What is the difference between the train loss and train error?
                            
                                How to resolve "IndexError: too many indices for array"
                            
                                calculate precision and recall in a confusion matrix
                            
                                Tensorflow: Using neural network to classify positive or negative phrases
                            
                                Dropout rate guidance for hidden layers in a convolution neural network
                            
                                How to build a Language model using LSTM that assigns probability of occurence for a given sentence
                            
                                Tensorflow.js tokenizer
                            
                                XGBoost Best Iteration
                            
                                Classification Report - Precision and F-score are ill-defined
                            
                                Is there some way to save best model only with tensorflow.estimator.train_and_evaluate()?
                            
                                In language modeling, why do I have to init_hidden weights before every new epoch of training? (pytorch)
                            
                                Single Perceptron - Non-linear Evaluating function
                            
                                Random Forests - Probability Estimates (+scikit-learn specific)
                            
                                Setting gamma and lambda in Reinforcement Learning
                            
                                GridSearchCV on LogisticRegression in scikit-learn
                            
                                data imbalance in SVM using libSVM

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Normalizing feature values for SVM

Tags:

range

machine-learning

svm

normalization

feature-selection

user3010273

People also ask

1 Answers

lennon310

Recent Activity

Donate For Us