What is the difference between StandardScaler and Normalizer in the sklearn.preprocessing module? Don't both do the same thing, i.e., remove the mean and scale using the standard deviation?
Normalize samples individually to unit norm. Each sample (i.e. each row of the data matrix) with at least one non-zero component is rescaled independently of other samples so that its norm (l1, l2 or inf) equals one. This transformer is able to work both with dense numpy arrays and scipy.sparse matrices.
Standardize features by removing the mean and scaling to unit variance. The standard score of a sample x is calculated as z = (x - u) / s, where u is the mean of the training samples or zero if with_mean=False, and s is the standard deviation of the training samples or one if with_std=False.
Thus, StandardScaler() will standardize the features, i.e. each column of X, INDIVIDUALLY, so that each column/feature/variable will have μ = 0 and σ = 1.
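A minimal sketch of this column-wise behavior (the array values are arbitrary example data):

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Two features on very different scales.
X = np.array([[1.0, 10.0],
              [2.0, 20.0],
              [3.0, 30.0]])

X_std = StandardScaler().fit_transform(X)

# Each COLUMN is now centered at 0 with standard deviation 1.
print(X_std.mean(axis=0))  # approximately [0. 0.]
print(X_std.std(axis=0))   # [1. 1.]
```

Note that each column is transformed using only its own mean and standard deviation; the rows play no special role.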
From the Normalizer docs:
Each sample (i.e. each row of the data matrix) with at least one non-zero component is rescaled independently of other samples so that its norm (l1 or l2) equals one.
And StandardScaler
Standardize features by removing the mean and scaling to unit variance
In other words, Normalizer acts row-wise and StandardScaler column-wise. Normalizer does not remove the mean or scale by the standard deviation; it rescales each whole row to unit norm.
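The row-wise rescaling can be sketched like this (again with arbitrary example values):

```python
import numpy as np
from sklearn.preprocessing import Normalizer

X = np.array([[3.0, 4.0],
              [1.0, 1.0]])

# Each ROW is divided by its own L2 norm, independently of other rows.
X_norm = Normalizer(norm='l2').fit_transform(X)

print(X_norm)                           # [3, 4] / 5 -> [0.6, 0.8]
print(np.linalg.norm(X_norm, axis=1))   # every row has norm 1
```

Note that the column means of `X_norm` are generally not zero: Normalizer never centers the data, it only scales each sample.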
This visualization and article by Ben help a lot in illustrating the idea.
The StandardScaler assumes your data is normally distributed within each feature. By "removing the mean and scaling to unit variance", you can see in the picture that the features now have the same "scale" regardless of their original ones.