When applying min-max scaling to normalize your features, do you apply it to the entire dataset before splitting it into training, validation, and test data?
Or do you split first and then apply min-max scaling to each set, using the min and max values from that specific set?
Lastly, when making a prediction on a new input, should the features of that input be normalized using the min and max values from the training data before being fed into the network?
Use MinMaxScaler if you want a light touch; it's non-distorting. Use RobustScaler if you have outliers and want to reduce their influence. Use Normalizer sparingly: it normalizes sample rows, not feature columns.
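As a minimal sketch of those differences (assuming scikit-learn; the toy array is made up for illustration):

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler, RobustScaler, Normalizer

X = np.array([[1.0, 200.0],
              [2.0, 400.0],
              [3.0, 1e6]])  # second feature contains an extreme outlier

# MinMaxScaler: rescales each feature column to [0, 1]; the outlier
# squashes the remaining values toward 0.
print(MinMaxScaler().fit_transform(X))

# RobustScaler: centers on the median and scales by the IQR, so the
# outlier has far less influence on the bulk of the data.
print(RobustScaler().fit_transform(X))

# Normalizer: scales each *row* to unit norm, a different operation
# entirely from per-feature scaling.
print(Normalizer().fit_transform(X))
```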
The test set must be scaled identically to the training set. The key point: do not scale the training and test sets with different scalers, as this could introduce random skew into the data.
Scaling data is essential before applying many Machine Learning techniques. For example, distance-based methods such as K-Nearest Neighbors, Principal Component Analysis, or Support-Vector Machines will artificially attribute great importance to a feature whose range is much broader than the others'.
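A quick illustration of that point, with made-up numbers (the feature values here are purely hypothetical):

```python
import numpy as np

# Two samples differing by 0.7 in a small-range feature (e.g. a 0-1 ratio)
# and by 1000 in a broad-range feature (e.g. income in dollars).
a = np.array([0.2, 50_000.0])
b = np.array([0.9, 51_000.0])

# The Euclidean distance is dominated almost entirely by the broad-range
# feature, so without scaling a distance-based method effectively ignores
# the first feature.
print(np.linalg.norm(a - b))  # ~1000.0; the 0.7 difference barely registers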
Split it, then scale. Imagine it this way: you have no idea what real-world data looks like, so you couldn't scale the training data to it. Your test data is the surrogate for real-world data, so you should treat it the same way.
To reiterate: Split, scale your training data, then use the scaling from your training data on the testing data.
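Putting it together, here is a minimal sketch of the split-then-scale workflow, assuming scikit-learn (the synthetic data and variable names are just for illustration). It also answers the third question: the same scaler fitted on the training data is reused for any new input at prediction time.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = rng.integers(0, 2, size=100)

# 1. Split first, before any scaling.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# 2. Fit the scaler on the training data only.
scaler = MinMaxScaler().fit(X_train)
X_train_scaled = scaler.transform(X_train)

# 3. Reuse the same fitted scaler (training min/max) on the test set...
X_test_scaled = scaler.transform(X_test)

# ...and on any new input at prediction time.
x_new = np.array([[0.5, -1.2, 0.3]])
x_new_scaled = scaler.transform(x_new)
```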