I have a question about cross validation.
In machine learning, we know there are training, validation, and test sets, and the test set is the final run to see how the finished model/classifier performs.
But in cross validation we split the data into a training set and a testing set (most tutorials use this term), so I'm confused. Do we need to split the whole data into 3 parts: training, validation, test? In cross validation we only ever talk about the relationship between 2 sets: training and the other.
Could someone help clarify?
Thanks
1 Answer. Is K-fold cross validation used to select the final model (or algorithm)? If yes, as you said, then the final model should be tested on an extra set that has no overlap with the data used in K-fold CV (i.e. a test set).
Cross-validation is usually the preferred method because it gives your model the opportunity to train on multiple train-test splits. This gives you a better indication of how well your model will perform on unseen data.
The goal of cross-validation is to estimate the expected level of fit of a model to a data set that is independent of the data that were used to train the model. It can be used to estimate any quantitative measure of fit that is appropriate for the data and model.
Cross-validation in your case would build k estimators (assuming k-fold CV), and then you can check the predictive power and variance of the technique on your data as follows: the mean of the quality measure (higher is better) and the standard deviation of the quality measure (lower means a more stable estimate).
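Here is a minimal sketch of that idea, assuming scikit-learn; the dataset and the classifier are just illustrative choices:

```python
# Summarise k-fold CV results by the mean and standard deviation of a score.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=1000)

# 5-fold CV: fits 5 estimators, each scored on its held-out fold
scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")

print("mean accuracy:", scores.mean())   # higher is better
print("std of accuracy:", scores.std())  # lower means a more stable estimate
```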
Yep, it's a little confusing, because some material uses CV/test interchangeably and some material doesn't, but I'll try to make it easy to understand by explaining why each set is needed:
You need the train set to do exactly that: train. But you also need a way to check that your algorithm isn't just memorizing the train set (that it's not overfitting) and to see how well it's doing. That's why you need a test set: data the model has never seen, on which you can measure performance.
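A minimal sketch of that split (scikit-learn assumed, dataset and classifier are illustrative):

```python
# Hold out a test set the model never sees during fitting, then score on it.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

clf = DecisionTreeClassifier().fit(X_train, y_train)
print("train accuracy:", clf.score(X_train, y_train))  # can be near-perfect (memorization)
print("test accuracy:", clf.score(X_test, y_test))     # the number that actually matters
```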
But... ML is all about experimentation. You will train, evaluate, tweak some knobs (hyperparameters or architectures), train again, evaluate again, over and over, and then select the best experiment. You deploy your system, in production it gets data it has never seen, and it doesn't perform that well. What happened? You used your test data to make decisions and effectively fit to it, so you overfitted to the test data, and you no longer know how the model does on truly unseen data.
Cross validation solves this. You have your train data to learn parameters and your test data to evaluate how the model does on unseen data, but you still need a way to experiment with hyperparameters and architectures: you take a sample of your training data and call it the cross validation set, and you hide your test data; you will NEVER use it until the end.
Now use your train data to learn parameters and experiment with hyperparameters and architectures, but evaluate each experiment on the cross validation data instead of the test data (you can see this as using the CV data to learn the hyperparameters). After you have experimented a lot and selected your best performing option (on CV), you finally use your test data to evaluate how it performs on data it has never seen, before deploying it to production.
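A rough sketch of this whole workflow, assuming scikit-learn (GridSearchCV does the cross-validation on the training data for you; the dataset, model, and parameter grid are illustrative):

```python
# Hide a test set, tune hyperparameters with CV on the training data only,
# and touch the test set exactly once at the end.
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# 1. Hide the test set -- never used until the very end
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# 2. Experiment: hyperparameters are chosen by 5-fold CV on the training data only
param_grid = {"C": [0.1, 1, 10], "gamma": ["scale", 0.1]}
search = GridSearchCV(SVC(), param_grid, cv=5).fit(X_train, y_train)
print("best params (chosen on the CV folds):", search.best_params_)

# 3. Final, one-off estimate of performance on truly unseen data
print("test accuracy:", search.score(X_test, y_test))
```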