Beta. Content is under active construction and has not been peer-reviewed. Report errors on
GitHub
.
Disclaimer
Theorem
Path
Curriculum
Paths
Demos
Diagnostic
Search
Quiz Hub
/
Train-Test Split and Data Leakage
Train-Test Split and Data Leakage
2 questions
Difficulty 2-2
View topic
Foundation
0 / 2
2 foundation
Adapts to your performance
1 / 2
foundation (2/10)
conceptual
Why do we split data into training, validation, and test sets (three separate sets, not just two)?
Hide and think first
A.
Three distinct sets are required because each model goes through three separate training phases during the learning process
B.
The test set is needed because validation data has too much noise
C.
Train fits the model, validation selects hyperparameters, test gives an unbiased final estimate
D.
A two-way train/test split is always sufficient for complete and proper model evaluation in all practical scenarios
Submit Answer