All Questions

0 votes
0 answers
41 views

Simple demonstration of imputation data leakage?

I'm aware that it's best practice to do all pre-processing within train-test splits, including data imputation. At least, it's recommended not to use the test data to generate the imputation model for ...
Evan
  • 225
4 votes
1 answer
7k views

When imputing missing values in a test set, should the new values come from the training set or be recalculated from the test set?

Both answers to this question on imputing missing values note that, when imputing missing values in a test set for model evaluation, the replacement values should be the ones calculated and used in ...
danpelota
  • 277
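
A minimal sketch of the distinction both questions above raise, assuming plain mean imputation on a single numeric feature. The toy data and the helper names `fit_mean_imputer` and `impute` are hypothetical and are not taken from either post. It contrasts filling test-set gaps with a statistic learned from the training set against recomputing that statistic from the test set itself.

```python
from statistics import mean


def fit_mean_imputer(values):
    """Learn the fill value: the mean of the observed (non-None) entries."""
    observed = [v for v in values if v is not None]
    return mean(observed)


def impute(values, fill_value):
    """Replace each missing entry (None) with the given fill value."""
    return [fill_value if v is None else v for v in values]


# Hypothetical toy feature with missing entries marked as None.
train = [1.0, 2.0, None, 3.0]
test = [None, 10.0, 20.0]

# Fit on the training set only, then reuse that value on the test set.
train_mean = fit_mean_imputer(train)
train_imputed = impute(train, train_mean)
test_imputed = impute(test, train_mean)

# The alternative discussed in the second question: recompute from the test set.
# Here the fill value depends on test data, which is what the first question
# describes as leakage when it feeds into model evaluation.
test_mean = fit_mean_imputer(test)
test_recomputed = impute(test, test_mean)

print("fill value from train:", train_mean)
print("test imputed from train:", test_imputed)
print("fill value from test:", test_mean)
print("test imputed from test:", test_recomputed)
```

In this toy case the two fill values differ a lot because the test feature is on a different scale than the training feature, which makes the effect of the choice easy to see. Real data would usually show a smaller gap.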