Newest 'data-leakage+standardization' Questions

2 votes

3 answers

322 views

What is "information leak from test to train" ? Is stratification by target a leak?

It's common practice to do procedures such as standardization and even missing value imputation (commonly based on some means) after train/test split - otherwise it is treated as information leak from ...

Ars ML

31

asked Feb 23, 2023 at 20:11

1 vote

0 answers

585 views

Normalization and RidgeCV in Sklearn Pipeline - possible data leakage?

To avoid data leakage between the train and test set, I'm using sklearn's Pipeline as follows: ...

flanders

11

asked Jan 31, 2022 at 20:12

2 votes

1 answer

1k views

What is the difference between standardizing time series data and non-time series data?

From reading some answers on this site (1, 2, 3 and 4) I found that, on time series data, standardization must be applied separately on the train and test sets to avoid data leakage. So the train data ...

Marcus

265

asked Oct 28, 2020 at 14:30

1 vote

1 answer

347 views

Normalization of training and test set with data leakage

I have a time series data set for actual number of airport passengers. Within 15 years (2004 ~ 2019), just like having a trend, number of the passengers is increasing over time as the country is ...

S. Jay

35

asked Nov 21, 2019 at 1:12

1 vote

1 answer

236 views

Standardize data before plotting learning curve?

I have implemented cross validation function with hyper parameter tuning. Basically, doing the following: Split the data into 80% training, 20% testing apply cross validation with hyper parameter ...

Perl Del Rey

531

asked Nov 18, 2019 at 17:40

Stack Exchange Network

All Questions

What is "information leak from test to train" ? Is stratification by target a leak?

Normalization and RidgeCV in Sklearn Pipeline - possible data leakage?

What is the difference between standardizing time series data and non-time series data?

Normalization of training and test set with data leakage

Standardize data before plotting learning curve?

Hot Network Questions

All Questions

Related Tags