train_test_validation#
Module contains checks of train test validation checks.
Classes
Find new categories in the test set. |
|
Verify test dataset size comparing it to the train dataset size. |
|
Check if test dates are present in train data. |
|
Check test data that is dated earlier than the latest date in train. |
|
Check if identifiers (Index/Date) can be used to predict the label. |
|
Check if test indexes are present in train data. |
|
Find new labels in test. |
|
Return the Predictive Power Score of all features, in order to estimate each feature's ability to predict the label. |
|
Detect different variants of string categories between the same categorical column in two datasets. |
|
Calculate drift between train dataset and test dataset per feature, using statistical measures. |
|
Calculate label drift between train dataset and test dataset, using statistical measures. |
|
Detect samples in the test data that appear also in training data. |
|
Calculate drift between the entire train and test datasets using a model trained to distinguish between them. |
|
Calculate drift between the entire train and test datasets using a model trained to distinguish between them. |