train_test_validation
Module containing the train-test validation checks.
Classes
- Find new categories in the test set.
- Verify the test dataset size by comparing it to the train dataset size.
- Check if test dates are present in the train data.
- Check for test data that is dated earlier than the latest date in the train data.
- Check if identifiers (Index/Date) can be used to predict the label.
- Check if test indexes are present in the train data.
- Find new labels in the test set.
- Return the Predictive Power Score of all features, in order to estimate each feature's ability to predict the label.
- Detect different variants of string categories between the same categorical column in two datasets.
- Calculate drift between the train dataset and the test dataset per feature, using statistical measures.
- Calculate label drift between the train dataset and the test dataset, using statistical measures.
- Detect samples in the test data that also appear in the training data.
- Calculate drift between the entire train and test datasets using a model trained to distinguish between them.
- Calculate drift between the entire train and test datasets using a model trained to distinguish between them.
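
Several of the checks described above reduce to simple set comparisons between the train and test data. The following is a minimal sketch of that idea in plain Python. It is not the module's implementation: the helper names (`new_values`, `size_ratio`, `overlap_ratio`) and the toy rows are hypothetical, chosen only to illustrate new categories or labels, relative dataset size, index overlap, and samples shared by both sets.

```python
from typing import Hashable, Iterable, Sequence


def new_values(train: Iterable[Hashable], test: Iterable[Hashable]) -> set:
    """Values that occur in test but never in train (new categories or labels)."""
    return set(test) - set(train)


def size_ratio(train: Sequence, test: Sequence) -> float:
    """Size of the test dataset relative to the train dataset."""
    if len(train) == 0:
        raise ValueError("train dataset is empty")
    return len(test) / len(train)


def overlap_ratio(train: Iterable[Hashable], test: Sequence[Hashable]) -> float:
    """Fraction of test entries (indexes, dates, or whole rows) also found in train."""
    if len(test) == 0:
        return 0.0
    seen = set(train)
    return sum(1 for item in test if item in seen) / len(test)


# Hypothetical toy data: each row is (index, category, label).
train_rows = [(0, "red", 0), (1, "blue", 1), (2, "red", 0), (3, "green", 1)]
test_rows = [(3, "green", 1), (4, "purple", 2)]

# Categories and labels that the model never saw during training.
new_categories = new_values((r[1] for r in train_rows), (r[1] for r in test_rows))
new_labels = new_values((r[2] for r in train_rows), (r[2] for r in test_rows))

# Relative size of the test set.
ratio = size_ratio(train_rows, test_rows)

# Share of test indexes that already appear among the train indexes.
index_leak = overlap_ratio((r[0] for r in train_rows), [r[0] for r in test_rows])

# Share of test samples (index column dropped) that also appear in train.
samples_mix = overlap_ratio((r[1:] for r in train_rows), [r[1:] for r in test_rows])

print(new_categories, new_labels, ratio, index_leak, samples_mix)
```

In practice each of these quantities would be compared against a threshold chosen for the dataset at hand; the sketch only computes the raw values.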