train_test_validation

Module containing the train-test validation checks in the nlp package.

Classes

LabelDrift

Calculate label drift between train dataset and test dataset, using statistical measures.
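
The description does not say which statistical measure is used. As a minimal sketch of the idea, assuming categorical labels and assuming Cramér's V as the measure, the hypothetical helper `cramers_v` below (not part of the package) scores how strongly the label depends on whether a sample came from train or test. A value of 0 means identical label distributions and 1 means fully disjoint ones.

```python
from collections import Counter
from math import sqrt


def cramers_v(train_labels, test_labels):
    """Cramér's V between dataset membership (train/test) and label.

    Hypothetical helper for illustration; the measure is an assumption.
    """
    train_counts = Counter(train_labels)
    test_counts = Counter(test_labels)
    categories = set(train_counts) | set(test_counts)
    n_train, n_test = len(train_labels), len(test_labels)
    n = n_train + n_test
    # Drift is undefined for an empty dataset and zero for a single category.
    if n_train == 0 or n_test == 0 or len(categories) < 2:
        return 0.0

    # Chi-squared statistic of the 2 x k contingency table.
    chi2 = 0.0
    for cat in categories:
        col_total = train_counts[cat] + test_counts[cat]
        for observed, row_total in (
            (train_counts[cat], n_train),
            (test_counts[cat], n_test),
        ):
            expected = row_total * col_total / n
            chi2 += (observed - expected) ** 2 / expected

    # With two rows, min(rows - 1, cols - 1) is 1, so V = sqrt(chi2 / n).
    return sqrt(chi2 / n)
```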

PropertyDrift

Calculate drift between train dataset and test dataset per text property, using statistical measures.
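
A minimal sketch of per-property drift, assuming the compared quantities are numeric values computed from each text (such as its length) and assuming the two-sample Kolmogorov-Smirnov statistic as the measure. The names `ks_statistic` and `property_drift` and the example properties are hypothetical, not the package's API.

```python
from bisect import bisect_right


def ks_statistic(train_values, test_values):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between the empirical CDFs."""
    train_sorted = sorted(train_values)
    test_sorted = sorted(test_values)
    if not train_sorted or not test_sorted:
        raise ValueError("both samples must be non-empty")
    max_gap = 0.0
    # The largest gap between two step functions occurs at an observed value.
    for x in set(train_sorted) | set(test_sorted):
        cdf_train = bisect_right(train_sorted, x) / len(train_sorted)
        cdf_test = bisect_right(test_sorted, x) / len(test_sorted)
        max_gap = max(max_gap, abs(cdf_train - cdf_test))
    return max_gap


def property_drift(train_texts, test_texts, properties):
    """Return one drift score per property (hypothetical helper)."""
    return {
        name: ks_statistic(
            [fn(text) for text in train_texts],
            [fn(text) for text in test_texts],
        )
        for name, fn in properties.items()
    }


# Example properties, chosen for illustration only.
EXAMPLE_PROPERTIES = {
    "text_length": len,
    "word_count": lambda text: len(text.split()),
}
```

Calling `property_drift(train_texts, test_texts, EXAMPLE_PROPERTIES)` yields a dictionary with one score in the range 0 to 1 per property, so a shifted property can be identified by name.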

TrainTestSamplesMix

Detect samples in the test data that also appear in the training data.
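
A minimal sketch of this kind of leakage detection, assuming two samples count as the same when their text matches after lowercasing and whitespace normalization. The matching rule and the names `normalize` and `find_mixed_samples` are assumptions for illustration, not the package's behavior.

```python
import re


def normalize(text):
    """Collapse whitespace runs, trim the ends, and lowercase the text."""
    return re.sub(r"\s+", " ", text).strip().lower()


def find_mixed_samples(train_texts, test_texts):
    """Return indices of test samples whose normalized text occurs in train."""
    train_seen = {normalize(text) for text in train_texts}
    return [
        index
        for index, text in enumerate(test_texts)
        if normalize(text) in train_seen
    ]


def mix_ratio(train_texts, test_texts):
    """Fraction of test samples that also appear in train (0.0 for empty test)."""
    if not test_texts:
        return 0.0
    return len(find_mixed_samples(train_texts, test_texts)) / len(test_texts)
```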

TextEmbeddingsDrift

Calculate drift between the train and test datasets using a model trained to distinguish between their embeddings.
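
The description says only that a model is trained to tell train embeddings from test embeddings; it does not name the model or the score. The sketch below substitutes a deliberately simple nearest-centroid classifier and converts its held-out accuracy into a drift score, where chance-level accuracy maps to 0 and perfect separation maps to 1. All function names are hypothetical, and each dataset is assumed to hold at least two embeddings of equal dimension.

```python
def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def _credit(own_dist, other_dist):
    """1 for a correct call, 0.5 for a tie, 0 for a wrong call."""
    if own_dist < other_dist:
        return 1.0
    if own_dist == other_dist:
        return 0.5
    return 0.0


def domain_classifier_accuracy(train_emb, test_emb):
    """Held-out accuracy of a nearest-centroid train-vs-test classifier."""
    # Even-indexed rows fit the centroids; odd-indexed rows evaluate them.
    fit_train, eval_train = train_emb[0::2], train_emb[1::2]
    fit_test, eval_test = test_emb[0::2], test_emb[1::2]
    c_train, c_test = centroid(fit_train), centroid(fit_test)
    total = 0.0
    for v in eval_train:
        total += _credit(sq_dist(v, c_train), sq_dist(v, c_test))
    for v in eval_test:
        total += _credit(sq_dist(v, c_test), sq_dist(v, c_train))
    return total / (len(eval_train) + len(eval_test))


def embeddings_drift_score(train_emb, test_emb):
    """Map accuracy in [0.5, 1] to a drift score in [0, 1]."""
    accuracy = domain_classifier_accuracy(train_emb, test_emb)
    return max(0.0, 2.0 * accuracy - 1.0)
```

If the classifier cannot beat chance on held-out embeddings, the two datasets are indistinguishable to that model and the score is 0. A stronger model would detect subtler shifts than this centroid rule can.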