integrity
Module containing all data integrity checks.
Classes
Search for various types of null values, including string representations of null.
Detect different variants of string categories.
Detect columns which contain a mix of numerical and string values.
Check whether any columns contain only a single unique value across all rows.
Search the column(s) for values that contain only special characters.
Detect strings whose length is much longer or shorter than the identified "normal" string lengths.
Detect different variants of string categories between the same categorical column in two datasets.
Check if dominant values have increased significantly between test and reference data.
Check for duplicate samples in the dataset.
Find new categories in the test set.
Find new labels in the test set.
Find samples that have exactly the same feature values but different labels.
Detect outliers in a dataset using the LoOP (Local Outlier Probabilities) algorithm.
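
A few of the checks listed above can be illustrated without the library. The following is a minimal sketch in plain Python, assuming each sample is a dictionary mapping column names to hashable values. The helper names (`single_value_columns`, `duplicate_rows`, `ambiguous_samples`, `new_categories`) and the sample data are hypothetical; they are not this module's classes or API, and the real checks may handle thresholds, nulls, and reporting differently.

```python
from collections import Counter, defaultdict

# Hypothetical helpers for illustration only; not the module's classes.


def single_value_columns(rows):
    """Return the columns that hold one unique value across all rows."""
    if not rows:
        return []
    columns = rows[0].keys()
    return [c for c in columns if len({row[c] for row in rows}) == 1]


def duplicate_rows(rows):
    """Return every sample that appears more than once, with its count."""
    # A sample is keyed by its sorted (column, value) pairs so that
    # dictionary ordering does not affect the comparison.
    counts = Counter(tuple(sorted(row.items())) for row in rows)
    return {key: n for key, n in counts.items() if n > 1}


def ambiguous_samples(rows, label):
    """Return feature combinations that occur with more than one label."""
    labels_by_features = defaultdict(set)
    for row in rows:
        features = tuple(sorted((k, v) for k, v in row.items() if k != label))
        labels_by_features[features].add(row[label])
    return {f: seen for f, seen in labels_by_features.items() if len(seen) > 1}


def new_categories(reference_rows, test_rows, column):
    """Return values of `column` present in the test rows but not in the reference rows."""
    known = {row[column] for row in reference_rows}
    return {row[column] for row in test_rows} - known


# Made-up sample data, used only to exercise the helpers.
train = [
    {"color": "red", "size": "S", "source": "web", "label": 0},
    {"color": "red", "size": "S", "source": "web", "label": 1},
    {"color": "blue", "size": "M", "source": "web", "label": 0},
    {"color": "blue", "size": "M", "source": "web", "label": 0},
]
test = [{"color": "green", "size": "S", "source": "web", "label": 1}]

print(single_value_columns(train))
print(duplicate_rows(train))
print(ambiguous_samples(train, "label"))
print(new_categories(train, test, "color"))
```

Each helper returns the offending columns, samples, or values rather than a pass/fail flag, which leaves the decision about what counts as a failure to the caller.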