methodology
This module contains checks for methodological flaws in the model-building process.
Classes
Check for overfitting caused by using too many iterations in a gradient-boosted model.
Detect features that are nearly unused by the model.
Return the PPS (Predictive Power Score) of all features in relation to the label.
Return the Predictive Power Score of all features, in order to estimate each feature's ability to predict the label.
Check if test indexes are present in the train data.
Detect samples in the test data that also appear in the training data.
Check if test dates are present in the train data.
Check for test data that is dated earlier than the latest date in the train data.
Check if identifiers (Index/Date) can be used to predict the label.
Measure the model's average inference time (in seconds) per sample.
Verify the test dataset size by comparing it to the train dataset size.
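
To make two of the leakage checks described above concrete, here is a minimal standard-library sketch of the underlying idea. It is not this module's implementation: the function names `leaked_sample_ratio` and `date_overlap_ratio` are hypothetical, and rows are assumed to be hashable tuples rather than dataset objects.

```python
from datetime import date
from typing import Hashable, Sequence


def leaked_sample_ratio(
    train_rows: Sequence[Hashable], test_rows: Sequence[Hashable]
) -> float:
    """Fraction of test rows that also appear, value for value, in train.

    Hypothetical helper: illustrates the idea of detecting test samples
    that are duplicated in the training data.
    """
    if not test_rows:
        return 0.0
    train_set = set(train_rows)  # set membership keeps each lookup O(1)
    return sum(row in train_set for row in test_rows) / len(test_rows)


def date_overlap_ratio(
    train_dates: Sequence[date], test_dates: Sequence[date]
) -> float:
    """Fraction of test dates that are earlier than the latest train date.

    Hypothetical helper: illustrates the idea of flagging test data that
    predates the end of the training period.
    """
    if not train_dates or not test_dates:
        return 0.0
    latest_train = max(train_dates)
    return sum(d < latest_train for d in test_dates) / len(test_dates)


# Each row is a (feature, label) tuple; the row (3, 0) appears in both sets.
train = [(1, 0), (2, 1), (3, 0)]
test = [(3, 0), (4, 1)]
print(leaked_sample_ratio(train, test))  # 0.5

# One of the two test dates falls before the latest train date (2021-03-01).
train_dates = [date(2021, 1, 1), date(2021, 3, 1)]
test_dates = [date(2021, 2, 1), date(2021, 4, 1)]
print(date_overlap_ratio(train_dates, test_dates))  # 0.5
```

A ratio above zero in either function would suggest that the train/test split deserves a closer look; what threshold counts as a failure is a project-specific choice.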