SpecialCharacters.add_condition_samples_ratio_w_special_characters_less_or_equal

SpecialCharacters.add_condition_samples_ratio_w_special_characters_less_or_equal(max_ratio: float = 0.05, threshold_ratio_per_sample=0.2) → Self

Add a condition that passes when the ratio of samples containing more special characters than threshold_ratio_per_sample is less than or equal to max_ratio.

Parameters
max_ratio : float, default: 0.05

Maximum allowed ratio of samples whose special-character ratio exceeds the per-sample threshold.

threshold_ratio_per_sample : float, default: 0.2

Ratio of special characters within a single sample above which that sample is counted as exceeding the threshold.
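
Example

The following is a minimal sketch of how the two parameters interact. It does not reproduce the library's implementation. The helper names special_char_ratio and condition_passes are hypothetical, and treating ASCII punctuation as the set of special characters is an assumption made only for illustration.

```python
import string

# Assumption: "special" means ASCII punctuation here; the check's own
# definition of special characters may differ.
SPECIAL = set(string.punctuation)


def special_char_ratio(text: str) -> float:
    """Fraction of characters in `text` that are special characters."""
    if not text:
        return 0.0
    return sum(ch in SPECIAL for ch in text) / len(text)


def condition_passes(samples, max_ratio=0.05, threshold_ratio_per_sample=0.2):
    """Hypothetical stand-in for the condition's pass/fail logic."""
    if not samples:
        return True
    # A sample is flagged when its special-character ratio exceeds the
    # per-sample threshold.
    flagged = sum(
        special_char_ratio(s) > threshold_ratio_per_sample for s in samples
    )
    # The condition passes when the flagged share is at most max_ratio.
    return flagged / len(samples) <= max_ratio


samples = ["hello world", "plain text here", "#$%^&*!!", "ok"]
default_result = condition_passes(samples)
relaxed_result = condition_passes(samples, max_ratio=0.25)
```

With these four samples only the third is flagged, so the flagged share is 0.25. The sketch fails under the default max_ratio of 0.05 and passes once max_ratio is raised to 0.25.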