SpecialCharacters.add_condition_samples_ratio_w_special_characters_less_or_equal
- SpecialCharacters.add_condition_samples_ratio_w_special_characters_less_or_equal(max_ratio: float = 0.05, threshold_ratio_per_sample=0.2) → Self
Add condition: the ratio of samples whose share of special characters exceeds threshold_ratio_per_sample must be less than or equal to max_ratio.
- Parameters
- max_ratio : float, default: 0.05
Maximum allowed ratio of samples that exceed the per-sample threshold.
- threshold_ratio_per_sample : float, default: 0.2
Threshold ratio of special characters in a single sample; samples above it count toward max_ratio.
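
The interaction of the two parameters can be illustrated with a minimal pure-Python sketch. It does not use the library: the helpers `special_char_ratio` and `condition_passes` are hypothetical names, and treating every character that is neither alphanumeric nor whitespace as "special" is an assumption made for illustration. The check's own definition of special characters may differ.

```python
def special_char_ratio(text: str) -> float:
    """Share of characters that are neither alphanumeric nor whitespace.

    Assumption: this stands in for the check's notion of a special character.
    """
    if not text:
        return 0.0
    special = sum(1 for ch in text if not ch.isalnum() and not ch.isspace())
    return special / len(text)


def condition_passes(samples, max_ratio=0.05, threshold_ratio_per_sample=0.2):
    """Return (passed, ratio of samples above the per-sample threshold)."""
    if not samples:
        return True, 0.0
    # A sample is flagged when its special-character share exceeds the threshold.
    flagged = sum(
        1 for s in samples if special_char_ratio(s) > threshold_ratio_per_sample
    )
    ratio = flagged / len(samples)
    # The condition holds when the flagged share is less than or equal to max_ratio.
    return ratio <= max_ratio, ratio


samples = ["Hello world", "Plain text here", "@@@###!!!", "All good"]
passed, ratio = condition_passes(samples)
print(passed, ratio)
```

With the defaults, one flagged sample out of four is enough to fail the condition in this sketch; raising max_ratio loosens it. Because the documented method returns Self, it can presumably be chained directly on a SpecialCharacters instance.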