DataDuplicates.run_logic#

DataDuplicates.run_logic(context: Context, dataset_type: str = 'train')[source]#

Run check.

Returns
CheckResult

percentage of duplicates and display of the top n_to_show most duplicated.