.. DO NOT EDIT. .. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY. .. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE: .. "checks_gallery/tabular/train_test_validation/plot_date_train_test_leakage_duplicates.py" .. LINE NUMBERS ARE GIVEN BELOW. .. only:: html .. note:: :class: sphx-glr-download-link-note Click :ref:`here ` to download the full example code .. rst-class:: sphx-glr-example-title .. _sphx_glr_checks_gallery_tabular_train_test_validation_plot_date_train_test_leakage_duplicates.py: .. _plot_tabular_date_train_test_validation_leakage_duplicates: Date Train Test Leakage Duplicates ************************************** .. GENERATED FROM PYTHON SOURCE LINES 10-23 .. code-block:: default from datetime import datetime import pandas as pd from deepchecks.tabular import Dataset, Suite from deepchecks.tabular.checks import DateTrainTestLeakageDuplicates def dataset_from_dict(d: dict, datetime_name: str = None) -> Dataset: dataframe = pd.DataFrame(data=d) return Dataset(dataframe, datetime_name=datetime_name) .. GENERATED FROM PYTHON SOURCE LINES 24-26 Synthetic example with date leakage =================================== .. GENERATED FROM PYTHON SOURCE LINES 26-59 .. code-block:: default train_ds = dataset_from_dict({'col1': [ datetime(2021, 10, 1, 0, 0), datetime(2021, 10, 1, 0, 0), datetime(2021, 10, 1, 0, 0), datetime(2021, 10, 2, 0, 0), datetime(2021, 10, 2, 0, 0), datetime(2021, 10, 2, 0, 0), datetime(2021, 10, 3, 0, 0), datetime(2021, 10, 3, 0, 0), datetime(2021, 10, 3, 0, 0), datetime(2021, 10, 4, 0, 0), datetime(2021, 10, 4, 0, 0), datetime(2021, 10, 4, 0, 0), datetime(2021, 10, 5, 0, 0), datetime(2021, 10, 5, 0, 0) ]}, 'col1') test_ds = dataset_from_dict({'col1': [ datetime(2021, 9, 4, 0, 0), datetime(2021, 10, 4, 0, 0), datetime(2021, 10, 5, 0, 0), datetime(2021, 10, 6, 0, 0), datetime(2021, 10, 6, 0, 0), datetime(2021, 10, 7, 0, 0), datetime(2021, 10, 7, 0, 0), datetime(2021, 10, 8, 0, 0), datetime(2021, 10, 8, 0, 0), datetime(2021, 10, 9, 0, 0), datetime(2021, 10, 9, 0, 0) ]}, 'col1') DateTrainTestLeakageDuplicates(n_to_show=3).run(train_dataset=train_ds, test_dataset=test_ds) .. raw:: html
Date Train-Test Leakage (duplicates)


.. GENERATED FROM PYTHON SOURCE LINES 60-62 Synthetic example no date leakage ================================= .. GENERATED FROM PYTHON SOURCE LINES 62-81 .. code-block:: default train_ds = dataset_from_dict({'col1': [ datetime(2021, 10, 3, 0, 0), datetime(2021, 10, 3, 0, 0), datetime(2021, 10, 4, 0, 0), datetime(2021, 10, 4, 0, 0), datetime(2021, 10, 4, 0, 0), datetime(2021, 10, 5, 0, 0), datetime(2021, 10, 5, 0, 0) ]}, 'col1') test_ds = dataset_from_dict({'col1': [ datetime(2021, 11, 4, 0, 0), datetime(2021, 11, 4, 0, 0), datetime(2021, 11, 5, 0, 0), datetime(2021, 11, 6, 0, 0), ]}, 'col1') DateTrainTestLeakageDuplicates().run(train_dataset=train_ds, test_dataset=test_ds) .. raw:: html
Date Train-Test Leakage (duplicates)


.. rst-class:: sphx-glr-timing **Total running time of the script:** ( 0 minutes 0.077 seconds) .. _sphx_glr_download_checks_gallery_tabular_train_test_validation_plot_date_train_test_leakage_duplicates.py: .. only:: html .. container:: sphx-glr-footer sphx-glr-footer-example .. container:: sphx-glr-download sphx-glr-download-python :download:`Download Python source code: plot_date_train_test_leakage_duplicates.py ` .. container:: sphx-glr-download sphx-glr-download-jupyter :download:`Download Jupyter notebook: plot_date_train_test_leakage_duplicates.ipynb ` .. only:: html .. rst-class:: sphx-glr-signature `Gallery generated by Sphinx-Gallery `_