Welcome to Deepchecks!#
Deepchecks is the leading tool for testing and for validating your machine learning models and data, and it enables doing so with minimal effort. Deepchecks accompanies you through various validation and testing needs such as verifying your dataβs integrity, inspecting its distributions, validating data splits, evaluating your model and comparing between different models.
Join Our Community π
In addition to perusing the documentation, please feel free to ask questions on our Slack Community, or to post a issue or start a discussion on Github.
π» Installation#
In order to use deepchecks, you need to install it with pip:
# deepchecks for tabular data:
pip install deepchecks --upgrade
# for installing deepchecks including the computer vision subpackage (note - Pytorch should be installed separately):
pip install "deepchecks[vision]" --upgrade
For more installation details and best practices, check out our full installation instructions.
πββοΈ See It in Action#
For a quick start, check out the following examples in the tutorials section, to have deepchecks up and running in a few minutes:
Tabular Data#
Head over to one of our following quickstart tutorials, and have deepchecks running on your environment in less than 5 min:
Recommended - download the code and run it locally on the built-in dataset and (optional) model, or replace them with your own.
π See Our Checks Demo#
Play with some of the existing checks in our Interactive Checks Demo, and see how they work on various datasets with custom corruptions injected.
Computer Vision#
Note: CV Subpackage is in Beta Release
It is available for installation from PyPi, use at your own discretion. Github Issues for feedback and feature requests are welcome!
ππΌ When Should You Use Deepchecks?#
While youβre in the research phase, and want to validate your data, find potential methodological problems, and/or validate your model and evaluate it.
See the When Should You Use section for an elaborate explanation of the typical scenarios.
π Which Types of Checks Exist?#
Check out our Tabular Checks to see all the available checks for Tabular and Vision Checks for CV.
They are checks for different phases in the ML workflow:
Data Integrity
Train-Test Validation (Distribution and Methodology Checks)
Model Performance Evaluation
π§ How Does it Work?#
Deepchecks is built of checks, each designated to help to identify a specific issue. Some checks relate only to the data and labels and some require also the model. Suites are composed of checks. Each check contains outputs to display in a notebook and/or conditions with a pass/fail/warning output. For more information about deepchecks structure and components head over to our Deepchecks Hierarchy in the User Guide.
β What Do You Need in Order to Start?#
Depending on your phase and what you wish to validate, youβll need a subset of the following:
Raw data (before pre-processing such as OHE, string processing, etc.), with optional labels
The modelβs training data with labels
Test data (which the model isnβt exposed to) with labels
- A supported model that you wish to validate, including: scikit-learn, XGBoost, PyTorch, and more.For tabular data models see supported models, for more details about the supported model API.For Computer Vision, we currently support the PyTorch framework. See Data Classes to understand how to integrate your data.
π Viewing Check and Suite Results#
The packageβs output can be consumed in various formats:
Viewed inline in Jupyter (default behavior)
π’ Suported Data Types#
Deepchecks currently supports Tabular Data (deepchecks.tabular
) and is in beta release for Computer Vision (deepchecks.vision
).