load_data

load_data(data_format: str = 'Dataset', as_train_test: bool = True, modify_timestamps: bool = True, data_size: Optional[int] = 15000) → Union[Tuple, Dataset, DataFrame]

Load and return the Airbnb NYC 2019 dataset (regression).

Parameters
data_format : str, default: 'Dataset'

The format of the returned value, either 'Dataset' or 'Dataframe'. 'Dataset' returns the data as a Dataset object; 'Dataframe' returns the data as a pandas DataFrame object.

as_train_test : bool, default: True

If True, the returned data is split into train and test sets exactly as it was when the toy model was trained. The first return value is the train data and the second is the test data. To get this model, call the load_fitted_model() function. If False, a single object is returned.

modify_timestamps : bool, default: True

If True, the timestamp column of the returned data is modified to fall within the last 30 days. Otherwise, the timestamps are for March 2023.

data_size : Optional[int], default: 15000

The number of samples to return. If None, returns all the data.

Returns
dataset : Union[deepchecks.Dataset, pd.DataFrame]

The data object, in the format specified by the data_format parameter.

train_data, test_data : Tuple[Union[deepchecks.Dataset, pd.DataFrame], Union[deepchecks.Dataset, pd.DataFrame]]

Returned when as_train_test is True: a tuple of two objects holding the train and test splits of the dataset.
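
Example

Because the return type depends on as_train_test, calling code may want to handle both shapes uniformly. The following is a minimal sketch rather than part of the library: unpack_load_result and demo are hypothetical helper names, and the import path deepchecks.tabular.datasets.regression.airbnb is an assumption, since this page does not state the module that load_data lives in.

```python
from typing import Any, Optional, Tuple


def unpack_load_result(result: Any) -> Tuple[Any, Optional[Any]]:
    """Normalize the output of load_data to a (train, test) pair.

    Hypothetical helper, not part of deepchecks. With as_train_test=True
    the loader returns a 2-tuple; otherwise it returns a single object,
    which is reported here as (data, None).
    """
    if isinstance(result, tuple):
        if len(result) != 2:
            raise ValueError(
                f"expected a (train, test) pair, got {len(result)} items"
            )
        train, test = result
        return train, test
    return result, None


def demo() -> None:
    """Illustrative usage; requires deepchecks to be installed."""
    # Assumption: module path is not given on this page.
    from deepchecks.tabular.datasets.regression import airbnb

    # Split data as pandas DataFrames, limited to 1000 samples.
    train_df, test_df = unpack_load_result(
        airbnb.load_data(data_format="Dataframe", as_train_test=True, data_size=1000)
    )

    # Full, unsplit data: data_size=None returns all samples.
    full_df, no_test = unpack_load_result(
        airbnb.load_data(data_format="Dataframe", as_train_test=False, data_size=None)
    )
    print(type(train_df).__name__, type(test_df).__name__, no_test is None)
```

Calling demo() exercises both return shapes. The helper only inspects whether the result is a tuple, so it works the same way for 'Dataset' and 'Dataframe' output.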