load_data#
- load_data(data_format: str = 'Dataset', as_train_test: bool = True, modify_timestamps: bool = True, data_size: Optional[int] = 15000) Union[Tuple, Dataset, DataFrame] [source]#
Load and returns the Airbnb NYC 2019 dataset (regression).
- Parameters
- data_formatstr , default: Dataset
Represent the format of the returned value. Can be ‘Dataset’|’Dataframe’ ‘Dataset’ will return the data as a Dataset object ‘Dataframe’ will return the data as a pandas Dataframe object
- as_train_testbool , default: True
If True, the returned data is split into train and test exactly like the toy model was trained. The first return value is the train data and the second is the test data. In order to get this model, call the load_fitted_model() function. Otherwise, returns a single object.
- modify_timestampsbool , default: True
If True, the returned data timestamp column will be for the last 30 days. Otherwise, the data timestamp will be for March 2023.
- data_sizet.Optional[int] , default: 15000
The number of samples to return. If None, returns all the data.
- Returns
- datasetUnion[deepchecks.Dataset, pd.DataFrame]
the data object, corresponding to the data_format attribute.
- train_data, test_dataTuple[Union[deepchecks.Dataset, pd.DataFrame],Union[deepchecks.Dataset, pd.DataFrame]
tuple if as_train_test = True. Tuple of two objects represents the dataset split to train and test sets.