DeepchecksLLMClient.execute_app#

async DeepchecksLLMClient.execute_app(app_name: str, dataset_name: str, deployment_name: str, version_name: str | None = None, env_type: str | None = None, additional_headers: Dict[str, str] | None = None, show_progress: bool = True, verify_ssl: bool = True, progress_callback: Callable[[int, int], None] | None = None, sample_callback: Callable | None = None) DatasetRunResult#

Run a dataset on a deployment with parallel execution.

This method fetches the dataset samples and deployment configuration, then runs all samples against the deployment endpoint using the configured parallelism and retry settings.

Note: This is an async method and must be called with await: >>> result = await client.execute_app(“my-app”, “test-dataset”, “prod-deployment”)

Parameters:
app_namestr

Application name

dataset_namestr

Dataset name to run

deployment_namestr

Deployment name to run against

version_namestr, optional

Version name to include in dc_fields

env_typestr, optional

Environment type to include in dc_fields

additional_headersdict, optional

Additional headers to include in requests (beyond deployment headers)

show_progressbool, default=True

Whether to show a live progress widget (if in notebook with ipywidgets installed)

verify_sslbool, default=True

Whether to verify SSL certificates

progress_callbackcallable, optional

Callback function that receives (completed_count, total_count) after each sample completes

sample_callbackcallable, optional

Callback function that receives (sample_id, status, keyword args) for each sample status update. Keyword args may include: input, output, error, duration, retries

Returns:
DatasetRunResult

Results of running the dataset, including success/failure counts, individual sample results, and timing information.

Examples

>>> result = await client.execute_app("my-app", "test-dataset", "prod-deployment")
>>> print(f"Success rate: {result.success_rate:.1f}%")
>>> print(f"Failed samples: {result.failed_samples}")
>>> # Re-run only failed samples
>>> if result.failed_samples > 0:
...     retry_result = await result.rerun_failed()
...     print(f"Retry success rate: {retry_result.success_rate:.1f}%")