List Eval Runs
curl --request GET \
--url https://api.bland.ai/v1/evals/runs \
--header 'authorization: <authorization>'{
"data": {
"object": "list",
"data": [
{
"id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
"status": "COMPLETE",
"triggered_by": "manual",
"run_mode": "text",
"resolved_call_count": 50,
"resolved_agent_count": 3,
"resolved_atom_count": 150,
"completed_atom_count": 150,
"summary": {
"overall_score_mean": 82.4,
"text_score_mean": 82.4,
"audio_score_mean": null,
"successful_call_count": 48,
"failed_call_count": 2,
"insufficient_evidence_count": 1,
"target_match_count": 39,
"pass_threshold_pct": 75,
"overall_pass": true
},
"billable_cost_usd": 1.24,
"workbench_setup_id": "b2c3d4e5-f6a7-8901-bcde-f12345678901",
"workbench_setup_version_id": "c3d4e5f6-a7b8-9012-cdef-123456789012",
"metadata": {},
"created_at": "2026-05-20T14:00:00.000Z",
"started_at": "2026-05-20T14:00:05.000Z",
"completed_at": "2026-05-20T14:03:42.000Z"
},
{
"id": "d4e5f6a7-b8c9-0123-defa-234567890123",
"status": "RUNNING",
"triggered_by": "auto",
"run_mode": "full",
"resolved_call_count": 100,
"resolved_agent_count": 2,
"resolved_atom_count": 200,
"completed_atom_count": 87,
"summary": null,
"billable_cost_usd": null,
"workbench_setup_id": null,
"workbench_setup_version_id": null,
"metadata": { "pipeline": "nightly" },
"created_at": "2026-05-27T02:00:00.000Z",
"started_at": "2026-05-27T02:00:08.000Z",
"completed_at": null
}
],
"has_more": false,
"next_cursor": null
},
"errors": null
}
Experiments
List Eval Runs
List eval runs in your organization, with optional filters.
GET
/
v1
/
evals
/
runs
List Eval Runs
curl --request GET \
--url https://api.bland.ai/v1/evals/runs \
--header 'authorization: <authorization>'{
"data": {
"object": "list",
"data": [
{
"id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
"status": "COMPLETE",
"triggered_by": "manual",
"run_mode": "text",
"resolved_call_count": 50,
"resolved_agent_count": 3,
"resolved_atom_count": 150,
"completed_atom_count": 150,
"summary": {
"overall_score_mean": 82.4,
"text_score_mean": 82.4,
"audio_score_mean": null,
"successful_call_count": 48,
"failed_call_count": 2,
"insufficient_evidence_count": 1,
"target_match_count": 39,
"pass_threshold_pct": 75,
"overall_pass": true
},
"billable_cost_usd": 1.24,
"workbench_setup_id": "b2c3d4e5-f6a7-8901-bcde-f12345678901",
"workbench_setup_version_id": "c3d4e5f6-a7b8-9012-cdef-123456789012",
"metadata": {},
"created_at": "2026-05-20T14:00:00.000Z",
"started_at": "2026-05-20T14:00:05.000Z",
"completed_at": "2026-05-20T14:03:42.000Z"
},
{
"id": "d4e5f6a7-b8c9-0123-defa-234567890123",
"status": "RUNNING",
"triggered_by": "auto",
"run_mode": "full",
"resolved_call_count": 100,
"resolved_agent_count": 2,
"resolved_atom_count": 200,
"completed_atom_count": 87,
"summary": null,
"billable_cost_usd": null,
"workbench_setup_id": null,
"workbench_setup_version_id": null,
"metadata": { "pipeline": "nightly" },
"created_at": "2026-05-27T02:00:00.000Z",
"started_at": "2026-05-27T02:00:08.000Z",
"completed_at": null
}
],
"has_more": false,
"next_cursor": null
},
"errors": null
}
Headers
Your API key for authentication.
Query Parameters
Filter to runs that included this eval agent.
Filter to runs started from this workbench setup.
Filter by run status. Possible values:
PENDING, QUEUED, RUNNING, COMPLETE, PARTIAL, FAILED, CANCELLED.Filter by how the run was triggered. Possible values:
manual, auto, backfill.Number of results to return. Between 1 and 100.
Cursor: return results after this object ID.
Cursor: return results before this object ID. Cannot be combined with
starting_after.Response
Always
"list".The list of eval run objects.
Show EvalRun list item fields
Show EvalRun list item fields
Unique identifier for the eval run.
Current status of the run. One of
PENDING, QUEUED, RUNNING, COMPLETE, PARTIAL, FAILED, CANCELLED.How the run was triggered. One of
manual, auto, backfill.The scoring modality used. One of
text, audio, full.Number of calls resolved for the run.
Number of eval agents resolved for the run.
Total number of call-by-agent evaluations in the run.
Number of evaluations that have completed.
Aggregate scoring summary for the run, or
null if not yet available.Show Summary fields
Show Summary fields
Mean overall score across all completed evaluations, normalized 0-100.
Mean text-modality score, normalized 0-100.
Mean audio-modality score, normalized 0-100.
Number of calls that were graded successfully.
Number of calls that failed to grade.
Number of evaluations where evidence was insufficient to produce a verdict.
Number of evaluations where the selected level matched a target level.
The pass threshold percentage configured for the run, 0-100.
Whether the run met its pass threshold, or
null if no threshold was set.Billable cost for this run in USD.
The workbench setup the run was started from, if any.
The pinned workbench setup version used, if any.
Key-value metadata attached to the run.
ISO 8601 timestamp when the run was created.
ISO 8601 timestamp when the run started, or
null if not yet started.ISO 8601 timestamp when the run completed, or
null if not yet complete.Whether more results exist beyond this page.
Cursor to use in
starting_after to retrieve the next page, or null if there are no more results.{
"data": {
"object": "list",
"data": [
{
"id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
"status": "COMPLETE",
"triggered_by": "manual",
"run_mode": "text",
"resolved_call_count": 50,
"resolved_agent_count": 3,
"resolved_atom_count": 150,
"completed_atom_count": 150,
"summary": {
"overall_score_mean": 82.4,
"text_score_mean": 82.4,
"audio_score_mean": null,
"successful_call_count": 48,
"failed_call_count": 2,
"insufficient_evidence_count": 1,
"target_match_count": 39,
"pass_threshold_pct": 75,
"overall_pass": true
},
"billable_cost_usd": 1.24,
"workbench_setup_id": "b2c3d4e5-f6a7-8901-bcde-f12345678901",
"workbench_setup_version_id": "c3d4e5f6-a7b8-9012-cdef-123456789012",
"metadata": {},
"created_at": "2026-05-20T14:00:00.000Z",
"started_at": "2026-05-20T14:00:05.000Z",
"completed_at": "2026-05-20T14:03:42.000Z"
},
{
"id": "d4e5f6a7-b8c9-0123-defa-234567890123",
"status": "RUNNING",
"triggered_by": "auto",
"run_mode": "full",
"resolved_call_count": 100,
"resolved_agent_count": 2,
"resolved_atom_count": 200,
"completed_atom_count": 87,
"summary": null,
"billable_cost_usd": null,
"workbench_setup_id": null,
"workbench_setup_version_id": null,
"metadata": { "pipeline": "nightly" },
"created_at": "2026-05-27T02:00:00.000Z",
"started_at": "2026-05-27T02:00:08.000Z",
"completed_at": null
}
],
"has_more": false,
"next_cursor": null
},
"errors": null
}
Was this page helpful?
⌘I