List Agent Results
curl --request GET \
--url https://api.bland.ai/v1/evals/runs/{run_id}/agent-results \
--header 'authorization: <authorization>'{
"data": {
"object": "list",
"data": [
{
"id": "c2d3e4f5-a6b7-8901-cdef-234567890123",
"org_id": "f6a7b8c9-d0e1-2345-fabc-456789012345",
"eval_run_id": "e5f6a7b8-c9d0-1234-efab-345678901234",
"eval_run_call_result_id": "b1c2d3e4-f5a6-7890-bcde-f12345678901",
"eval_run_agent_snapshot_id": "33333333-aaaa-bbbb-cccc-dddddddddddd",
"eval_agent_id": "11111111-aaaa-bbbb-cccc-dddddddddddd",
"eval_agent_version_id": "22222222-aaaa-bbbb-cccc-dddddddddddd",
"call_id": "a1b2c3d4-0000-0000-0000-000000000001",
"selected_level_key": "pass",
"selected_level_label": "Pass",
"score_normalized_0_100": 88.0,
"is_target_match": true,
"is_insufficient_evidence": false,
"confidence": 0.93,
"reasoning_md": "The agent correctly identified the account issue and offered a resolution within the first two minutes of the call.",
"evidence": [
{
"source": "transcript",
"speaker": "agent",
"start_ms": 45200,
"end_ms": 52100,
"text": "I can see your account has a pending charge from last week. Let me reverse that for you right now."
}
],
"failed": false,
"failure_code": null,
"failure_message": null,
"billable_cost_usd": 0.02,
"audit": {
"prompt_version": "v3.1.0",
"input_tokens": 2840,
"output_tokens": 312,
"latency_ms": 1843
},
"created_at": "2026-05-27T10:00:48.000Z"
}
],
"has_more": false,
"next_cursor": null
},
"errors": null
}
Results
List Agent Results
List every individual judge verdict in an eval run.
GET
/
v1
/
evals
/
runs
/
{run_id}
/
agent-results
List Agent Results
curl --request GET \
--url https://api.bland.ai/v1/evals/runs/{run_id}/agent-results \
--header 'authorization: <authorization>'{
"data": {
"object": "list",
"data": [
{
"id": "c2d3e4f5-a6b7-8901-cdef-234567890123",
"org_id": "f6a7b8c9-d0e1-2345-fabc-456789012345",
"eval_run_id": "e5f6a7b8-c9d0-1234-efab-345678901234",
"eval_run_call_result_id": "b1c2d3e4-f5a6-7890-bcde-f12345678901",
"eval_run_agent_snapshot_id": "33333333-aaaa-bbbb-cccc-dddddddddddd",
"eval_agent_id": "11111111-aaaa-bbbb-cccc-dddddddddddd",
"eval_agent_version_id": "22222222-aaaa-bbbb-cccc-dddddddddddd",
"call_id": "a1b2c3d4-0000-0000-0000-000000000001",
"selected_level_key": "pass",
"selected_level_label": "Pass",
"score_normalized_0_100": 88.0,
"is_target_match": true,
"is_insufficient_evidence": false,
"confidence": 0.93,
"reasoning_md": "The agent correctly identified the account issue and offered a resolution within the first two minutes of the call.",
"evidence": [
{
"source": "transcript",
"speaker": "agent",
"start_ms": 45200,
"end_ms": 52100,
"text": "I can see your account has a pending charge from last week. Let me reverse that for you right now."
}
],
"failed": false,
"failure_code": null,
"failure_message": null,
"billable_cost_usd": 0.02,
"audit": {
"prompt_version": "v3.1.0",
"input_tokens": 2840,
"output_tokens": 312,
"latency_ms": 1843
},
"created_at": "2026-05-27T10:00:48.000Z"
}
],
"has_more": false,
"next_cursor": null
},
"errors": null
}
Documentation Index
Fetch the complete documentation index at: https://docs.bland.ai/llms.txt
Use this file to discover all available pages before exploring further.
Headers
Your API key for authentication.
Path Parameters
The ID of the eval run.
Query Parameters
Number of results to return. Between 1 and 100.
Cursor: return results after this object ID.
Cursor: return results before this object ID. Cannot be combined with
starting_after.Response
Always
"list".The list of agent result objects, one per call-by-agent evaluation.
Show EvalRunAgentResult fields
Show EvalRunAgentResult fields
Unique identifier for this agent result.
The organization that owns this result.
The eval run this result belongs to.
The call result this agent verdict is part of.
The frozen agent snapshot used to produce this verdict.
The eval agent that produced this verdict.
The specific version of the eval agent used.
The call that was scored.
The level key the judge selected, or
null if scoring failed or evidence was insufficient.Human-readable label for the selected level.
Normalized score for this verdict, 0-100.
Whether the selected level is one of the configured target levels.
Whether the judge determined there was not enough evidence to produce a verdict.
Judge confidence in the verdict, 0-1.
Markdown-formatted reasoning from the judge.
Transcript or audio quotes the judge cited as evidence.
Show Evidence quote fields
Show Evidence quote fields
Where the evidence came from. One of
transcript or audio.Who spoke this quote. One of
agent, customer, unknown, or null.Start time of the quote in milliseconds, if available.
End time of the quote in milliseconds, if available.
The quoted text.
Whether this evaluation failed to complete.
Machine-readable failure code, if
failed is true.Human-readable failure description, if
failed is true.Billable cost for this individual verdict in USD.
ISO 8601 timestamp when this result was created.
Whether more results exist beyond this page.
Cursor to use in
starting_after to retrieve the next page, or null if there are no more results.{
"data": {
"object": "list",
"data": [
{
"id": "c2d3e4f5-a6b7-8901-cdef-234567890123",
"org_id": "f6a7b8c9-d0e1-2345-fabc-456789012345",
"eval_run_id": "e5f6a7b8-c9d0-1234-efab-345678901234",
"eval_run_call_result_id": "b1c2d3e4-f5a6-7890-bcde-f12345678901",
"eval_run_agent_snapshot_id": "33333333-aaaa-bbbb-cccc-dddddddddddd",
"eval_agent_id": "11111111-aaaa-bbbb-cccc-dddddddddddd",
"eval_agent_version_id": "22222222-aaaa-bbbb-cccc-dddddddddddd",
"call_id": "a1b2c3d4-0000-0000-0000-000000000001",
"selected_level_key": "pass",
"selected_level_label": "Pass",
"score_normalized_0_100": 88.0,
"is_target_match": true,
"is_insufficient_evidence": false,
"confidence": 0.93,
"reasoning_md": "The agent correctly identified the account issue and offered a resolution within the first two minutes of the call.",
"evidence": [
{
"source": "transcript",
"speaker": "agent",
"start_ms": 45200,
"end_ms": 52100,
"text": "I can see your account has a pending charge from last week. Let me reverse that for you right now."
}
],
"failed": false,
"failure_code": null,
"failure_message": null,
"billable_cost_usd": 0.02,
"audit": {
"prompt_version": "v3.1.0",
"input_tokens": 2840,
"output_tokens": 312,
"latency_ms": 1843
},
"created_at": "2026-05-27T10:00:48.000Z"
}
],
"has_more": false,
"next_cursor": null
},
"errors": null
}
Was this page helpful?
⌘I