Get Simulation Set

curl --request GET \
  --url https://api.bland.ai/v1/agent-testing/simulation-sets/{id} \
  --header 'authorization: <authorization>'

{
  "id": "d4e5f6a7-89ab-cdef-0123-4567890abcde",
  "status": "PASSED",
  "simulations_per_scenario": 5,
  "total_scenarios": 2,
  "statistics": {
    "per_scenario": {
      "a1b2c3d4-5678-9abc-def0-1234567890ab": {
        "scenario_id": "a1b2c3d4-5678-9abc-def0-1234567890ab",
        "scenario_name": "Happy Path - Booking",
        "total_runs": 5,
        "passed": 5,
        "failed": 0,
        "pass_rate": 1.0,
        "is_flaky": false,
        "scores": {
          "mean": 0.92,
          "median": 0.93,
          "stddev": 0.03,
          "min": 0.87,
          "max": 0.96
        },
        "worst_run_id": "b8c9d0e1-2345-6789-abcd-ef0123456789",
        "common_failure_modes": [],
        "confidence": "high"
      },
      "f6a7b8c9-0abc-def1-2345-67890abcdef0": {
        "scenario_id": "f6a7b8c9-0abc-def1-2345-67890abcdef0",
        "scenario_name": "Angry Caller Test",
        "total_runs": 5,
        "passed": 3,
        "failed": 2,
        "pass_rate": 0.6,
        "is_flaky": true,
        "scores": {
          "mean": 0.68,
          "median": 0.71,
          "stddev": 0.15,
          "min": 0.42,
          "max": 0.85
        },
        "worst_run_id": "c9d0e1f2-3456-789a-bcde-f01234567890",
        "common_failure_modes": [
          "Agent failed to de-escalate within 3 turns",
          "Agent used dismissive language"
        ],
        "confidence": "medium"
      }
    },
    "overall": {
      "avg_pass_rate": 0.8,
      "flaky_count": 1,
      "reliable_count": 1,
      "total_runs": 10
    }
  },
  "created_at": "2026-04-14T10:00:00.000Z",
  "completed_at": "2026-04-14T10:05:32.000Z"
}

Headers

authorization

string

required

Your API key for authentication.

Path Parameters

id

string

required

The simulation set ID.
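
As a rough illustration of how the header and path parameter fit together, the sketch below builds the same request as the curl sample using only the Python standard library. The helper names `build_request` and `get_simulation_set` and the `timeout_s` parameter are hypothetical, not part of the API.

```python
import json
import urllib.parse
import urllib.request

# Endpoint prefix taken from the curl sample on this page.
BASE_URL = "https://api.bland.ai/v1/agent-testing/simulation-sets"


def build_request(simulation_set_id: str, api_key: str) -> urllib.request.Request:
    """Build the GET request without sending it (hypothetical helper)."""
    # Percent-encode the ID so unexpected characters cannot alter the path.
    url = f"{BASE_URL}/{urllib.parse.quote(simulation_set_id, safe='')}"
    return urllib.request.Request(
        url, headers={"authorization": api_key}, method="GET"
    )


def get_simulation_set(
    simulation_set_id: str, api_key: str, timeout_s: float = 30.0
) -> dict:
    """Send the request and decode the JSON body.

    urllib raises urllib.error.HTTPError for 4xx/5xx responses.
    """
    request = build_request(simulation_set_id, api_key)
    with urllib.request.urlopen(request, timeout=timeout_s) as response:
        return json.load(response)
```

Keeping request construction separate from sending makes the URL and header easy to inspect before any network call is made.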

Response

id

string

Unique identifier for the simulation set.

status

string

Current status of the simulation set. One of PENDING, RUNNING, PASSED, FAILED, or ERROR.
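
Because a set can still be PENDING or RUNNING when it is fetched, a client will usually poll until the status changes. The sketch below assumes that PENDING and RUNNING are the only non-terminal values, which this page implies but does not state. `wait_for_completion`, its parameters, and the polling interval are hypothetical.

```python
import time
from typing import Any, Callable, Dict

# Assumption: these are the only statuses that mean "not finished yet".
NON_TERMINAL_STATUSES = {"PENDING", "RUNNING"}


def wait_for_completion(
    fetch: Callable[[], Dict[str, Any]],
    interval_s: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call `fetch` until the simulation set reaches a terminal status.

    `fetch` is any zero-argument callable returning the decoded response
    body, for example a wrapper around the GET request on this page.
    """
    for attempt in range(max_attempts):
        simulation_set = fetch()
        if simulation_set["status"] not in NON_TERMINAL_STATUSES:
            return simulation_set
        # Do not sleep after the final attempt; fail straight away.
        if attempt < max_attempts - 1:
            sleep(interval_s)
    raise TimeoutError(f"still running after {max_attempts} attempts")
```

Passing `fetch` and `sleep` in as arguments keeps the loop testable without network access or real delays.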

simulations_per_scenario

integer

Number of times each scenario is run.

total_scenarios

integer

Total number of scenarios in this set.

statistics

object

Aggregated statistics for the simulation set.

per_scenario

object

A map of scenario IDs to per-scenario statistics.

scenario_id

string

The scenario ID.

scenario_name

string

The scenario name.

total_runs

integer

Total number of simulation runs for this scenario.

passed

integer

Number of runs that passed.

failed

integer

Number of runs that failed.

pass_rate

number

Pass rate as a decimal between 0 and 1.
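
In the sample response the pass rate matches passed divided by total_runs (5/5 = 1.0 and 3/5 = 0.6). The sketch below recomputes it on that assumption; the exact server-side formula is not documented here, and `compute_pass_rate` is a hypothetical helper.

```python
def compute_pass_rate(passed: int, total_runs: int) -> float:
    """Return passed / total_runs as a decimal between 0 and 1.

    Assumption: this mirrors how the API derives `pass_rate`.
    """
    if total_runs <= 0:
        raise ValueError("total_runs must be positive")
    if not 0 <= passed <= total_runs:
        raise ValueError("passed must be between 0 and total_runs")
    return passed / total_runs


# Values from the "Angry Caller Test" scenario in the sample response.
angry_caller_rate = compute_pass_rate(passed=3, total_runs=5)
```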

is_flaky

boolean

Whether the scenario exhibited flaky behavior (some runs passed, some failed).
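
One way to act on this flag is to list the flaky scenarios and, optionally, cross-check the flag against the pass and fail counts. The sketch below takes the description above literally (at least one pass and at least one failure); the function names are hypothetical, and the API may apply additional criteria.

```python
from typing import Any, Dict, List


def has_mixed_results(stats: Dict[str, Any]) -> bool:
    """True when a scenario has at least one pass and one failure."""
    return stats["passed"] > 0 and stats["failed"] > 0


def flaky_scenario_names(per_scenario: Dict[str, Dict[str, Any]]) -> List[str]:
    """Names of scenarios the API flagged as flaky, in response order."""
    return [
        stats["scenario_name"]
        for stats in per_scenario.values()
        if stats["is_flaky"]
    ]


# Trimmed copy of `statistics.per_scenario` from the sample response.
sample_per_scenario = {
    "a1b2c3d4-5678-9abc-def0-1234567890ab": {
        "scenario_name": "Happy Path - Booking",
        "passed": 5,
        "failed": 0,
        "is_flaky": False,
    },
    "f6a7b8c9-0abc-def1-2345-67890abcdef0": {
        "scenario_name": "Angry Caller Test",
        "passed": 3,
        "failed": 2,
        "is_flaky": True,
    },
}

flaky_names = flaky_scenario_names(sample_per_scenario)
```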

scores

object

Score distribution statistics including mean, median, stddev, min, and max.

worst_run_id

string

The ID of the worst-performing run.
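
When triaging a finished set, worst_run_id can serve as a shortlist of runs to inspect first. The sketch below collects it for every scenario with at least one failure and sorts by minimum score. The helper name and the sort order are illustrative choices, not API behavior.

```python
from typing import Any, Dict, List, Tuple


def worst_runs_to_review(
    per_scenario: Dict[str, Dict[str, Any]],
) -> List[Tuple[str, str, float]]:
    """Return (scenario_name, worst_run_id, min_score), lowest score first.

    Only scenarios with at least one failed run are included.
    """
    rows = [
        (stats["scenario_name"], stats["worst_run_id"], stats["scores"]["min"])
        for stats in per_scenario.values()
        if stats["failed"] > 0
    ]
    return sorted(rows, key=lambda row: row[2])


# Trimmed copy of `statistics.per_scenario` from the sample response.
sample_per_scenario = {
    "a1b2c3d4-5678-9abc-def0-1234567890ab": {
        "scenario_name": "Happy Path - Booking",
        "failed": 0,
        "scores": {"min": 0.87},
        "worst_run_id": "b8c9d0e1-2345-6789-abcd-ef0123456789",
    },
    "f6a7b8c9-0abc-def1-2345-67890abcdef0": {
        "scenario_name": "Angry Caller Test",
        "failed": 2,
        "scores": {"min": 0.42},
        "worst_run_id": "c9d0e1f2-3456-789a-bcde-f01234567890",
    },
}

review_queue = worst_runs_to_review(sample_per_scenario)
```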

common_failure_modes

array of strings

Most common failure reasons observed across runs.
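
Failure modes are reported per scenario, so a set-wide view has to be built on the client. The sketch below counts how many scenarios report each failure string. It assumes identical strings denote the same failure mode; `count_failure_modes` and the sample data are hypothetical.

```python
from collections import Counter
from typing import Any, Dict


def count_failure_modes(per_scenario: Dict[str, Dict[str, Any]]) -> Counter:
    """Count the scenarios in which each failure mode string appears."""
    counts: Counter = Counter()
    for stats in per_scenario.values():
        # A set avoids double-counting a mode repeated within one scenario.
        counts.update(set(stats.get("common_failure_modes", [])))
    return counts


# Illustrative data only; the scenario keys are placeholders.
sample_per_scenario = {
    "scenario-a": {"common_failure_modes": []},
    "scenario-b": {
        "common_failure_modes": [
            "Agent failed to de-escalate within 3 turns",
            "Agent used dismissive language",
        ]
    },
    "scenario-c": {"common_failure_modes": ["Agent used dismissive language"]},
}

failure_counts = count_failure_modes(sample_per_scenario)
top_mode, top_count = failure_counts.most_common(1)[0]
```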

confidence

string

Statistical confidence level. One of low, medium, or high.

overall

object

Overall aggregated statistics across all scenarios.

avg_pass_rate

number

Average pass rate across all scenarios.

flaky_count

integer

Number of scenarios detected as flaky.

reliable_count

integer

Number of scenarios that are reliably passing or failing.

total_runs

integer

Total simulation runs across all scenarios.
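
The overall figures can be approximately reproduced from the per-scenario entries, which is useful as a sanity check. The sketch below rests on two assumptions this page does not confirm: that avg_pass_rate is the unweighted mean of the per-scenario pass rates, and that every scenario not flagged flaky counts as reliable. Both hold for the sample response. `summarize_overall` is a hypothetical helper.

```python
from statistics import mean
from typing import Any, Dict


def summarize_overall(per_scenario: Dict[str, Dict[str, Any]]) -> Dict[str, Any]:
    """Recompute the `overall` block from `per_scenario` (assumed formulas)."""
    stats = list(per_scenario.values())
    if not stats:
        raise ValueError("per_scenario is empty")
    flaky_count = sum(1 for s in stats if s["is_flaky"])
    return {
        # Assumption: unweighted mean across scenarios.
        "avg_pass_rate": mean(s["pass_rate"] for s in stats),
        "flaky_count": flaky_count,
        # Assumption: "reliable" means "not flaky".
        "reliable_count": len(stats) - flaky_count,
        "total_runs": sum(s["total_runs"] for s in stats),
    }


# Trimmed copy of `statistics.per_scenario` from the sample response.
sample_per_scenario = {
    "a1b2c3d4-5678-9abc-def0-1234567890ab": {
        "total_runs": 5,
        "pass_rate": 1.0,
        "is_flaky": False,
    },
    "f6a7b8c9-0abc-def1-2345-67890abcdef0": {
        "total_runs": 5,
        "pass_rate": 0.6,
        "is_flaky": True,
    },
}

overall = summarize_overall(sample_per_scenario)
```

With equal run counts per scenario, as in the sample, the unweighted mean and the pooled rate (total passed over total runs) coincide, so the sample alone cannot distinguish the two definitions.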

created_at

string

ISO 8601 timestamp of when the simulation set was created.

completed_at

string

ISO 8601 timestamp of when the simulation set completed (null if still running).
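
The two timestamps can be combined into a wall-clock duration. The sketch below parses them with the standard library and returns None while completed_at is null. The helper names are hypothetical, and the result covers the whole span from creation to completion, which may include time spent queued.

```python
from datetime import datetime
from typing import Any, Dict, Optional


def parse_timestamp(value: str) -> datetime:
    """Parse a timestamp such as "2026-04-14T10:00:00.000Z"."""
    # datetime.fromisoformat only accepts a trailing "Z" from Python 3.11,
    # so swap it for an explicit UTC offset first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def run_duration_seconds(simulation_set: Dict[str, Any]) -> Optional[float]:
    """Seconds between created_at and completed_at, or None if unfinished."""
    completed_at = simulation_set.get("completed_at")
    if completed_at is None:
        return None
    created = parse_timestamp(simulation_set["created_at"])
    return (parse_timestamp(completed_at) - created).total_seconds()


# Timestamps from the sample response: 5 minutes 32 seconds apart.
duration = run_duration_seconds(
    {
        "created_at": "2026-04-14T10:00:00.000Z",
        "completed_at": "2026-04-14T10:05:32.000Z",
    }
)
```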
