Create Eval Run
Experiments
Create Eval Run
Start a new eval run over a batch of calls.
POST
Create Eval Run
Documentation Index
Fetch the complete documentation index at: https://docs.bland.ai/llms.txt
Use this file to discover all available pages before exploring further.
Headers
Your API key for authentication.
Body Parameters
The call IDs to score. Between 1 and 5000 calls.
The eval agents to score with. Maximum 50 agents. Each object:
The scoring modality. One of
text, audio, or full. Defaults to text.Percentage of evaluations that must pass for the run to be marked passing overall, 0-100. Omit to run without a threshold.
UUID of a workbench setup to associate with this run. Must be provided together with
workbench_setup_version_id.UUID of the pinned workbench setup version. The version’s agent roster is used when
attached_agents is omitted. Must be provided together with workbench_setup_id.How this run was initiated. One of
manual, auto, or backfill. Defaults to manual.Key-value metadata to attach to the run. Keys and values must be strings.
You must supply either a non-empty
attached_agents array or a workbench_setup_version_id (whose pinned roster supplies the agents). If you provide workbench_setup_id or workbench_setup_version_id, both fields are required together. A valid billing record must be on file before a run can start.Response
Unique identifier for the newly created eval run.
The organization that owns this run.
Initial status of the run. Will be
PENDING immediately after creation.How the run was triggered.
The scoring modality for this run.
The call IDs submitted for scoring.
The agent attachments submitted with the run. Each object contains
eval_agent_id, eval_agent_version_id, weight, and target_level_keys.The associated workbench setup, if provided.
The pinned workbench setup version, if provided.
Number of calls resolved for the run.
Number of eval agents resolved for the run.
Total call-by-agent evaluations to perform.
Number of evaluations completed so far.
Always
null at creation time.Always
null at creation time.Error code if the run failed, otherwise
null.Human-readable error message if the run failed, otherwise
null.Internal workflow identifier, if applicable.
Key-value metadata attached to the run.
ISO 8601 timestamp when the run was created.
Always
null at creation time.Always
null at creation time.