Retrieve detailed results for a specific test run including chat history, assertion results, and scores.
PENDING, RUNNING, PASSED, FAILED, ERROR, CANCELLED.role - Either user (the persona) or assistant (the agent).content - The text spoken in that turn.naturalness - Score (0-1) for how natural the agent sounded.conciseness - Score (0-1) for response brevity and clarity.empathy - Score (0-1) for empathetic responses.flow_preservation - Score (0-1) for maintaining conversation flow.back_channeling - Score (0-1) for appropriate use of acknowledgments.topic_transitions - Score (0-1) for smooth topic changes.overall_tone_score - Aggregate tone score (0-1).issues - Array of identified tone issues.examples - Array of specific examples from the conversation.ERROR, this contains a description of what went wrong. Otherwise null.