Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
17 changes: 12 additions & 5 deletions docs/inspector/test-cases.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -83,14 +83,16 @@ You can run tests in two ways:

## Analyzing Results

### Results & Runs View
### Suite Overview

Click Results & Runs in the sidebar to see overall analytics:

- **Accuracy Donut** - Overall accuracy percentage
- **Accuracy Chart** - Shows pass rates across runs (line connects multiple runs)
- **Performance by Model** - Bar chart comparing models

The suite overview provides three different views to analyze your test results from different angles.

<iframe
className="w-full aspect-video rounded-xl"
src="https://www.youtube.com/embed/iglDMsuTwf4"
Expand All @@ -99,14 +101,19 @@ Click Results & Runs in the sidebar to see overall analytics:
allowFullScreen
></iframe>

Use the dropdown to switch between "Runs" and "Test Cases" views:

**Runs view:**
- **Run History** - Shows all your runs with their metrics (Run ID, Start time, Duration, Passed, Failed, Accuracy, Tokens)
Use the tabs to switch between "Test cases", "Executions", and "Runs" views:

**Test Cases view:**
- **Test Cases Table** - List of all tests with Test Case Name, Iterations, Avg Accuracy, Avg Duration

**Executions view:**
- **Execution Timeline** - Flat list of every test execution across all test cases, sorted by most recent
- Shows test case name, result (passed/failed/pending/cancelled), and timestamp
- Click any execution to view its details in the compare view

**Runs view:**
- **Run History** - Shows all your runs with their metrics (Run ID, Start time, Duration, Passed, Failed, Accuracy, Tokens)

### Run Detail View

When you click on a run:
Expand Down
Loading