Accessing the Runs List
Click Past Runs in the left sidebar to open the Runs List. You can also reach it from any agent card by clicking Show past runs in the card’s action menu.What the List Shows
Each row represents one Run and displays:- Agent — the agent that started the run; hover to see the full name if it’s truncated
- Evaluation — for completed runs, an evaluation badge showing how many criteria passed or failed; click the badge to open the full evaluation breakdown
- Run — the run number linked directly to the run detail view; if you sent follow-up messages during the run, a small indicator next to the run number shows how many follow-ups were submitted
- Trigger — how the run was started (manual, scheduled, event-triggered, API, etc.)
- Created — when the run started; dates in the current year omit the year to reduce visual noise
- Updated — when the run last changed, shown only if it differs from the created time
Filtering Runs
Use the filter bar at the top to narrow the list:- Needs attention — quick filter for runs with Failed or Needs Input status
- Issues — quick filter for runs that have evaluation issues flagged
- Agent — filter by a specific agent; agents are grouped by folder if your team uses folders
- Status — filter by Running, Needs Input, Completed, Failed, or Stopped
- Created by — filter by team member (visible to Admins and Managers)
- Trigger — filter by how runs were started (manual, scheduled, API, etc.)
Understanding Evaluations
Duvo automatically scores completed and failed runs against a set of criteria derived from the agent’s SOP. Evaluation results surface in two places:- In the Runs List: a badge on each completed run row showing the pass/fail breakdown at a glance
- In the Agent Builder: the Evaluations tab shows aggregated scores for the last 24 hours or last 7 days, plus any recurring issue alerts
Evaluation Severity
Each failed criterion is assigned a severity level — Critical, Medium, or Low — based on how significantly it affects the run’s outcome. The badge in the Runs List reflects the highest severity found:| Badge color | Meaning |
|---|---|
| Red | At least one Critical issue |
| Yellow | Highest issue is Medium (no Critical issues) |
| Grey | All criteria passed, or only Low-severity issues |
| Dashed grey | Run was stopped before evaluation completed |