When to Keep It One Agent
A single agent is the right choice when the process fits these conditions:- One coherent AOP. All steps belong to one logical flow that a single person would carry out end-to-end.
- One primary operator persona. The human approval steps all go to the same role — a single agent handles its own exceptions.
- One integration cluster. The required connections belong to the same system or domain (for example, all Gmail + Google Sheets, or all Salesforce + Slack).
- Short end-to-end latency. The work completes in under five minutes with no waiting stages that could block downstream work.
When to Split — and Which Seam to Use
Three conditions consistently warrant splitting into multiple agents:- Different personas handle different stages. If the first stage requires a procurement manager to approve and the second stage requires a finance analyst to validate, keep the stages in separate agents so each persona manages their own exceptions.
- Different SLAs. High-priority intake should not be slowed down waiting for a batch of low-priority items to clear. Splitting lets each agent run on its own schedule.
- Reusable subroutines. If five agents all need to “look up a customer record and validate their account status,” extract that step into one specialist agent that the others can hand to.
| Hand-off shape | Use this |
|---|---|
| One agent produces items; another processes them asynchronously | Queue (producer/consumer) |
| A running agent needs to escalate or route to a specialist right now | Agent Handover |
| Multiple agents share a reusable step | Agent Handover from each caller |
Three Canonical Patterns
Pattern 1: Pipeline (A → B → C)
Each agent adds context and passes the result to the next stage. Use the Queue producer/consumer seam. When it fits:- Intake, enrichment, and action are conceptually distinct stages
- Each stage has its own AOP author and approval chain
- Stages run at different speeds (intake is fast; enrichment is slow)
- The stages are so tightly coupled that they share state in ways the queue data model can’t represent cleanly
- Total end-to-end latency matters more than separation of concerns
- Stage A — the producer — runs on a trigger (email, webhook, schedule). Its AOP extracts the raw data and adds a case to the queue with the structured fields Stage B needs.
- Stage B — the first consumer — picks up Pending cases, enriches them, and adds enriched cases to a second queue for Stage C.
- Stage C — the final consumer — takes the enriched data and performs the write action (update the ERP, send the email, post the Slack message).
Pattern 2: Fan-Out (A → N parallel B’s)
One agent identifies a batch of items and hands each one to a consumer that processes them in parallel. When it fits:- A large batch arrives on a schedule (end-of-day, end-of-week) and each item can be processed independently
- The items share the same AOP but contain different data
- Throughput matters — sequential processing would take too long
- Items within a batch depend on each other (process item 1 before item 2)
- The write destination has strict rate limits and parallel consumers would breach them
- Producer agent — runs on a schedule or trigger. Queries the source system (database, spreadsheet, API), identifies all items to process, and adds one case per item to the queue.
- Consumer agent — has the case trigger enabled with concurrency set to the appropriate parallel limit. Each case runs its own Run independently.
Pattern 3: Escalation (A handles routine, hands to B for exceptions)
One agent covers the standard path. When a case falls outside the standard, it hands off to a specialist agent rather than failing or requesting a generic HITL. When it fits:- A large volume of cases follow a predictable pattern but a small percentage require specialist judgment
- The specialist agent has access to different connections or a different AOP
- You want clean separation between routine volume and exception handling
- The majority of cases are exceptions — this pattern breaks down if the “routine” path is not actually routine
- The specialist agent and the routine agent are maintained by the same person — in that case, a single agent with HITL is simpler
- Triage agent — processes all incoming cases. For routine cases, completes the action directly. For exceptions, requests a handover to the specialist agent.
- Specialist agent — configured as an allowed handover target. Its AOP handles the narrow class of edge cases the triage agent escalates.
Lineage and Traceability
When a source event travels through two or three agents, you need to be able to follow it from end to end. Two practices make this reliable:Passing a correlation ID
Include a unique identifier for the originating event in every case you add to the queue. Use the same field name across all stages.Reading the case timeline
When a case passes through multiple agents via Handover, the case detail view shows the full timeline: which agent handled each stage, when it started, and what it did. Open a case in your queue and scroll to the timeline to see the complete processing history. For runs in the Runs list, the run output for each stage includes the case ID so you can link stages together.Anti-Patterns
| Anti-pattern | Why it causes problems | What to do instead |
|---|---|---|
| One mega-agent that handles intake, enrichment, action, exception routing, and reporting | Nobody can safely edit it; one AOP change affects every path; test coverage is impossible | Split along the seams above; start with the natural persona or SLA boundaries |
| Two agents that should be one | Adds latency, creates brittle queue serialisation for steps that share tight state, costs more to operate | Merge if they have the same AOP author, same operator, and complete in under five minutes |
| Circular handovers — A hands to B, B hands back to A | Creates infinite loops unless both AOPs have explicit termination conditions | Add a handover_count field to case data; instruct each agent to stop if the count exceeds a threshold |
| Producer faster than consumer | Cases pile up in Pending; queue grows unbounded | Add concurrency to the consumer, or add a cap in the producer’s AOP (e.g., “add at most 100 cases per run”) |
| Implicit failure — an agent ends without completing, postponing, or handing over the case | The case silently lands in Failed; no one knows what happened | Every agent that consumes cases must explicitly complete, postpone, fail with a reason, or hand over |
Worked Example: Complaint Triage
This example shows how to take a Clarity-generated process and split it across three agents.The Clarity output
A Clarity session captured the following steps for handling inbound customer complaints:- Read the incoming email
- Classify: Billing, Technical, or Escalation
- For Billing: look up account, issue credit if under $200, else forward to billing team
- For Technical: create a support ticket, assign to on-call engineer
- For Escalation: forward to account manager with a summary
The decomposition decision
| Step | Agent | Reason |
|---|---|---|
| Steps 1–2 (intake + classify) | Triage | Runs continuously on an email trigger; needs access to Gmail only |
| Step 3 (billing resolution) | Billing Specialist | Needs CRM access and billing system access; different operator persona; low volume |
| Step 4 (technical ticket) | Triage | Simple action; same connections; no specialist judgment needed |
| Step 5 (escalation) | Account Manager Briefing | Different SLA (same-day); needs CRM + Slack; different recipient |
The wiring
- Triage consumes from the “Customer Complaints” queue (case trigger enabled).
- Triage uses Agent Handover to route to @Billing Specialist or @Account Manager Briefing when the case falls outside its path.
- Billing Specialist and Account Manager Briefing are each configured with “Queue (Consumer)” so they appear as valid handover targets.
Lineage
Each case in the “Customer Complaints” queue carriesdata.source_email_id from the originating Gmail message. When a case passes through Handover to the Billing Specialist, the case timeline shows both stages. The Billing Specialist’s AOP includes the original email ID in any CRM record it creates, so the audit trail runs from inbox to CRM record.
Related
- Queue — Producer/consumer setup, case triggers, and concurrency
- Agent Handover — Configuring handover targets in the AOP
- Designing Human-in-the-Loop Workflows — Where to place approval gates within each agent in a multi-stage process
- Guardrails for High-Risk Automations — Risk classification applies per agent, not per pipeline
- Retries, Failures, and Skipped Steps — How individual run failures propagate through multi-assignment flows
- Playbooks — Complaint Triage — A full end-to-end example using the escalation pattern