Agent Specialization¶
Understanding why Agent Flow uses multiple specialized agents instead of a single general-purpose agent, and how model selection optimizes for both quality and cost.
The Case for Specialization¶
A single general-purpose agent attempting to handle exploration, planning, implementation, review, and verification faces several challenges:
- Context overload: Trying to hold all skills in context dilutes focus
- Tool conflicts: Having all tools available creates temptation to shortcut
- Role confusion: Unclear when to plan vs. implement vs. verify
- No separation of concerns: Self-review is inherently less rigorous
Agent Flow addresses these by creating focused specialists, each with: - A clear role and responsibility - A restricted set of appropriate tools - Specific behavioral guidelines - An optimal model for their task type
Agent Roles¶
Riko - The Explorer¶
Purpose: Fast codebase exploration and information gathering
Why specialized? - Exploration requires breadth-first thinking - Read-only access prevents accidental modifications - Web access enables external documentation lookup - Focus on finding, not implementing
Model: Opus (deep reasoning for unfamiliar codebases)
Tools: Read, Grep, Glob, Bash (AST analysis only), WebSearch, WebFetch
Key behaviors: - Three-tier search: local repository, web, user clarification - Evidence-based findings with file paths and line numbers - Never runs code or modifies files
Senku - The Planner¶
Purpose: Strategic planning and implementation strategy
Why specialized? - Planning requires foresight and risk assessment - Separation from implementation prevents premature coding - TodoWrite creates structured, trackable plans - Focus on strategy, not execution
Model: Opus (strategic planning needs deep reasoning)
Tools: Read, Grep, Glob, TodoWrite
Key behaviors: - Explores codebase to understand existing patterns - Creates step-by-step implementation plans - Identifies risks and edge cases - Never writes code directly
Loid - The Executor¶
Purpose: Code implementation following plans
Why specialized? - Implementation benefits from focused execution - Having Write/Edit tools concentrated in one agent - Quick sanity tests catch immediate issues - Clear handoff points to verification
Model: Sonnet (fast execution with sufficient capability)
Tools: Read, Write, Edit, Bash, Grep, Glob
Key behaviors: - Follows plans precisely - Makes changes incrementally - Runs tests after each change - Reports blockers immediately
Lawliet - The Reviewer¶
Purpose: Code quality assurance through static analysis
Why specialized? - Review requires independent perspective - Separation from implementation enables objectivity - Static analysis focus (no test execution) - Clear verdict for proceed/iterate decision
Model: Sonnet (fast iteration for review cycles)
Tools: Read, Grep, Glob, Bash (static analysis only)
Key behaviors: - Reviews all modified files - Runs type checking and linting - Checks security concerns - Provides actionable feedback
Alphonse - The Verifier¶
Purpose: Comprehensive verification through test execution
Why specialized? - Verification is the final quality gate - Separation from implementation prevents bias - Focus on running commands and reporting results - Clear pass/fail determination
Model: Sonnet (quick command execution)
Tools: Bash, Read, Grep
Key behaviors: - Runs full test suite - Verifies types, lint, build - Reports exact command output - Never modifies code
Model Selection Strategy¶
Agent Flow uses two model tiers optimized for different task types:
flowchart TB
subgraph Opus["Opus - Strategic Tasks"]
direction LR
O1["Deep Reasoning"]
O2["Complex Analysis"]
O3["Unfamiliar Code"]
O4["Strategic Planning"]
end
subgraph Sonnet["Sonnet - Execution Tasks"]
direction LR
S1["Well-Defined Tasks"]
S2["Fast Iteration"]
S3["Clear Criteria"]
S4["Command Execution"]
end
Opus --> R[Riko] & SK[Senku]
Sonnet --> L[Loid] & LW[Lawliet] & A[Alphonse]
When Opus Excels¶
Deep reasoning required: - Exploring unfamiliar codebases - Understanding complex architectures - Making strategic decisions - Analyzing ambiguous requirements
Cost justified by: - Reduced iterations from better understanding - Fewer mistakes in planning - More thorough exploration
When Sonnet Excels¶
Fast execution required: - Implementing well-defined plans - Running verification commands - Iterating on review feedback - Executing repetitive operations
Cost optimized by: - Speed for iteration cycles - Sufficient capability for clear tasks - Lower cost per operation
Model Assignment Rationale¶
| Agent | Model | Rationale |
|---|---|---|
| Riko | Opus | Exploration of unfamiliar code needs thorough analysis |
| Senku | Opus | Strategic planning requires deep reasoning for task decomposition |
| Loid | Sonnet | Implementation benefits from speed with sufficient capability |
| Lawliet | Sonnet | Review cycles need fast iteration for feedback loops |
| Alphonse | Sonnet | Verification is command-focused with clear pass/fail criteria |
Tool Access Matrix¶
Each agent has a restricted toolset matching their responsibilities:
Agent Read Grep Glob Write Edit Bash Web Todo
----------- ---- ---- ---- ----- ---- ---- --- ----
Riko Yes Yes Yes - - Yes* Yes -
Senku Yes Yes Yes - - - - Yes
Loid Yes Yes Yes Yes Yes Yes - -
Lawliet Yes Yes Yes - - Yes** - -
Alphonse Yes Yes - - - Yes - -
* Riko's Bash is restricted to AST analysis tools only (ast-grep, tree-sitter, language parsers)
** Lawliet's Bash is restricted to static analysis tools
Key Restrictions¶
- Only Loid can modify files (Write, Edit)
- Prevents accidental changes during exploration
- Ensures implementation follows plans
-
Creates clear audit trail
-
Only Riko can access the web (WebSearch, WebFetch)
- Focuses web usage on research
- Prevents implementation shortcuts
-
Maintains codebase focus
-
Only Senku can manage tasks (TodoWrite)
- Centralizes planning authority
- Creates structured implementation paths
- Prevents ad-hoc task creation
Specialization Benefits¶
Clear Responsibility¶
Each agent knows exactly what they should and should not do:
Riko: "I find things. I don't change things."
Senku: "I plan things. I don't implement things."
Loid: "I implement things. I don't plan things."
Lawliet: "I review things. I don't fix things."
Alphonse: "I verify things. I don't modify things."
Separation of Concerns¶
The agent pipeline creates natural checkpoints:
Exploration -> Planning -> Implementation -> Review -> Verification
| | | | |
Context Strategy Code Quality Evidence
gathering design changes assurance confirmation
Independent Verification¶
Because verification agents didn't write the code: - No bias toward "my code works" - Fresh perspective on quality - Objective pass/fail determination
Extending with New Agents¶
When adding new agents, consider:
- What is their specific responsibility?
- Single, focused purpose
-
Clear boundaries with existing agents
-
What tools do they need?
- Minimum necessary for their role
-
Avoid overlapping capabilities
-
Which model tier?
- Strategic/planning tasks -> Opus
-
Execution/verification tasks -> Sonnet
-
How do they fit the workflow?
- Which phase do they participate in?
- What handoffs do they receive/provide?
See Adding Agents Guide for implementation details.
Related Concepts¶
- The "Subagents LIE" Principle - Why verification is separate
- Evidence-Based Verification - What constitutes proof
- Architecture Overview - System design
- Agents Reference - Complete specifications