Claude-Code-Workflow team-testing
Unified team skill for testing team. Progressive test coverage through Generator-Critic loops, shared memory, and dynamic layer selection. Triggers on "team testing".
git clone https://github.com/catlog22/Claude-Code-Workflow
T=$(mktemp -d) && git clone --depth=1 https://github.com/catlog22/Claude-Code-Workflow "$T" && mkdir -p ~/.claude/skills && cp -r "$T/.codex/skills/team-testing" ~/.claude/skills/catlog22-claude-code-workflow-team-testing-21a2ef && rm -rf "$T"
.codex/skills/team-testing/SKILL.md
Team Testing
Orchestrate multi-agent test pipeline: strategist -> generator -> executor -> analyst. Progressive layer coverage (L1/L2/L3) with Generator-Critic loops for coverage convergence.
Architecture
    Skill(skill="team-testing", args="task description")
                        |
             SKILL.md (this file) = Router
                        |
         +--------------+--------------+
         |                             |
    no --role flag               --role <name>
         |                             |
    Coordinator                     Worker
    roles/coordinator/role.md    roles/<name>/role.md
         |
         +-- analyze -> dispatch -> spawn workers -> STOP
                                         |
                         +-------+-------+-------+-------+
                         v       v               v       v
                      [strat]  [gen]          [exec]  [analyst]
                 team-worker agents, each loads roles/<role>/role.md
Role Registry
| Role | Path | Prefix | Inner Loop |
|---|---|---|---|
| coordinator | roles/coordinator/role.md | — | — |
| strategist | roles/strategist/role.md | STRATEGY-* | false |
| generator | roles/generator/role.md | TESTGEN-* | true |
| executor | roles/executor/role.md | TESTRUN-* | true |
| analyst | roles/analyst/role.md | TESTANA-* | false |
Role Router
Parse $ARGUMENTS:
- Has --role <name> -> Read roles/<name>/role.md, execute Phase 2-4
- No --role -> roles/coordinator/role.md, execute entry router
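The routing rule above can be sketched as a small function. This is only an illustration, not the skill's actual router: the function name routeRole, the KNOWN_ROLES list (copied from the Role Registry), and the shape of the returned object are assumptions made for the example.

```javascript
// Hypothetical sketch of the --role routing rule; all names are illustrative.
const KNOWN_ROLES = ["coordinator", "strategist", "generator", "executor", "analyst"];

function routeRole(args) {
  const flagIndex = args.indexOf("--role");
  if (flagIndex === -1) {
    // No --role flag: load the coordinator and run its entry router.
    return { role: "coordinator", spec: "roles/coordinator/role.md", mode: "entry-router" };
  }
  const name = args[flagIndex + 1];
  if (!KNOWN_ROLES.includes(name)) {
    // Mirrors the "Unknown --role value" row in Error Handling.
    throw new Error(`Unknown role "${name}". Available: ${KNOWN_ROLES.join(", ")}`);
  }
  // --role <name>: load that role's spec and execute Phase 2-4.
  return { role: name, spec: `roles/${name}/role.md`, mode: "phase-2-4" };
}

console.log(routeRole(["--role", "generator"]).spec); // roles/generator/role.md
console.log(routeRole(["write tests for auth"]).spec); // roles/coordinator/role.md
```

Throwing on an unknown role keeps the failure visible to the caller, which can then print the list of available roles.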
Delegation Lock
Coordinator is a PURE ORCHESTRATOR. It coordinates, it does NOT do.
Before calling ANY tool, apply this check:
| Tool Call | Verdict | Reason |
|---|---|---|
| spawn_agent, wait_agent, close_agent, send_message, followup_task | ALLOWED | Orchestration |
| list_agents | ALLOWED | Agent health check |
| request_user_input | ALLOWED | User interaction |
| mcp__ccw-tools__team_msg | ALLOWED | Message bus |
| on files | ALLOWED | Session state |
| on , , | ALLOWED | Loading own instructions |
| on project source code | BLOCKED | Delegate to worker |
| on any file outside | BLOCKED | Delegate to worker |
| ccw cli | BLOCKED | Only workers call CLI |
| running build/test/lint commands | BLOCKED | Delegate to worker |
If a tool call is BLOCKED: STOP. Create a task, spawn a worker.
No exceptions for "simple" tasks. Even a single-file read-and-report MUST go through spawn_agent.
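One way to picture the check is a small pre-flight function that the coordinator consults before every tool call. The sketch below is a guess at such a check, not the skill's implementation: the function name checkToolCall, the idea of passing a target path, and the exact allowed path prefixes are all assumptions. The orchestration tool names are the ones that appear elsewhere in this document.

```javascript
// Hypothetical pre-flight check for the Delegation Lock.
// Tool names are taken from other sections of this document; the path
// prefixes are assumptions about what "session state" and "own instructions" cover.
const ORCHESTRATION_TOOLS = new Set([
  "spawn_agent", "wait_agent", "close_agent", "send_message", "followup_task",
  "list_agents", "request_user_input", "mcp__ccw-tools__team_msg",
]);
const ALLOWED_PATH_PREFIXES = [".workflow/.team/", "roles/", "specs/"];

function checkToolCall(tool, targetPath = "") {
  if (ORCHESTRATION_TOOLS.has(tool)) return "ALLOWED";
  // File access is assumed to be fine only for session files and role/spec files.
  if (ALLOWED_PATH_PREFIXES.some((prefix) => targetPath.startsWith(prefix))) return "ALLOWED";
  // Everything else (source reads, CLI calls, build/test/lint) goes to a worker.
  return "BLOCKED";
}

console.log(checkToolCall("spawn_agent"));              // ALLOWED
console.log(checkToolCall("read_file", "src/app.ts"));  // BLOCKED ("read_file" is a made-up tool name)
```

A BLOCKED verdict is not an error to recover from locally; per the rule above, it means creating a task and spawning a worker.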
Shared Constants
- Session prefix: TST
- Session path: .workflow/.team/TST-<date>-<slug>/
- Team name: testing
- CLI tools: ccw cli --mode analysis (read-only), ccw cli --mode write (modifications)
- Message bus: mcp__ccw-tools__team_msg(session_id=<session-id>, ...)
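As a rough illustration of how the session path could be assembled from these constants, the sketch below builds a path from a task description. The date format (YYYY-MM-DD in UTC) and the slug rules are assumptions; the document only gives the TST-<date>-<slug> pattern, and the helper names slugify and sessionPath are hypothetical.

```javascript
// Hypothetical helpers; the document specifies only the TST-<date>-<slug> pattern.
const SESSION_PREFIX = "TST";

function slugify(text) {
  // Assumed slug rule: lowercase, runs of non-alphanumerics become "-".
  return text.toLowerCase().replace(/[^a-z0-9]+/g, "-").replace(/^-+|-+$/g, "");
}

function sessionPath(description, date = new Date()) {
  const stamp = date.toISOString().slice(0, 10); // assumed date format: YYYY-MM-DD (UTC)
  return `.workflow/.team/${SESSION_PREFIX}-${stamp}-${slugify(description)}/`;
}

console.log(sessionPath("Add unit tests for auth module", new Date("2025-01-15T00:00:00Z")));
// .workflow/.team/TST-2025-01-15-add-unit-tests-for-auth-module/
```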
Worker Spawn Template
Coordinator spawns workers using this template:
    spawn_agent({
      agent_type: "team_worker",
      task_name: "<task-id>",
      fork_turns: "none",
      message: `## Role Assignment
    role: <role>
    role_spec: <skill_root>/roles/<role>/role.md
    session: <session-folder>
    session_id: <session-id>
    requirement: <task-description>
    inner_loop: <true|false>

    Read role_spec file (<skill_root>/roles/<role>/role.md) to load Phase 2-4 domain instructions.

    ## Task Context
    task_id: <task-id>
    title: <task-title>
    description: <task-description>
    pipeline_phase: <pipeline-phase>

    ## Upstream Context
    <prev_context>`
    })
After spawning, use wait_agent({ timeout_ms: 1800000 }) to collect results. If result.timed_out, send STATUS_CHECK via followup_task (wait 3 min), then FINALIZE with interrupt (wait 3 min), then mark the task timed_out and close the agents. Call close_agent({ target }) for each worker.
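The timeout cascade described above can be sketched as follows. This is an illustration under stated assumptions, not the coordinator's real code: the api parameter is a stand-in for the actual wait_agent, followup_task, and close_agent tools, and the argument shapes passed to followup_task (target, message, interrupt) are guesses rather than documented signatures.

```javascript
// Hypothetical sketch of the timeout cascade. `api` stands in for the real tools;
// the followup_task argument shape is an assumption.
const MAIN_TIMEOUT_MS = 1800000; // 30 min, from the text above
const GRACE_MS = 180000;         // 3 min per escalation step

async function collectWithCascade(api, workers) {
  let result = await api.wait_agent({ timeout_ms: MAIN_TIMEOUT_MS });
  if (result.timed_out) {
    // Step 1: ask for a status report, then wait 3 more minutes.
    for (const target of workers) await api.followup_task({ target, message: "STATUS_CHECK" });
    result = await api.wait_agent({ timeout_ms: GRACE_MS });
  }
  if (result.timed_out) {
    // Step 2: request finalization with interrupt, then wait 3 more minutes.
    for (const target of workers) {
      await api.followup_task({ target, message: "FINALIZE", interrupt: true });
    }
    result = await api.wait_agent({ timeout_ms: GRACE_MS });
  }
  const status = result.timed_out ? "timed_out" : "completed";
  // Close every worker by name regardless of the outcome.
  for (const target of workers) await api.close_agent({ target });
  return status;
}
```

Passing the tools in as an object keeps the sketch testable with stubs; in the real coordinator the calls would be made directly.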
Model Selection Guide
| Role | model | reasoning_effort | Rationale |
|---|---|---|---|
| Strategist (STRATEGY-*) | (default) | high | Test strategy requires deep code understanding |
| Generator (TESTGEN-*) | (default) | high | Test code generation needs precision |
| Executor (TESTRUN-*) | (default) | medium | Running tests and collecting results, less reasoning |
| Analyst (TESTANA-*) | (default) | high | Coverage analysis and quality assessment |
Override model/reasoning_effort in spawn_agent when cost optimization is needed:
    spawn_agent({
      agent_type: "team_worker",
      task_name: "<task-id>",
      fork_turns: "none",
      model: "<model-override>",
      reasoning_effort: "<effort-level>",
      message: "..."
    })
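A small lookup can keep the defaults from the table and the optional overrides in one place. The sketch below is only an illustration: the helper name buildSpawnOptions and the override object are hypothetical, and passing reasoning_effort explicitly on every spawn is an assumption (the worker template earlier in this document omits it).

```javascript
// Defaults copied from the Model Selection Guide; the helper itself is hypothetical.
const REASONING_DEFAULTS = {
  strategist: "high",
  generator: "high",
  executor: "medium",
  analyst: "high",
};

function buildSpawnOptions(role, taskId, message, override = {}) {
  const options = {
    agent_type: "team_worker",
    task_name: taskId,
    fork_turns: "none",
    reasoning_effort: override.reasoning_effort ?? REASONING_DEFAULTS[role],
    message,
  };
  // Only set `model` when overriding; otherwise the default model applies.
  if (override.model) options.model = override.model;
  return options;
}

console.log(buildSpawnOptions("executor", "TESTRUN-001", "...").reasoning_effort); // medium
```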
User Commands
| Command | Action |
|---|---|
| / | View pipeline status graph |
| / | Advance to next step |
| | Revise specific task |
| | Inject feedback for revision |
v4 Agent Coordination
Message Semantics
| Intent | API | Example |
|---|---|---|
| Send strategy to running generators | send_message | Queue test strategy findings to TESTGEN-* workers |
| Not used in this skill | | No resident agents -- all workers are one-shot |
| Check running agents | list_agents | Verify parallel generator/executor health |
Parallel Test Generation
Comprehensive pipeline spawns multiple generators (per layer) and executors in parallel:
    // Spawn parallel generators for L1 and L2
    const genNames = ["TESTGEN-001", "TESTGEN-002"]
    for (const name of genNames) {
      spawn_agent({ agent_type: "team_worker", task_name: name, ... })
    }
    wait_agent({ timeout_ms: 1800000 }) // 30 min; apply timeout cascade if timed_out
GC Loop Coordination
Generator-Critic loops create dynamic TESTGEN-fix and TESTRUN-fix tasks. The coordinator tracks gc_rounds[layer] and creates fix tasks dynamically when coverage is below target.
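The bookkeeping for one GC round might look like the sketch below. It is an assumption-heavy illustration: the round limit MAX_GC_ROUNDS, the function name nextGcAction, and the task-id suffix scheme are all hypothetical. The document itself only names gc_rounds[layer], the TESTGEN-fix and TESTRUN-fix tasks, and the rule that an exceeded loop accepts current coverage with a warning.

```javascript
// Hypothetical GC-loop bookkeeping. MAX_GC_ROUNDS and the task-id suffixes are assumptions.
const MAX_GC_ROUNDS = 3;

function nextGcAction(gcRounds, layer, coverage, target) {
  if (coverage >= target) return { action: "converged", tasks: [] };
  const round = (gcRounds[layer] ?? 0) + 1;
  if (round > MAX_GC_ROUNDS) {
    // Matches the "GC loop exceeded" row: accept current coverage with a warning.
    return { action: "accept_with_warning", tasks: [] };
  }
  gcRounds[layer] = round; // record the round before dispatching fix tasks
  return {
    action: "fix",
    tasks: [`TESTGEN-fix-${layer}-${round}`, `TESTRUN-fix-${layer}-${round}`],
  };
}

const gcRounds = {};
console.log(nextGcAction(gcRounds, "L1", 62, 80).tasks);
// [ 'TESTGEN-fix-L1-1', 'TESTRUN-fix-L1-1' ]
```

Keeping the counter per layer lets L1 converge while L2 is still iterating.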
Agent Health Check
Use list_agents({}) in handleResume and handleComplete:

    // Reconcile session state with actual running agents
    const running = list_agents({})
    // Compare with tasks.json active_agents
    // Reset orphaned tasks (in_progress but agent gone) to pending
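The reconcile step outlined in those comments could be written as a pure function. The sketch below assumes shapes that the document does not specify: that list_agents returns objects with a task_name field and that tasks.json entries carry id and status fields. The function name reconcileTasks is hypothetical.

```javascript
// Hypothetical reconcile step; the agent and task object shapes are assumptions.
function reconcileTasks(tasks, runningAgents) {
  const alive = new Set(runningAgents.map((agent) => agent.task_name));
  return tasks.map((task) =>
    task.status === "in_progress" && !alive.has(task.id)
      ? { ...task, status: "pending" } // orphaned: marked in_progress but its agent is gone
      : task
  );
}

const reconciled = reconcileTasks(
  [
    { id: "TESTGEN-001", status: "in_progress" },
    { id: "TESTRUN-001", status: "in_progress" },
  ],
  [{ task_name: "TESTGEN-001" }]
);
console.log(reconciled.map((t) => `${t.id}:${t.status}`));
// [ 'TESTGEN-001:in_progress', 'TESTRUN-001:pending' ]
```

Returning a new array instead of mutating the input makes it easy to diff the result against tasks.json before writing it back.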
Named Agent Targeting
Workers are spawned with task_name: "<task-id>", enabling direct addressing:
- send_message({ target: "TESTGEN-001", message: "..." }) -- queue strategy context to running generator
- close_agent({ target: "TESTRUN-001" }) -- cleanup by name after wait_agent returns
Completion Action
When pipeline completes, coordinator presents:
    functions.request_user_input({
      questions: [{
        question: "Testing pipeline complete. What would you like to do?",
        header: "Completion",
        multiSelect: false,
        options: [
          { label: "Archive & Clean (Recommended)", description: "Archive session, clean up team" },
          { label: "Keep Active", description: "Keep session for follow-up work" },
          { label: "Deepen Coverage", description: "Add more test layers or increase coverage targets" }
        ]
      }]
    })
Session Directory
    .workflow/.team/TST-<date>-<slug>/
    ├── .msg/messages.jsonl     # Team message bus
    ├── .msg/meta.json          # Session metadata
    ├── wisdom/                 # Cross-task knowledge
    ├── strategy/               # Strategist output
    ├── tests/                  # Generator output (L1-unit/, L2-integration/, L3-e2e/)
    ├── results/                # Executor output
    └── analysis/               # Analyst output
Specs Reference
- specs/pipelines.md — Pipeline definitions and task registry
- specs/team-config.json — Team configuration
Error Handling
| Scenario | Resolution |
|---|---|
| Unknown --role value | Error with available role list |
| Role not found | Error with expected path (roles/<name>/role.md) |
| CLI tool fails | Worker fallback to direct implementation |
| GC loop exceeded | Accept current coverage with warning |
| Fast-advance conflict | Coordinator reconciles on next callback |
| Completion action fails | Default to Keep Active |