Agentic-qe qcsd-production-swarm

Use when assessing post-release production health with DORA metrics, root cause analysis, defect prediction, or cross-phase feedback loops in the QCSD Production phase.

install
source · Clone the upstream repo
git clone https://github.com/proffesor-for-testing/agentic-qe
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/proffesor-for-testing/agentic-qe "$T" && mkdir -p ~/.claude/skills && cp -r "$T/assets/skills/qcsd-production-swarm" ~/.claude/skills/proffesor-for-testing-agentic-qe-qcsd-production-swarm-9186c0 && rm -rf "$T"
manifest: assets/skills/qcsd-production-swarm/SKILL.md
source content

QCSD Production Swarm v1.0

Post-release production health assessment and QCSD feedback loop closure.


Overview

The Production Swarm assesses release health in the live production environment using DORA metrics, incident RCA, defect prediction, and cross-phase feedback loops. It renders a HEALTHY / DEGRADED / CRITICAL decision and is the only QCSD phase with dual responsibility: assessing current production health AND closing the feedback loop back to Ideation and Refinement phases.

QCSD Phase Positioning

PhaseSwarmDecisionWhen
Ideationqcsd-ideation-swarmGO / CONDITIONAL / NO-GOPI/Sprint Planning
Refinementqcsd-refinement-swarmREADY / CONDITIONAL / NOT-READYSprint Refinement
Developmentqcsd-development-swarmSHIP / CONDITIONAL / HOLDDuring Sprint
Verificationqcsd-cicd-swarmRELEASE / REMEDIATE / BLOCKPre-Release / CI-CD
Productionqcsd-production-swarmHEALTHY / DEGRADED / CRITICALPost-Release

Parameters

  • TELEMETRY_DATA
    : Path to production telemetry, incident reports, and DORA metrics (required)
  • RELEASE_ID
    : Release identifier for tracking (optional)
  • OUTPUT_FOLDER
    : Where to save reports (default:
    ${PROJECT_ROOT}/Agentic QCSD/production/
    )
  • SLA_DEFINITIONS
    : Path to SLA/SLO target definitions (optional)

ENFORCEMENT RULES - READ FIRST

RuleEnforcement
E1You MUST spawn ALL THREE core agents in Step 2. No exceptions.
E2You MUST put all parallel Task calls in a SINGLE message.
E3You MUST STOP and WAIT after each batch. No proceeding early.
E4You MUST spawn conditional agents if flags are TRUE. No skipping.
E5You MUST apply HEALTHY/DEGRADED/CRITICAL logic exactly as specified in Step 5.
E6You MUST generate the full report structure. No abbreviated versions.
E7Each agent MUST read its reference files before analysis.
E8You MUST run BOTH feedback agents in Step 8 SEQUENTIALLY. Always. Both agents.
E9You MUST execute Step 7 learning persistence. No skipping.

PROHIBITED BEHAVIORS:

  • Summarizing instead of spawning agents
  • Skipping agents "for brevity"
  • Proceeding before background tasks complete
  • Providing your own analysis instead of spawning specialists
  • Omitting report sections or using placeholder text

Step Execution Protocol

This skill uses a micro-file step architecture. Each step is a self-contained file loaded one at a time to avoid "lost in the middle" context degradation.

Execute steps sequentially by reading each step file with the Read tool.

Steps

  1. Flag Detection --
    steps/01-flag-detection.md
    -- Retrieve CI/CD signals, detect telemetry source, evaluate all 7 flags
  2. Core Agents --
    steps/02-core-agents.md
    -- Spawn qe-metrics-optimizer, qe-defect-predictor, qe-root-cause-analyzer in parallel
  3. Batch 1 Results --
    steps/03-batch1-results.md
    -- Wait for core agents, extract all metrics
  4. Conditional Agents --
    steps/04-conditional-agents.md
    -- Spawn flagged conditional agents in parallel
  5. Decision Synthesis --
    steps/05-decision-synthesis.md
    -- Apply HEALTHY/DEGRADED/CRITICAL logic
  6. Report Generation --
    steps/06-report-generation.md
    -- Generate executive summary and full report
  7. Learning Persistence --
    steps/07-learning-persistence.md
    -- Store findings to memory, save persistence record
  8. Feedback Loop --
    steps/08-feedback-loop.md
    -- Run learning coordinator then transfer specialist (sequential)
  9. Final Output --
    steps/09-final-output.md
    -- Display completion summary with all scores

Execution Instructions

  1. Use the Read tool to load the current step file (e.g.,
    Read({ file_path: ".claude/skills/qcsd-production-swarm/steps/01-flag-detection.md" })
    )
  2. Execute the step's instructions completely
  3. Verify all success criteria are met before proceeding
  4. Pass the step's output as context to the next step
  5. If a step fails, halt and report the failure point -- do not skip ahead

Resume Support

To resume from a specific step: specify

--from-step N
and the orchestrator will skip to step N. Ensure you have the required prerequisite data from prior steps.


Agent Inventory

AgentTypeDomainBatch
qe-metrics-optimizerCore (always)learning-optimization1
qe-defect-predictorCore (always)defect-intelligence1
qe-root-cause-analyzerCore (always)defect-intelligence1
qe-chaos-engineerConditional (HAS_INFRASTRUCTURE_CHANGE)chaos-resilience2
qe-performance-testerConditional (HAS_PERFORMANCE_SLA)chaos-resilience2
qe-regression-analyzerConditional (HAS_REGRESSION_RISK)defect-intelligence2
qe-pattern-learnerConditional (HAS_RECURRING_INCIDENTS)defect-intelligence2
qe-middleware-validatorConditional (HAS_MIDDLEWARE)enterprise-integration2
qe-sap-rfc-testerConditional (HAS_SAP_INTEGRATION)enterprise-integration2
qe-sod-analyzerConditional (HAS_AUTHORIZATION)enterprise-integration2
qe-learning-coordinatorFeedback (always, sequential)learning-optimization3
qe-transfer-specialistFeedback (always, sequential)learning-optimization3

Total: 12 agents (3 core + 7 conditional + 2 feedback)


Quality Gate Thresholds

MetricHEALTHYDEGRADEDCRITICAL
DORA Score>= 0.70.4 - 0.69< 0.4
SLA Compliance>= 99%95 - 98.9%< 95%
Incident SeverityP3/P4/NONEP2P0/P1
Defect Trenddeclining/stablestable (density > 2)increasing + density > 5
RCA Completeness>= 80%50 - 79%< 50%

Report Filename Mapping

AgentReport FilenameStep
qe-metrics-optimizer
02-dora-metrics.md
2
qe-defect-predictor
03-defect-prediction.md
2
qe-root-cause-analyzer
04-root-cause-analysis.md
2
qe-chaos-engineer
05-chaos-resilience.md
4
qe-performance-tester
06-performance-sla.md
4
qe-regression-analyzer
07-regression-analysis.md
4
qe-pattern-learner
08-pattern-analysis.md
4
Learning Persistence
09-learning-persistence.json
7
qe-middleware-validator
10-middleware-health.md
4
qe-sap-rfc-tester
11-sap-health.md
4
qe-sod-analyzer
12-sod-compliance.md
4
Feedback agents
13-feedback-loops.md
8
Synthesis
01-executive-summary.md
6

Execution Model Options

ModelWhen to UseAgent Spawn
Task Tool (PRIMARY)Claude Code sessions
Task({ subagent_type, run_in_background: true })
MCP ToolsMCP server available
fleet_init({})
/
task_submit({})
CLITerminal/scripts
swarm init
/
agent spawn

Key Principle

Production health is measured by outcomes, not intentions. This swarm provides evidence-based production assessment and closes the QCSD feedback loop.