Agentops using-agentops

Meta skill explaining the AgentOps operating model. Hook-capable runtimes inject it at session start; Codex uses it through the explicit startup fallback. Covers bookkeeping, validation, primitives, flows, the RPI lifecycle, and the skill catalog.

install

source · Clone the upstream repo

git clone https://github.com/boshu2/agentops

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/boshu2/agentops "$T" && mkdir -p ~/.claude/skills && cp -r "$T/cli/embedded/skills/using-agentops" ~/.claude/skills/boshu2-agentops-using-agentops && rm -rf "$T"

manifest: cli/embedded/skills/using-agentops/SKILL.md

AgentOps Operating Model

AgentOps is the operational layer for coding agents.

Publicly, it gives you four things:

Bookkeeping — captured learnings, findings, and reusable context
Validation — plan and code review before work ships
Primitives — single skills, hooks, and CLI surfaces
Flows — named compositions like
```
/research
```
,
```
/validation
```
, and
```
/rpi
```

Technically, AgentOps acts as a context compiler: raw session signal becomes reusable knowledge, compiled prevention, and better next work.

Core Flow: RPI

Research → Plan → Implement → Validate
    ↑                            │
    └──── Knowledge Flywheel ────┘

Research Phase

/research <topic>      # Deep codebase exploration
ao search "<query>"    # Search existing knowledge
ao search "<query>" --cite retrieved  # Record adoption when a search result is reused
ao lookup <id>         # Pull full content of specific learning
ao lookup --query "x"  # Search knowledge by relevance

Output:

.agents/research/<topic>.md

Plan Phase

/pre-mortem <spec>     # Simulate failures (error/rescue map, scope modes, prediction tracking)
/plan <goal>           # Decompose into trackable issues

Output: Beads issues with dependencies

Implement Phase

/implement <issue>     # Single issue execution
/crank <epic>          # Autonomous epic loop (uses swarm for waves)
/swarm                 # Parallel execution (fresh context per agent)

Output: Code changes, tests, documentation

Validate Phase

/vibe [target]         # Code validation (finding classification + suppression + domain checklists)
/post-mortem           # Validation + streak tracking + prediction accuracy + retro history
/retro                 # Quick-capture a single learning

Output:

.agents/learnings/

.agents/patterns/

Phase-to-Skill Mapping

Phase Primary Skill Supporting Skills

Discovery

/discovery

/brainstorm

/research

/plan

/pre-mortem

Implement

/crank

/implement

(single issue),

/swarm

(parallel execution)

Validate

/validation

/vibe

/post-mortem

/retro

/forge

Choosing the skill:

Use
```
/implement
```
for single issue execution. Now defaults to TDD-first — writes failing tests before implementing. Skip with
```
--no-tdd
```
.
Use
```
/crank
```
for autonomous epic execution (loops waves via swarm until done). Auto-generates file-ownership maps to prevent worker conflicts.
Use
```
/discovery
```
for the discovery phase only (brainstorm → search → research → plan → pre-mortem).
Use
```
/validation
```
for the validation phase only (vibe → post-mortem → retro → forge).
Use
```
/rpi
```
for full lifecycle — delegates to
```
/discovery
```
→
```
/crank
```
→
```
/validation
```
.
Use
```
/ratchet
```
to gate/record progress through RPI.

Start Here (12 starters)

These are the skills every user needs first. Everything else is available when you need it.

Skill	Purpose
`/quickstart`	Guided onboarding — run this first
`/bootstrap`	One-command full AgentOps setup — fills gaps only
`/research`	Deep codebase exploration
`/council`	Multi-model consensus review + finding auto-extraction
`/vibe`	Code validation (classification + suppression + domain checklists)
`/rpi`	Full RPI lifecycle orchestrator ( `/discovery` → `/crank` → `/validation` )
`/implement`	Execute single issue
`/retro --quick`	Quick-capture a single learning into the flywheel
`/status`	Single-screen dashboard of current work and suggested next action
`/goals`	Maintain GOALS.yaml fitness specification
`/push`	Atomic test-commit-push workflow
`/flywheel`	Knowledge flywheel health monitoring (σ×ρ > δ/100)

Advanced Skills (when you need them)

Skill	Purpose
`/compile`	Active knowledge intelligence — Mine → Grow → Defrag cycle
`/llm-wiki`	External reading wiki proposal — raw sources to compiled wiki
`/harvest`	Cross-rig knowledge consolidation — sweep, dedup, promote to global hub
`/knowledge-activation`	Operationalize a mature `.agents` corpus into beliefs, playbooks, briefings, and gap surfaces
`/brainstorm`	Structured idea exploration before planning
`/discovery`	Full discovery phase orchestrator (brainstorm → search → research → plan → pre-mortem)
`/plan`	Epic decomposition into issues
`/design`	Product validation gate — goal alignment, persona fit, competitive differentiation
`/pre-mortem`	Failure simulation (error/rescue, scope modes, temporal, predictions)
`/post-mortem`	Validation + streak tracking + prediction accuracy + retro history
`/bug-hunt`	Root cause analysis
`/release`	Pre-flight, changelog, version bumps, tag
`/crank`	Autonomous epic loop (uses swarm for each wave)
`/swarm`	Fresh-context parallel execution (Ralph pattern)
`/evolve`	Goal-driven fitness-scored improvement loop
`/autodev`	PROGRAM.md autonomous development contract setup and validation
`/dream`	Interactive Dream operator surface for setup, bedtime runs, and morning reports
`/doc`	Documentation generation
`/retro`	Quick-capture a learning (full retro → /post-mortem)
`/validation`	Full validation phase orchestrator (vibe → post-mortem → retro → forge)
`/ratchet`	Brownian Ratchet progress gates for RPI workflow
`/forge`	Mine transcripts for knowledge — decisions, learnings, patterns
`/readme`	Generate gold-standard README for any project
`/security`	Continuous repository security scanning and release gating
`/security-suite`	Binary and prompt-surface security suite — static analysis, dynamic tracing, offline redteam, policy gating
`/test`	Test generation, coverage analysis, and TDD workflow
`/red-team`	Persona-based adversarial validation — probe docs and skills from constrained user perspectives
`/review`	Review incoming PRs, agent output, or diffs — SCORED checklist
`/refactor`	Safe, verified refactoring with regression testing at each step
`/deps`	Dependency audit, update, vulnerability scanning, and license compliance
`/perf`	Performance profiling, benchmarking, regression detection, and optimization
`/scaffold`	Project scaffolding, component generation, and boilerplate setup
`/scenario`	Author and manage holdout scenarios for behavioral validation

Expert Skills (specialized workflows)

Skill	Purpose
`/grafana-platform-dashboard`	Build Grafana platform dashboards from templates/contracts
`/codex-team`	Parallel Codex agent execution
`/openai-docs`	Official OpenAI docs lookup with citations
`/oss-docs`	OSS documentation scaffold and audit
`/reverse-engineer-rpi`	Reverse-engineer a product into feature catalog and specs
`/pr-research`	Upstream repository research before contribution
`/pr-plan`	External contribution planning
`/pr-implement`	Fork-based PR implementation
`/pr-validate`	PR-specific validation and isolation checks
`/pr-prep`	PR preparation and structured body generation
`/pr-retro`	Learn from PR outcomes
`/complexity`	Code complexity analysis
`/product`	Interactive PRODUCT.md generation
`/handoff`	Session handoff for continuation
`/recover`	Post-compaction context recovery
`/trace`	Trace design decisions through history
`/provenance`	Trace artifact lineage to sources
`/beads`	Issue tracking operations
`/heal-skill`	Detect and fix skill hygiene issues
`/converter`	Convert skills to Codex/Cursor formats
`/update`	Reinstall all AgentOps skills from latest source

Knowledge Flywheel

Every

/post-mortem

promotes learnings and patterns into

.agents/

so future

/research

starts with better context instead of zero.

Runtime Modes

AgentOps has four runtime modes. Do not assume hook automation exists everywhere.

Mode	When it applies	Start path	Closeout path	Guarantees
`gc`	Gas City ( `gc` ) binary available and `city.toml` present	gc controller manages sessions; `ao rpi` auto-selects gc executor	gc event bus captures phase/gate/failure/metric events	Default when gc is available. Phase execution via gc sessions, events via gc event bus, agent health via gc health patrol
`hook-capable`	Claude/OpenCode with lifecycle hooks installed (no gc)	Runtime hook or `ao inject` / `ao lookup`	Runtime hook or `ao forge transcript` + `ao flywheel close-loop`	Automatic startup/context injection and session-end maintenance when hooks are installed
`codex-native-hooks`	Codex CLI v0.115.0+ with native hook support (March 2026)	Runtime hooks (same as hook-capable)	Runtime hooks (same as hook-capable)	Native lifecycle hooks — same guarantees as hook-capable mode
`codex-hookless-fallback`	Codex Desktop / Codex CLI pre-v0.115.0 without hook surfaces	`ao codex start`	`ao codex stop`	Explicit startup context, citation tracking, transcript fallback, and close-loop metrics without hooks
`manual`	No hooks and no Codex-native runtime detection	`ao inject` / `ao lookup`	`ao forge transcript` + `ao flywheel close-loop`	Works everywhere, but lifecycle actions are operator-driven

Issue Tracking

This workflow uses beads for git-native issue tracking:

bd ready              # Unblocked issues
bd show <id>          # Issue details
bd close <id>         # Close issue
bd vc status          # Inspect Dolt state if needed (JSONL auto-sync is automatic)

Examples

Startup Context Loading

Hook-capable runtimes

```
session-start.sh
```
(or equivalent) can run at session start.
In
```
manual
```
mode, MEMORY.md is auto-loaded and the hook points to on-demand retrieval (
```
ao search
```
,
```
ao lookup
```
).
In
```
lean
```
mode, the hook extracts pending knowledge and injects prior learnings with a reduced token budget.
This skill can be injected automatically into session context.

Codex (v0.115.0+: native hooks, older: hookless fallback)

v0.115.0+: hooks fire automatically — same behavior as hook-capable runtimes above.
Pre-v0.115.0: run
```
ao codex start
```
explicitly, use
```
ao lookup
```
for citations, end with
```
ao codex stop
```
.

Result: The agent gets the RPI workflow, prior context, and a citation path in all modes.

Workflow Reference During Planning

User says: "How should I approach this feature?"

What happens:

Agent references this skill's RPI workflow section
Agent recommends Research → Plan → Implement → Validate phases
Agent suggests
```
/research
```
for codebase exploration,
```
/plan
```
for decomposition
Agent explains
```
/pre-mortem
```
for failure simulation before implementation
User follows recommended workflow with agent guidance

Result: Agent provides structured workflow guidance based on this meta-skill, avoiding ad-hoc approaches.

Troubleshooting

Problem	Cause	Solution
Skill not auto-loaded	Hook runtime unavailable or startup path not run	Hook-capable runtimes: verify `hooks/session-start.sh` exists and is enabled. Codex: run `ao codex start` explicitly
Outdated skill catalog	This file not synced with actual skills/ directory	Update skill list in this file after adding/removing skills
Wrong skill suggested	Natural language trigger ambiguous	User explicitly calls skill with `/skill-name` syntax
Workflow unclear	RPI phases not well-documented here	Read full workflow guide in README.md or docs/ARCHITECTURE.md