git clone https://github.com/Yeachan-Heo/oh-my-codex
T=$(mktemp -d) && git clone --depth=1 https://github.com/Yeachan-Heo/oh-my-codex "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/plan" ~/.claude/skills/yeachan-heo-oh-my-codex-plan && rm -rf "$T"
skills/plan/SKILL.md<Use_When>
- User wants to plan before implementing -- "plan this", "plan the", "let's plan"
- User wants structured requirements gathering for a vague idea
- User wants an existing plan reviewed -- "review this plan",
--review - User wants multi-perspective consensus on a plan --
, "ralplan"--consensus - Task is broad or vague and needs scoping before any code is written </Use_When>
<Do_Not_Use_When>
- User wants autonomous end-to-end execution -- use
insteadautopilot - User wants to start coding immediately with a clear task -- use
or delegate to executorralph - User asks a simple question that can be answered directly -- just answer it
- Task is a single focused fix with obvious scope -- skip planning, just do it </Do_Not_Use_When>
<Why_This_Exists> Jumping into code without understanding requirements leads to rework, scope creep, and missed edge cases. Plan provides structured requirements gathering, expert analysis, and quality-gated plans so that execution starts from a solid foundation. The consensus mode adds multi-perspective validation for high-stakes projects. </Why_This_Exists>
<Execution_Policy>
- Auto-detect interview vs direct mode based on request specificity
- Ask one question at a time during interviews -- never batch multiple questions
- Gather codebase facts via
agent before asking the user about themexplore - When session guidance enables
, preferUSE_OMX_EXPLORE_CMD
for simple read-only repository lookups during planning; keep prompts narrow and concrete, and keep prompt-heavy or ambiguous planning work on the richer normal path and fall back normally ifomx explore
is unavailable.omx explore - Plans must meet quality standards: 80%+ claims cite file/line, 90%+ criteria are testable
- Implementation step count must be right-sized to task scope; avoid defaulting to exactly five steps when the work is clearly smaller or larger
- Consensus mode outputs the final plan by default; add
to enable execution handoff--interactive - Consensus mode uses RALPLAN-DR short mode by default; switch to deliberate mode with
or when the request explicitly signals high risk (auth/security, data migration, destructive/irreversible changes, production incident, compliance/PII, public API breakage)--deliberate - Default to concise, evidence-dense progress and completion reporting unless the user or risk level requires more detail
- Treat newer user task updates as local overrides for the active workflow branch while preserving earlier non-conflicting constraints
- If correctness depends on additional inspection, retrieval, execution, or verification, keep using the relevant tools until the plan is grounded
- Continue through clear, low-risk, reversible next steps automatically; ask only when the next step is materially branching, destructive, or preference-dependent </Execution_Policy>
Mode Selection
| Mode | Trigger | Behavior |
|---|---|---|
| Interview | Default for broad requests | Interactive requirements gathering |
| Direct | , or detailed request | Skip interview, generate plan directly |
| Consensus | , "ralplan" | Planner -> Architect -> Critic loop until agreement with RALPLAN-DR structured deliberation (short by default, for high-risk); outputs plan by default |
| Consensus Interactive | | Same as Consensus but pauses for user feedback at draft and approval steps, then hands off to execution |
| Review | , "review this plan" | Critic evaluation of existing plan |
Interview Mode (broad/vague requests)
- Classify the request: Broad (vague verbs, no specific files, touches 3+ areas) triggers interview mode
- Ask one focused question using
for preferences, scope, and constraintsAskUserQuestion - Gather codebase facts first: Before asking "what patterns does your code use?", spawn an
agent to find out, then ask informed follow-up questionsexplore - Build on answers: Each question builds on the previous answer
- Consult Analyst (THOROUGH tier) for hidden requirements, edge cases, and risks
- Create plan when the user signals readiness: "create the plan", "I'm ready", "make it a work plan"
Direct Mode (detailed requests)
- Quick Analysis: Optional brief Analyst consultation
- Create plan: Generate comprehensive work plan immediately
- Review (optional): Critic review if requested
Consensus Mode (--consensus
/ "ralplan")
--consensusRALPLAN-DR modes: Short (default, bounded structure) and Deliberate (for
--deliberate or explicit high-risk requests). Both modes keep the same Planner -> Architect -> Critic sequence. The workflow auto-proceeds through planning steps (Planner/Architect/Critic) but outputs the final plan without executing.
- Planner creates initial plan and a compact RALPLAN-DR summary before any Architect review. The summary MUST include:
- Principles (3-5)
- Decision Drivers (top 3)
- Viable Options (>=2) with bounded pros/cons for each option
- If only one viable option remains, an explicit invalidation rationale for the alternatives that were rejected
- In deliberate mode: a pre-mortem (3 failure scenarios) and an expanded test plan covering unit / integration / e2e / observability
- User feedback (--interactive only): If running with
, MUST use--interactive
to present the draft plan plus the RALPLAN-DR Principles / Decision Drivers / Options summary for early direction alignment with these options:AskUserQuestion- Proceed to review — send to Architect and Critic for evaluation
- Request changes — return to step 1 with user feedback incorporated
- Skip review — go directly to final approval (step 7)
If NOT running with
, automatically proceed to review (step 3).--interactive
- Architect reviews for architectural soundness using
withask_codex
. Architect review MUST include: strongest steelman counterargument (antithesis) against the favored option, at least one meaningful tradeoff tension, and (when possible) a synthesis path. In deliberate mode, Architect should explicitly flag principle violations. Wait for this step to complete before proceeding to step 4. Do NOT run steps 3 and 4 in parallel.agent_role: "architect" - Critic evaluates against quality criteria using
withask_codex
. Critic MUST verify principle-option consistency, fair alternative exploration, risk mitigation clarity, testable acceptance criteria, and concrete verification steps. Critic MUST explicitly reject shallow alternatives, driver contradictions, vague risks, or weak verification. In deliberate mode, Critic MUST reject missing/weak pre-mortem or missing/weak expanded test plan. Run only after step 3 is complete.agent_role: "critic" - Re-review loop (max 5 iterations): If Critic rejects or iterates, execute this closed loop:
a. Collect all feedback from Architect + Critic
b. Pass feedback to Planner to produce a revised plan
c. Return to Step 3 — Architect reviews the revised plan
d. Return to Step 4 — Critic evaluates the revised plan
e. Repeat until Critic approves OR max 5 iterations reached
f. If max iterations reached without approval, present the best version to user via
with note that expert consensus was not reachedAskUserQuestion - Apply improvements: When reviewers approve with improvement suggestions, merge all accepted improvements into the plan file before proceeding. Final consensus output MUST include an ADR section with: Decision, Drivers, Alternatives considered, Why chosen, Consequences, Follow-ups. Specifically:
a. Collect all improvement suggestions from Architect and Critic responses
b. Deduplicate and categorize the suggestions
c. Update the plan file in
with the accepted improvements (add missing details, refine steps, strengthen acceptance criteria, ADR updates, etc.) d. Note which improvements were applied in a brief changelog section at the end of the plan e. Before any execution handoff, derive an explicit available-agent-types roster from the known prompt catalog and add concrete follow-up staffing guidance for both.omx/plans/
and$ralph
(recommended roles, counts, suggested reasoning levels by lane, and why each lane exists) f. For the$team
path, add an explicit launch-hint block with concrete$team
/omx team
commands and a team verification path (what team proves before shutdown, what Ralph verifies after handoff)$team - On Critic approval (with improvements applied): (--interactive only) If running with
, use--interactive
to present the plan with these options:AskUserQuestion- Approve and execute — proceed to implementation via ralph+ultrawork
- Approve and implement via team — proceed to implementation via coordinated parallel team agents
- Request changes — return to step 1 with user feedback
- Reject — discard the plan entirely
If NOT running with
, output the final approved plan and stop. Do NOT auto-execute.--interactive
- (--interactive only) User chooses via the structured
UI (never ask for approval in plain text)AskUserQuestion - On user approval (--interactive only):
- Approve and execute: MUST invoke
with the approved plan path from$ralph
as context plus the explicit available-agent-types roster, suggested reasoning levels, concrete role allocation guidance, and direct launch hints for Ralph follow-up work. Do NOT implement directly. Do NOT edit source code files in the planning agent. The ralph skill handles execution via ultrawork parallel agents..omx/plans/ - Approve and implement via team: MUST invoke
with the approved plan path from$team
as context plus the explicit available-agent-types roster, suggested reasoning levels, concrete staffing / worker-role allocation guidance, explicit.omx/plans/
/omx team
launch hints, and the team verification path. Do NOT implement directly. The team skill coordinates parallel agents across the staged pipeline for faster execution on large tasks.$team
- Approve and execute: MUST invoke
Review Mode (--review
)
--review- Treat review as a reviewer-only pass. The context that wrote the plan, cleanup proposal, or diff MUST NOT be the context that approves it.
- Read plan file from
.omx/plans/ - Evaluate via Critic using
withask_codexagent_role: "critic" - For cleanup/refactor/anti-slop work, verify that the artifact includes a cleanup plan, regression tests or an explicit test gap, smell-by-smell passes, and quality gates.
- Return verdict: APPROVED, REVISE (with specific feedback), or REJECT (replanning required)
- If the current context authored the artifact, hand the review to
,/review
,critic
,quality-reviewer
, orsecurity-reviewer
as appropriate.verifier
Plan Output Format
Every plan includes:
- Requirements Summary
- Acceptance Criteria (testable)
- Implementation Steps (with file references)
- Adaptive step count sized to the actual scope (not a fixed five-step template)
- Risks and Mitigations
- Verification Steps
- For consensus/ralplan: RALPLAN-DR summary (Principles, Decision Drivers, Options)
- For consensus/ralplan final output: ADR (Decision, Drivers, Alternatives considered, Why chosen, Consequences, Follow-ups)
- For consensus/ralplan execution handoff: Available-Agent-Types Roster, Follow-up Staffing Guidance (including suggested reasoning levels by lane), explicit
/omx team
Launch Hints, and Team Verification Path$team - For deliberate consensus mode: Pre-mortem (3 scenarios) and Expanded Test Plan (unit/integration/e2e/observability)
Plans are saved to
.omx/plans/. Drafts go to .omx/drafts/.
</Steps>
<Tool_Usage>
- Before first MCP tool use, call
to discover deferred MCP toolsToolSearch("mcp") - Use
for preference questions (scope, priority, timeline, risk tolerance) -- provides clickable UIAskUserQuestion - Use plain text for questions needing specific values (port numbers, names, follow-up clarifications)
- Use the
agent (LOW tier, bounded quick pass) to gather codebase facts before asking the userexplore - Use
withask_codex
for planning validation on large-scope plansagent_role: "planner" - Use
withask_codex
for requirements analysisagent_role: "analyst" - Use
withask_codex
for plan review in consensus and review modesagent_role: "critic" - If ToolSearch finds no MCP tools or Codex is unavailable, fall back to equivalent OMX prompt agents -- never block on external tools
- CRITICAL — Consensus mode agent calls MUST be sequential, never parallel. Always await the Architect result before issuing the Critic call.
- In consensus mode, default to RALPLAN-DR short mode; enable deliberate mode on
or explicit high-risk signals (auth/security, migrations, destructive changes, production incidents, compliance/PII, public API breakage)--deliberate - In consensus mode with
: use--interactive
for the user feedback step (step 2) and the final approval step (step 7) -- never ask for approval in plain text. WithoutAskUserQuestion
, auto-proceed through planning steps without pausing. Output the final plan without execution.--interactive - In consensus mode with
, on user approval MUST invoke--interactive
for execution (step 9) -- never implement directly in the planning agent$ralph - In consensus mode, execution follow-up handoff MUST include an explicit available-agent-types roster plus concrete staffing / role-allocation guidance grounded in that roster, suggested reasoning levels by lane, explicit
/omx team
launch hints, and a team verification path </Tool_Usage>$team
Scenario Examples
Good: The user says
continue after the workflow already has a clear next step. Continue the current branch of work instead of restarting or re-asking the same question.
Good: The user changes only the output shape or downstream delivery step (for example
make a PR). Preserve earlier non-conflicting workflow constraints and apply the update locally.
Bad: The user says
continue, and the workflow restarts discovery or stops before the missing verification/evidence is gathered.
<Examples>
<Good>
Adaptive interview (gathering facts before asking):
```
Planner: [spawns explore agent: "find authentication implementation"]
Planner: [receives: "Auth is in src/auth/ using JWT with passport.js"]
Planner: "I see you're using JWT authentication with passport.js in src/auth/.
For this new feature, should we extend the existing auth or add a separate auth flow?"
```
Why good: Answers its own codebase question first, then asks an informed preference question.
</Good>
<Good>
Single question at a time:
```
Q1: "What's the main goal?"
A1: "Improve performance"
Q2: "For performance, what matters more -- latency or throughput?"
A2: "Latency"
Q3: "For latency, are we optimizing for p50 or p99?"
```
Why good: Each question builds on the previous answer. Focused and progressive.
</Good>
<Bad>
Asking about things you could look up:
```
Planner: "Where is authentication implemented in your codebase?"
User: "Uh, somewhere in src/auth I think?"
```
Why bad: The planner should spawn an explore agent to find this, not ask the user.
</Bad>
<Bad>
Batching multiple questions:
```
"What's the scope? And the timeline? And who's the audience?"
```
Why bad: Three questions at once causes shallow answers. Ask one at a time.
</Bad>
<Bad>
Presenting all design options at once:
```
"Here are 4 approaches: Option A... Option B... Option C... Option D... Which do you prefer?"
```
Why bad: Decision fatigue. Present one option with trade-offs, get reaction, then present the next.
</Bad>
</Examples>
<Escalation_And_Stop_Conditions>
- Stop interviewing when requirements are clear enough to plan -- do not over-interview
- In consensus mode, stop after 5 Planner/Architect/Critic iterations and present the best version
- Consensus mode outputs the plan by default; with
, user can approve and hand off to ralph/team--interactive - If the user says "just do it" or "skip planning", MUST invoke
to transition to execution mode. Do NOT implement directly in the planning agent.$ralph - Escalate to the user when there are irreconcilable trade-offs that require a business decision </Escalation_And_Stop_Conditions>
<Final_Checklist>
- Plan has testable acceptance criteria (90%+ concrete)
- Plan references specific files/lines where applicable (80%+ claims)
- All risks have mitigations identified
- No vague terms without metrics ("fast" -> "p99 < 200ms")
- Plan saved to
.omx/plans/ - In consensus mode: RALPLAN-DR summary includes 3-5 principles, top 3 drivers, and >=2 viable options (or explicit invalidation rationale)
- In consensus mode final output: ADR section included (Decision / Drivers / Alternatives considered / Why chosen / Consequences / Follow-ups)
- In deliberate consensus mode: pre-mortem (3 scenarios) + expanded test plan (unit/integration/e2e/observability) included
- In consensus mode with
: user explicitly approved before any execution; without--interactive
: output final plan after Critic approval (no auto-execution) </Final_Checklist>--interactive
When presenting design choices during interviews, chunk them:
- Overview (2-3 sentences)
- Option A with trade-offs
- [Wait for user reaction]
- Option B with trade-offs
- [Wait for user reaction]
- Recommendation (only after options discussed)
Format for each option:
### Option A: [Name] **Approach:** [1 sentence] **Pros:** [bullets] **Cons:** [bullets] What's your reaction to this approach?
Question Classification
Before asking any interview question, classify it:
| Type | Examples | Action |
|---|---|---|
| Codebase Fact | "What patterns exist?", "Where is X?" | Explore first, do not ask user |
| User Preference | "Priority?", "Timeline?" | Ask user via AskUserQuestion |
| Scope Decision | "Include feature Y?" | Ask user |
| Requirement | "Performance constraints?" | Ask user |
Review Quality Criteria
| Criterion | Standard |
|---|---|
| Clarity | 80%+ claims cite file/line |
| Testability | 90%+ criteria are concrete |
| Verification | All file refs exist |
| Specificity | No vague terms |
Deprecation Notice
The separate
/planner, /ralplan, and /review skills have been merged into $plan. All workflows (interview, direct, consensus, review) are available through $plan.
</Advanced>