Agentic-loop prd

Brainstorm, harden, and generate an executable PRD for Ralph from a description, idea file, or plan file.

Install

Source · Clone the upstream repo:

git clone https://github.com/allierays/agentic-loop

Claude Code · Install into ~/.claude/skills/:

T=$(mktemp -d) && git clone --depth=1 https://github.com/allierays/agentic-loop "$T" && mkdir -p ~/.claude/skills && cp -r "$T/.claude/skills/prd" ~/.claude/skills/allierays-agentic-loop-prd && rm -rf "$T"

Manifest: .claude/skills/prd/SKILL.md

/prd - Generate PRD for Ralph

Generate executable stories for Ralph's autonomous development loop.

CRITICAL: This command does NOT write code. It produces `.ralph/prd.json` only.

User Input

$ARGUMENTS

Workflow

Step 1: Determine Input Type

If `$ARGUMENTS` is empty:

  1. Scan for existing source files:
    ls docs/ideas/*.md 2>/dev/null || echo "No idea files found"
    ls docs/plans/*.md 2>/dev/null || echo "No plan files found"
    
  2. If source files exist, use AskUserQuestion to let the user pick:
    • Question: "What should I build the PRD from?"
    • Header: "PRD source"
    • Options: List discovered idea/plan files (up to 3-4 most relevant), plus a "Describe a feature" option that says "Type a description directly (e.g., 'Add user logout button')"
    • If no source files found, skip AskUserQuestion and just say: "Describe the feature you'd like to build (e.g., `/prd 'Add user logout button'`)"

If `$ARGUMENTS` looks like a plan file (`plans/` prefix, `docs/plans/` path, or full path to a plan file):

  • If it's a full path, use it directly
  • If it's `plans/name` or just a prefix, look for `docs/plans/{name}.md`
  • Proceed to "Read and Understand the Plan"

If `$ARGUMENTS` looks like an idea file reference (no spaces, matches `docs/ideas/*.md`):

  • If it's a full path, use it directly
  • If it's just a name like `content-engine`, check `docs/ideas/content-engine.md` first, fall back to `docs/plans/content-engine.md`
  • Proceed to "Read and Understand the Idea"

If `$ARGUMENTS` is a description (has spaces, is a sentence):

  • This is the quick PRD flow - no source file created
  • Good for small features that don't need documentation
  • Skip to "Confirm Understanding" below
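
Taken together, a minimal bash sketch of this routing (assuming the raw argument arrives as `$1`; the helper names are illustrative, not part of the skill):

ARG="$1"
if [ -z "$ARG" ]; then
  scan_for_sources                   # empty input: list idea/plan files, AskUserQuestion
elif [ -f "$ARG" ]; then
  use_file "$ARG"                    # full path to an existing file: use directly
elif [[ "$ARG" == plans/* ]]; then
  use_file "docs/${ARG}.md"          # plans/name -> docs/plans/name.md
elif [[ "$ARG" != *" "* ]]; then
  use_file "docs/ideas/${ARG}.md"    # bare name: try docs/ideas/ first, fall back to docs/plans/
else
  quick_prd "$ARG"                   # a sentence: quick PRD flow, no source file
fi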

Step 2a: Read and Understand the Idea (from file)

Read the idea file and summarize:

Say: "I've read `{path}`. Here's my understanding:

Feature: {name}
Problem: {one line}
Solution: {one line}
Scope: {key items}

I'll now ask a few hardening questions before generating stories."

Proceed to Step 2.5.

Step 2b: Confirm Understanding (from description)

If working from a direct description, first explore the codebase briefly:

ls -la src/ app/ 2>/dev/null | head -20
cat package.json 2>/dev/null | jq '{name, dependencies}' || true
cat pyproject.toml 2>/dev/null | head -20 || true

Use the detected tech stack, test runners, and constraints when building each story — these go into each story's `techStack`, `constraints`, and `testing.runner` fields (not at the PRD root).

Then say: "I'll create a PRD for: {description}

Here's what I found in your codebase: [brief summary of tech stack, existing patterns]

I'll now ask a few hardening questions before generating stories."

Proceed to Step 2.5.

Step 2c: Read and Understand the Plan (from plan file)

Read the plan file and summarize:

Say: "I've read `{path}`. Here's my understanding:

Feature/Goal: {name}
Approach: {summary of approach}
Key Files: {files mentioned}
Scope: {key items}

I'll now ask a few hardening questions before generating stories."

Proceed to Step 2.5.

Step 2.5: Harden the Requirements

This step runs for ALL input types (idea file, plan file, or description). Review what you already know from the input and ask ONLY about gaps — skip questions the input already answers.

Say: "Before I generate stories, I want to make sure we've covered the key areas:"

Scope & UX (always ask):

  • What's in scope vs out of scope?
  • Is this user-facing? What does the user see/do?
  • What are the edge cases?
  • Responsive design (if frontend): Must it work on mobile/tablet? What breakpoints? Any layout changes between screen sizes?

Security (ask if feature involves auth, user input, or sensitive data):

  • Authentication: Who can access this? Login required?
  • Passwords: How stored? (must be hashed, never plain text)
  • User input: What validation needed? (SQL injection, XSS)
  • Sensitive data: What should NEVER be in API responses?
  • Rate limiting: Should this be rate limited?

Scale (ask if feature involves lists, data, or APIs):

  • How many items expected? (10s, 1000s, millions?)
  • Pagination needed? What's the max per page?
  • Caching needed? How fresh must data be?
  • Database indexes: What will be queried/sorted frequently?

Migration (ask if feature involves restructuring or moving code):

  • Source → destination mapping: Where does code currently live? Where should it end up?
  • Phases: What's the logical order?
  • Verification: What commands prove each phase worked?

After presenting the hardening questions, use AskUserQuestion:

  • Question: "Answer the questions above, or proceed with sensible defaults?"
  • Header: "Hardening"
  • Options:
    • "Go with defaults" — "Proceed with sensible defaults for unanswered questions"
    • "Let me answer" — "I'll respond to the questions above"
  • If the user selects "Let me answer" or "Other", STOP and wait for their response before continuing.

Step 3: Check for Existing PRD

cat .ralph/prd.json 2>/dev/null

If it exists, read it and say: "`.ralph/prd.json` exists with {N} stories ({M} completed, {P} pending). I'll append new stories to it."

Default behavior is append — just proceed. Do NOT ask for confirmation unless the user explicitly says "overwrite" or "replace".

When appending:

  • Find highest existing story number (ignore prefix - could be US-005 or TASK-005)
  • Always use TASK- prefix for new stories (e.g., if highest is US-005 or TASK-005, new stories start at TASK-006)
  • New stories will be added after existing ones
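
A hedged one-liner for finding the highest existing number (assumes jq is available and story IDs end in digits, per the schema below):

jq -r '.stories[].id' .ralph/prd.json | grep -oE '[0-9]+$' | sort -n | tail -1
# prints e.g. 005 -> new stories start at TASK-006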

Step 3.5: Read Existing Test Infrastructure

Before writing stories, discover the project's existing test setup so stories reference real fixtures, helpers, and patterns:

# Find test config and fixtures
ls tests/conftest.py tests/fixtures/ src/__tests__/ e2e/ 2>/dev/null
cat tests/conftest.py 2>/dev/null | head -50
cat e2e/*.config.ts 2>/dev/null | head -30

# Find existing test patterns
grep -r "def test_\|async def test_\|it(\|describe(" tests/ src/__tests__/ e2e/ 2>/dev/null | head -20

Use what you find to:

  • Reference correct fixture names in story `notes` (e.g., "Use `db_session` and `client` fixtures from `conftest.py`")
  • Match existing test file organization (e.g., `tests/domains/auth/` not `tests/test_auth.py`)
  • Include specific test scenarios in `notes` based on patterns you see in existing tests
  • Reference real helpers (e.g., "Use `MockRequest` from `test_auth.py` for request mocking")

Step 4: Split into Stories

Break the idea into small, executable stories:

  • Each story completable in one Claude session (~10-15 min)
  • Max 3-4 acceptance criteria per story
  • No limit on story count — generate as many stories as the idea needs
  • If appending, start IDs from the next available number
  • Each story must include its own `techStack`, `constraints`, and `contextFiles`. Include only what's relevant to that story — don't copy-paste identical context into every story.
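
For illustration, a story fragment with per-story context (all values are placeholders; the full schema appears later in this document):

{
  "id": "TASK-006",
  "type": "backend",
  "title": "Add logout endpoint",
  "techStack": {"backend": "FastAPI", "database": "PostgreSQL"},
  "constraints": ["Reuse session helpers from src/auth/session.py"],
  "contextFiles": ["docs/ideas/logout.md"]
}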

Step 5: Write Draft PRD

Write the initial PRD to `.ralph/prd.json`:

  1. Ensure .ralph directory exists:

    mkdir -p .ralph && touch .ralph/.prd-edit-allowed
    
  2. Write all stories to `.ralph/prd.json`
    • If appending: Read existing JSON, add new stories, update count

Do not present to user yet - validation comes next.

Step 6: Validate and Fix (MANDATORY)

Read back the PRD you just wrote and validate EVERY story.

cat .ralph/prd.json

For EACH story, check:

6a. Testability

  ❌ `grep -q 'function' file.py` → Only checks code exists, not behavior
  ❌ `test -f src/component.tsx` → Only checks file exists
  ❌ `npm test` alone for backend → Mocks can pass without real behavior
  ✅ `curl ... | jq -e` → Tests actual API response
  ✅ `npx playwright test` → Real browser tests
  ✅ `npx tsc --noEmit` → Real type checking

6b. Dependencies

  • Can this story's tests pass given prior stories completed?
  • If TASK-003 needs a user, does TASK-001/002 create one?

6c. Security (for auth/input stories)

Does acceptanceCriteria include:

  • Password handling → "Passwords hashed with bcrypt (cost 10+)"
  • Auth responses → "Password/tokens NEVER in response body"
  • User input → "Input sanitized to prevent SQL injection/XSS"
  • Login endpoints → "Rate limited to N attempts per minute"
  • Token expiry → "JWT expires after N hours"

6d. Scale (for list/data stories)

Does acceptanceCriteria include:

  • List endpoints → "Returns paginated results (max 100 per page)"
  • Query params → "Accepts ?page=N&limit=N"
  • Large datasets → "Database query uses index on [column]"

6e. Context (for all stories)

  • Does `contextFiles` include the source file (idea or plan file, especially if it has ASCII mockups)?
  • Does `contextFiles` include the styleguide (if one exists)?
  • Does `techStack` include the relevant stack for this story?
  • Does `constraints` include any rules this story must follow?
  • For frontend: Is `testUrl` set?
  • For frontend: Is `mcp` set to `["playwright", "devtools"]`?
  • For frontend: Does `notes` include Playwright MCP visual verification instructions? (See "Playwright MCP for Visual Verification" section below)

6f. E2E Coverage (MANDATORY for user-facing features)

If the feature has ANY frontend stories that add or modify user-facing UI:

  • There MUST be at least one story with `"e2e"` in its `testing.types`
  • That story MUST have Playwright test files in `testing.files.e2e`
  • That story's `testSteps` MUST include `npx playwright test ...`
  • The E2E story should be the LAST story (depends on all others) to test the full integrated flow
  • If no E2E story exists, CREATE one as the final story
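
A sketch of such a final E2E story (IDs and paths are placeholders):

{
  "id": "TASK-009",
  "type": "frontend",
  "title": "E2E: full signup-to-dashboard flow",
  "dependsOn": ["TASK-001", "TASK-002", "TASK-003"],
  "testing": {"types": ["e2e"], "files": {"e2e": ["tests/e2e/signup-flow.spec.ts"]}},
  "testSteps": ["npx playwright test tests/e2e/signup-flow.spec.ts"]
}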

6g. Responsive Design (for frontend stories)

Every frontend story that creates or modifies user-facing UI MUST include:

  • `acceptanceCriteria` with responsive behavior: "Layout adapts to mobile (< 768px), tablet (768-1024px), and desktop (> 1024px)"
  • `testSteps` with a viewport resize check OR a Playwright test that validates the mobile layout
  • `notes` with Playwright MCP instructions to screenshot at mobile and desktop widths

Example acceptanceCriteria:

"Component renders in single-column layout on mobile (< 768px)",
"Navigation collapses to hamburger menu on mobile",
"Touch targets are at least 44x44px on mobile"

Example testSteps:

"npx playwright test tests/e2e/dashboard.spec.ts --project=mobile"

If a frontend story has no responsive criteria and the feature is user-facing, add them.

6h. Test Scenario Specificity

Every story's `notes` field MUST include 3+ specific test scenarios that describe what to test and how. Vague notes like "Test the service methods" are not acceptable.

Good example:

"notes": "Test scenarios: (1) Exchange valid auth code → returns JWT with correct claims. (2) Exchange expired code → returns 401 with 'code_expired' error. (3) Exchange code with wrong redirect_uri → returns 400. (4) Verify nonce mismatch is rejected. Use existing test fixtures: db_session, client from conftest.py."

Bad example:

"notes": "Test the authentication service methods with proper mocking."

Fix any issues you find:

| Problem | Fix |
| --- | --- |
| testSteps use grep/test only | Replace with curl, playwright |
| Backend story has only `npm test` | Add curl commands that hit real endpoints |
| Story depends on something not created | Reorder or add missing dependency |
| Auth story missing security criteria | Add password hashing, rate limiting to acceptanceCriteria |
| List endpoint missing pagination | Add pagination criteria to acceptanceCriteria |
| Frontend missing contextFiles | Add source file (idea or plan) + styleguide paths |
| Frontend missing testUrl | Add URL from config |
| Frontend missing mcp | Add `"mcp": ["playwright", "devtools"]` |
| Frontend notes missing Playwright MCP guidance | Add visual verification instructions to notes (see Playwright MCP section) |
| Story missing techStack | Add relevant subset of detected tech |
| Story missing constraints | Add applicable rules for this story |
| testSteps use import-checks (`python -c "from X import Y"`) | Replace with curl, pytest, or real behavioral tests |
| No E2E story for user-facing feature | Add a final E2E story with Playwright tests |
| Story notes lack specific test scenarios | Add 3+ concrete scenarios with inputs, expected outputs, and fixture references |
| Frontend story missing responsive design | Add mobile/tablet/desktop acceptanceCriteria and viewport test steps |

Step 7: Reorder if Needed

If validation found dependency issues, reorder stories:

  1. Stories that create foundations (DB schemas, base components) come first
  2. Stories that depend on others come after their dependencies
  3. Update `dependsOn` arrays to reflect the order
  4. Re-number story IDs if needed (TASK-001, TASK-002, etc.)

After reordering, re-run Step 6 validation to confirm the new order works.

Step 7b: Assign Batch Numbers

After reordering, assign a `batch` number to every story. Batches group independent stories that could theoretically run in parallel.

  1. Stories with no `dependsOn` → batch 1
  2. Stories whose deps are ALL assigned → batch N+1, where N is the highest batch among its deps
  3. Within each batch level, check `files.create` and `files.modify` overlap — if two stories share any create/modify files, bump one to the next batch
  4. Add `"batch": N` to every story

Example:

TASK-001: batch 1  (no deps)
TASK-002: batch 2  (depends on TASK-001, creates RegisterForm)
TASK-003: batch 2  (depends on TASK-001, modifies src/api/users.ts)

TASK-002 and TASK-003 are in the same batch because they don't depend on each other and don't share create/modify files.
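
A rough Python sketch of rules 1-2 (it assumes stories are already dependency-ordered, as Step 7 guarantees; the file-overlap bump in rule 3 is omitted for brevity):

def assign_batches(stories):
    assigned = {}  # story id -> batch number
    for story in stories:  # stories arrive in dependency order
        deps = story.get("dependsOn", [])
        # no deps: batch 1; otherwise one past the highest dep batch
        story["batch"] = 1 if not deps else max(assigned[d] for d in deps) + 1
        assigned[story["id"]] = story["batch"]
    return stories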

Step 8: Present Final PRD

Open the PRD for review:

open -a TextEdit .ralph/prd.json

Say: "I've {created|updated} the PRD with {N} stories and opened it in TextEdit."

Then use AskUserQuestion with multiSelect: true:

  • Question: "How does the PRD look?"
  • Header: "PRD review"
  • multiSelect: true
  • Options:
    • "Approved" — "PRD is good — ready to run with Ralph"
    • "Edit" — "I'll tell you what to change"
    • "I edited the JSON" — "I made changes directly in the file, re-validate it"

If the user selects "Edit" (with or without other selections), STOP and wait for their changes. If "I edited the JSON" is selected, re-read and re-validate the PRD. If only "Approved" is selected, proceed to Step 9.

Step 9: Final Instructions

Once approved, say:

"PRD is ready!

Source: `{source-file-path}`
PRD: `.ralph/prd.json` ({N} stories)

To start autonomous development, open another terminal and run:

npx agentic-loop run

Ralph will work through each story, running tests and committing as it goes."

DO NOT start implementing code.


Complete PRD JSON Schema

Full working example: See `templates/prd-example.json` for a complete, valid PRD.

{
  "feature": {
    "name": "Feature Name",
    "ideaFile": "docs/ideas/{feature-name}.md or docs/plans/{feature-name}.md",
    "branch": "feature/{feature-name}",
    "status": "pending"
  },

  "metadata": {
    "createdAt": "ISO timestamp",
    "estimatedStories": 5,
    "complexity": "low|medium|high"
  },

  "stories": [
    {
      "id": "TASK-001",
      "type": "frontend|backend",
      "title": "Short description",
      "priority": 1,
      "passes": false,

      "techStack": {
        "frontend": "{detected from package.json}",
        "backend": "{detected from pyproject.toml/go.mod}",
        "database": "{detected or asked}"
      },

      "constraints": [
        "Rules that apply to this story",
        "E.g. Use existing UI components from src/components/ui"
      ],

      "files": {
        "create": ["paths to new files"],
        "modify": ["paths to existing files"],
        "reuse": ["existing files to import from"]
      },

      "acceptanceCriteria": [
        "What it should do"
      ],

      "errorHandling": [
        "What happens when things fail"
      ],

      "testing": {
        "types": ["unit", "integration"],
        "approach": "TDD",
        "runner": "vitest|jest|pytest|go test",
        "files": {
          "unit": ["src/components/Dashboard.test.tsx"],
          "integration": ["tests/integration/dashboard.test.ts"],
          "e2e": ["tests/e2e/dashboard.spec.ts"]
        }
      },

      "testSteps": [
        "curl -s {config.urls.backend}/endpoint | jq -e '.expected == true'",
        "npx playwright test tests/e2e/feature.spec.ts"
      ],

      "testUrl": "{config.urls.frontend}/feature-page",

      "mcp": ["playwright", "devtools"],

      "contextFiles": [
        "docs/ideas/feature.md",
        "src/styles/styleguide.html"
      ],

      "skills": [
        {"name": "styleguide", "usage": "Reference for UI components"},
        {"name": "vibe-check", "usage": "Run after implementation"}
      ],

      "apiContract": {
        "endpoint": "GET /api/resource",
        "response": {"field": "type"}
      },

      "prerequisites": [
        "Backend server running",
        "Database seeded"
      ],

      "notes": "Human guidance - preferences, warnings, tips",

      "scale": "small|medium|large",

      "architecture": {
        "pattern": "React Query for data fetching",
        "constraints": ["No Redux"]
      },

      "dependsOn": [],
      "batch": 1
    }
  ]
}

Field Reference

PRD-Level Fields

| Field | Required | Description |
| --- | --- | --- |
| `feature` | Yes | Feature name, ideaFile (idea or plan path), branch, status |
| `metadata` | Yes | Created date, estimated stories, complexity |

Note: URLs come from `.ralph/config.json`, not the PRD. Use `{config.urls.backend}` in testSteps.
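
For reference, that config is assumed to look something like this (only the `urls` keys matter for the placeholders in this document; the values are illustrative):

{
  "urls": {
    "backend": "http://localhost:8000",
    "frontend": "http://localhost:3000"
  }
}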

Story-Level Fields

Each story is self-contained with all the context it needs. No global defaults — include only what's relevant to that story.

| Field | Required | Description |
| --- | --- | --- |
| `id` | Yes | Unique ID (TASK-001, TASK-002, etc.) |
| `type` | Yes | frontend or backend (keep stories atomic) |
| `title` | Yes | Short description |
| `priority` | No | Order of importance (1 = highest) |
| `passes` | Yes | Always starts as `false` |
| `techStack` | Yes | Technologies relevant to this story (auto-detect from project) |
| `constraints` | No | Rules/constraints specific to this story |
| `files` | Yes | create, modify, reuse arrays |
| `acceptanceCriteria` | Yes | What must be true when done |
| `errorHandling` | Yes | How to handle failures |
| `testing` | Yes | Test types, approach, runner, and files for this story |
| `testSteps` | Yes | Executable shell commands |
| `testUrl` | Frontend | URL to verify the feature |
| `mcp` | Frontend | MCP tools for verification |
| `contextFiles` | No | Files Claude should read (idea files, styleguides) |
| `skills` | No | Relevant skills with usage hints |
| `apiContract` | Backend | Expected request/response format |
| `testUsers` | Deprecated | Use `RALPH_TEST_USER`/`RALPH_TEST_PASSWORD` env vars in `.env` instead |
| `prerequisites` | No | What must be running/ready |
| `notes` | No | Human guidance for Claude |
| `scale` | No | small, medium, large |
| `architecture` | No | Story-specific patterns/constraints |
| `dependsOn` | No | Story IDs that must complete first |
| `batch` | Yes | Batch number for parallel grouping (1 = no deps, higher = later) |

Testing Strategy

Story-Level Testing Config

Each story defines its own testing configuration. Auto-detect tools from project config files and include the relevant runner in each story:

Detection hints:

  • Check `package.json` for `vitest`, `jest`, `playwright`, `cypress`
  • Check `pyproject.toml` for `pytest`
  • Check `go.mod` for Go projects (use `go test`)
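
A hedged detection sketch (assumes jq is available; the pattern lists are illustrative, not exhaustive):

# JS/TS runners, from package.json dependency names
jq -r '(.devDependencies // {}) + (.dependencies // {}) | keys[]' package.json 2>/dev/null | grep -E '^(vitest|jest|@playwright/test|cypress)$'
# Python: pytest mentioned anywhere in pyproject.toml
grep -q pytest pyproject.toml 2>/dev/null && echo "pytest"
# Go projects
test -f go.mod && echo "go test"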

Note: Coverage thresholds (`minimum`, `enforced`) belong in `.ralph/config.json`, not in the PRD.

"testing": {
  "types": ["unit", "integration"],
  "approach": "TDD",
  "runner": "vitest",
  "files": {
    "unit": ["src/components/Dashboard.test.tsx"],
    "integration": ["tests/integration/dashboard.test.ts"],
    "e2e": ["tests/e2e/dashboard.spec.ts"]
  }
}

| Field | Description |
| --- | --- |
| `types` | Required test types: `unit`, `integration`, `e2e` |
| `approach` | TDD or test-after for this story |
| `runner` | Test runner command (vitest, jest, pytest, go test — detected from project) |
| `files.unit` | Unit test files to create |
| `files.integration` | Integration test files to create |
| `files.e2e` | E2E test files to create |

Test Types

| Type | What it Tests | When to Use |
| --- | --- | --- |
| Unit | Individual functions, components in isolation | Always - every new file needs unit tests |
| Integration | How pieces work together (API + DB, Component + Hook) | When story involves multiple modules |
| E2E | Full user flows in browser | User-facing features with interactions |

TDD Workflow

When `approach: "TDD"`:

  1. Write failing test first - Define expected behavior
  2. Implement minimum code - Make the test pass
  3. Refactor - Clean up while tests stay green
  4. Repeat - Next acceptance criterion

Example for a Dashboard component:

1. Write test: "renders user name in header"
2. Run test → FAIL (component doesn't exist)
3. Create Dashboard.tsx with user name
4. Run test → PASS
5. Write test: "shows loading state"
6. Run test → FAIL
7. Add loading state
8. Run test → PASS

Testing Anti-Patterns (AVOID THESE)

Missing integration points:

// ❌ BAD - creates function but doesn't verify callers use it
{
  "files": {"modify": ["graph.py"]},
  "acceptanceCriteria": ["Create stream_agent function"]
}

// ✅ GOOD - verifies the full chain
{
  "files": {"modify": ["graph.py", "service.py"]},
  "acceptanceCriteria": [
    "service.py calls stream_agent() (not run_agent)",
    "POST /chat returns progress SSE events"
  ]
}

(See "The Grep for Code Trap" section above for the #1 anti-pattern)

Removing/Modifying UI - Update Tests!

CRITICAL: When a story removes or modifies UI elements, it MUST update related tests.

Stories that remove UI must include:

{
  "files": {
    "modify": ["src/components/Dashboard.tsx"],
    "delete": ["src/components/SelectionPanel.tsx"]
  },
  "acceptanceCriteria": [
    "Selection panel removed from dashboard",
    "All tests referencing 'Auto-select' button updated or removed"
  ],
  "testSteps": [
    "grep -r 'Auto-select' tests/ && exit 1 || echo 'No stale test references'",
    "npx playwright test tests/e2e/dashboard.spec.ts"
  ]
}

The `grep ... && exit 1` pattern ensures the story fails if stale test references exist.

Acceptance Criteria Rules

  1. Behavior over implementation - Describe what the user/API sees, not what code exists
  2. Verifiable - Each criterion must be testable with a curl, pytest, or playwright
  3. Include callers - If adding a new function, verify callers use it
  4. Update tests - If removing UI, verify no tests reference removed elements
❌ "Use astream_events() for progress"
✅ "POST /chat streams progress events before final response"

❌ "Create stream_agent function"
✅ "service.py send_message_stream() calls stream_agent()"

Integration Test Requirements

Backend stories that modify internal functions MUST have integration tests that verify the API behavior:

# ✅ GOOD - tests actual API behavior
# Assumes pytest-asyncio; client, conv_id, and parse_sse come from the
# project's test fixtures/helpers (e.g., conftest.py).
@pytest.mark.asyncio
async def test_send_message_streams_progress_events():
    """Verify the API actually streams progress events."""
    async with client.stream("POST", f"/chat/{conv_id}/messages",
                             json={"content": "test"}) as response:
        events = [e async for e in parse_sse(response)]
        progress_events = [e for e in events if e["event_type"] == "progress"]
        assert len(progress_events) > 0, "No progress events streamed"

Example Stories by Type

Frontend story:

"testing": {
  "types": ["unit", "e2e"],
  "approach": "TDD",
  "files": {
    "unit": ["src/components/Dashboard.test.tsx"],
    "e2e": ["tests/e2e/dashboard.spec.ts"]
  }
}

Backend API story:

"testing": {
  "types": ["unit", "integration"],
  "approach": "TDD",
  "files": {
    "unit": ["tests/unit/test_stream_agent.py"],
    "integration": ["tests/integration/test_chat_streaming.py"]
  }
},
"acceptanceCriteria": [
  "service.py calls stream_agent() instead of run_agent()",
  "POST /chat/messages returns SSE stream with progress events",
  "Progress events include tool name and status"
],
"testSteps": [
  "pytest tests/integration/test_chat_streaming.py -v",
  "curl -N {config.urls.backend}/chat/1/messages -d '{\"content\":\"test\"}' | grep -q 'progress'"
]

MCP Tools

Specify which MCP tools Claude should use for verification:

| Tool | When to Use |
| --- | --- |
| `playwright` | UI testing, screenshots, form interactions, a11y |
| `devtools` | Console errors, network inspection, DOM debugging |
| `postgres` | Database verification (future) |

Frontend stories MUST have `"mcp": ["playwright", "devtools"]`. Backend-only stories can use `[]` or omit the field.

Playwright MCP for Visual Verification

Frontend stories should include guidance in `notes` for using Playwright MCP during implementation. This is how Ralph visually verifies that UI changes actually render correctly — screenshots catch layout bugs, missing elements, and broken styles that unit tests miss.

Every frontend story's `notes` should include Playwright MCP instructions like:

Use Playwright MCP to verify:
1. Navigate to {testUrl} and take a screenshot
2. Verify [specific element] is visible and correctly styled
3. Click [interactive element] and verify [expected behavior]
4. Check browser console for errors after interactions

Example for a login page SSO button story:

"notes": "Use Playwright MCP to verify: navigate to /login, screenshot the page, confirm 'Sign in with Okta' button is visible below the email/password form with a divider. Click the button and verify it redirects to /api/v1/auth/okta/authorize. Check devtools console for errors."

This is NOT a replacement for automated Playwright tests — it's additional visual verification that Ralph performs during the implementation step using the MCP browser tools.


Skills Reference

Point Claude to relevant skills for guidance:

| Skill | When to Use |
| --- | --- |
| `styleguide` | Frontend stories - reference UI components |
| `vibe-check` | Any story - check for AI anti-patterns after implementation |
| `review` | Security-sensitive stories - OWASP checks |
| `explain` | Complex logic - document decisions |

Example:

"skills": [
  {"name": "styleguide", "usage": "Use existing Card, Button components"},
  {"name": "vibe-check", "usage": "Run after implementation to catch issues"}
]

Test Steps - CRITICAL

⚠️ THE #1 CAUSE OF FALSE PASSES: grep-only test steps that verify code exists but not behavior.

Test steps MUST be executable shell commands. Ralph runs them with bash.

The "Grep for Code" Trap - NEVER DO THIS

// ❌ BAD - This will PASS even when the feature is completely broken!
"testSteps": [
  "grep -q 'astream_events' app/domains/chat/agent/graph.py",
  "grep -q 'export function' src/api/users.ts"
]

// ✅ GOOD - This actually tests if the feature works
"testSteps": [
  "curl -N {config.urls.backend}/chat -d '{\"message\":\"test\"}' | grep -q 'progress'",
  "curl -s {config.urls.backend}/users | jq -e '.data | length >= 0'"
]

Why is grep bad? Ralph runs `grep -q 'function' file.py` → it returns 0 → the story is marked PASSED. But the function could be completely broken, have wrong parameters, or never get called. The test passed but the feature doesn't work.

Backend Stories MUST Have Curl Tests

CRITICAL: Every backend story MUST include curl commands that verify actual API behavior.

Use `{config.urls.backend}` - Ralph expands this from `.ralph/config.json`:

// ✅ REQUIRED for backend stories
"testSteps": [
  "curl -s {config.urls.backend}/users | jq -e '.data | length > 0'",
  "curl -s -X POST {config.urls.backend}/users -d '{\"email\":\"test@test.com\"}' | jq -e '.id'",
  "curl -N {config.urls.backend}/chat/1/messages -d '{\"content\":\"test\"}' | grep -q 'progress'"
]

Ralph reads `.ralph/config.json` and expands `{config.urls.backend}` before running.

Why? Grep tests verify code exists. Curl tests verify the feature works. (See "The Grep for Code Trap" above.)

Test Steps by Story Type

| Story Type | Required testSteps |
| --- | --- |
| backend | curl commands using `{config.urls.backend}` to verify API behavior |
| frontend | `tsc --noEmit` (type errors) + `npm test` (unit) + playwright (e2e) |
| e2e | playwright test commands |

Frontend stories MUST include TypeScript check - curl won't catch type errors:

// ✅ Frontend story testSteps
"testSteps": [
  "npx tsc --noEmit",
  "npm test -- --testPathPattern=Dashboard",
  "npx playwright test tests/e2e/dashboard.spec.ts"
]

Good Test Steps (executable)

// Backend story - use {config.urls.backend}
"testSteps": [
  "curl -s {config.urls.backend}/health | jq -e '.status == \"ok\"'",
  "curl -s -X POST {config.urls.backend}/users -H 'Content-Type: application/json' -d '{\"email\":\"test@example.com\"}' | jq -e '.id'",
  "pytest tests/integration/test_users.py -v"
]

// Frontend story
"testSteps": [
  "npm test -- --testPathPattern=Button.test.tsx",
  "npx tsc --noEmit"
]

// E2E story
"testSteps": [
  "npx playwright test tests/e2e/user-signup.spec.ts"
]

Bad Test Steps (will PASS but miss bugs)

"testSteps": [
  "grep -q 'function createUser' app/services/user.py",  // ❌ PASSES if code exists, even if broken
  "grep -q 'export default' src/components/Dashboard.tsx", // ❌ PASSES even if component crashes
  "test -f src/api/users.ts",                            // ❌ PASSES if file exists, even if empty
  "python -c \"from app.services.auth import AuthService\"", // ❌ PASSES if import works, says nothing about behavior
  "python -c \"hasattr(AuthService, 'login')\"",          // ❌ PASSES if method exists, even if completely broken
  "Visit http://localhost:3000/dashboard",                // ❌ Not executable
  "User can see the dashboard"                            // ❌ Not executable
]

NEVER use import-checks (`python -c "from X import Y"` or `hasattr`) as test steps. These only verify a symbol exists — they don't test behavior, error handling, or integration. A function that raises on every call still passes an import check.

NEVER use grep/test to verify behavior. These will mark stories as PASSED when the feature is broken.

If a step can't be automated, put it in `acceptanceCriteria` instead. Claude will verify it visually using MCP tools.
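
For example (a hypothetical criterion of that kind):

"acceptanceCriteria": [
  "Dashboard shows the user's avatar in the top-right corner (verify visually via Playwright MCP)"
]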


Context Files

Use `contextFiles` to point Claude to important reference material:

"contextFiles": [
  "docs/ideas/dashboard.md",
  "docs/plans/auth-feature.md",
  "src/styles/styleguide.html",
  "docs/api-spec.md"
]

This is where ASCII mockups, design specs, and detailed requirements live. Claude reads these during the Orient step.


Guidelines

  • Keep stories small - Max 3-4 acceptance criteria (~1000 tokens)
  • Order by dependency - Foundation stories first
  • Specify files explicitly - Max 3-4 files per story
  • Define error handling - Every story specifies failure behavior
  • Include contextFiles - Point to idea files with full context (ASCII art, mockups)
  • Add relevant skills - Help Claude find the right patterns

UI Stories Must Include

  • `testUrl` - Where to verify
  • `mcp: ["playwright", "devtools"]` - Browser tools
  • Acceptance criteria for: page loads, elements render correctly
  • Responsive design criteria: layout adapts at mobile (< 768px), tablet (768-1024px), desktop (> 1024px) breakpoints
  • Playwright test or MCP verification at multiple viewport widths

API Stories Must Include

  • `apiContract` - Expected request/response
  • `errorHandling` - What happens on 400, 401, 500, etc.
  • `testSteps` with curl commands to verify endpoints
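
A hedged fragment combining the three (endpoint, fields, and error rules are placeholders; the `request` key is an illustrative addition — the schema above only shows `endpoint` and `response`):

"apiContract": {
  "endpoint": "POST /api/sessions",
  "request": {"email": "string", "password": "string"},
  "response": {"token": "string", "expiresAt": "ISO timestamp"}
},
"errorHandling": [
  "400 when email is malformed",
  "401 when credentials are invalid",
  "500 returns a generic error body with no stack trace"
],
"testSteps": [
  "curl -s -X POST {config.urls.backend}/api/sessions -H 'Content-Type: application/json' -d '{\"email\":\"bad\"}' | jq -e '.error'"
]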