C.R.I.S.P phase5-prove

The Mileva Method (CRISP) — Phase 5: Prove. Success validation against Phase R baseline metrics. Use after deployment. Triggers on "prove", "phase 5", "validate", "did it work", "success criteria", "measure results", or after go-live.

install
source · Clone the upstream repo
git clone https://github.com/radekamirko/C.R.I.S.P
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/radekamirko/C.R.I.S.P "$T" && mkdir -p ~/.claude/skills && cp -r "$T/.claude/skills/phase5-prove" ~/.claude/skills/radekamirko-c-r-i-s-p-phase5-prove && rm -rf "$T"
manifest: .claude/skills/phase5-prove/SKILL.md
source content

P — Prove: Success Validation

Did the needle move?

Project State

At the start of Phase P: read

docs/crisp-state.json
. Pull
phases.R.baseline
and
phases.R.successTarget
as the measurement baseline. Check
phases.S.sprints
to know what was built.

At the end of Phase P: update

docs/crisp-state.json
:

  • Add
    "P"
    to
    phases.complete
  • Set
    phases.P.complete
    to
    true
  • Set
    phases.P.outcome
    to
    "success"
    ,
    "partial"
    , or
    "fail"
  • Add summary notes to
    phases.P.notes

This phase closes the loop back to Phase R. One question drives everything.


What to Measure

Pull the baseline and targets from

docs/success-metrics.md
and measure against them:

  • Quantitative: Did each metric hit its target? (e.g. "quotes sent in 4min vs 4h before")
  • Qualitative: Re-run the survey or interview from the Phase R baseline frustration score — did it improve?
  • Second-order effects: Did the downstream impacts materialise? (e.g. cart abandonment dropped after checkout speed improved)

Outcomes

Success → document what worked, close the project, capture learnings.

Partial → identify which metrics hit and which didn't. Was the target unrealistic, or did the solution miss?

Fail → return to Phase R. The problem may have been misdiagnosed, or the success criteria were wrong.

In Serbia, when things go sideways, we make jokes. It's not denial — it's perspective. Balkan humor is the ability to look at a disaster and find the absurdity before you find the culprit. Apply that here. If Phase P says fail, laugh once — then go back to Phase R with fresh eyes. A failed outcome is the most honest piece of data you'll collect in the whole project. The problem was misdiagnosed. Now you know exactly where. That's worth something.


Exit Checklist

  • Quantitative metrics measured against baselines in
    docs/success-metrics.md
  • Qualitative scores re-evaluated against Phase R frustration baseline
  • Second-order effects checked
  • Success / partial / fail called explicitly — no ambiguity
  • Learnings documented
  • Client signed off