Meta-Skill-Engineering skill-improver
git clone https://github.com/merceralex397-collab/Meta-Skill-Engineering
T=$(mktemp -d) && git clone --depth=1 https://github.com/merceralex397-collab/Meta-Skill-Engineering "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skill-improver" ~/.claude/skills/merceralex397-collab-meta-skill-engineering-skill-improver-40fd3f && rm -rf "$T"
skill-improver/SKILL.mdPurpose
Improve an existing skill package. A skill is a reusable operating manual for an agent. Improving one means improving four things together:
- Routing — whether the right tasks activate it.
- Execution — whether the body gives concrete, repeatable procedure.
- Support layers — whether references, scripts, evals, and manifests exist where they help.
- Maintainability — whether the skill stays narrow and worth activating.
Preserve the skill's core purpose unless the user explicitly asks to reposition it.
When to use
Use when:
- the user provides a SKILL.md and wants it improved,
- the user says a skill feels weak, vague, generic, bloated, or under-specified,
- the user wants better triggering, structure, examples, or supporting files,
- the user wants to know whether a skill needs references/, scripts/, evals/, or manifest,
- the user wants a thin prompt upgraded into a durable skill package.
When NOT to use
Do not use when:
- creating a brand-new skill from scratch — use skill-creator,
- the problem is only the description/trigger and the body is fine — use skill-trigger-optimization,
- porting or adapting a skill to a different stack or context — use skill-adaptation,
- running a quick structural audit with no rewrite planned — use skill-anti-patterns,
- the task is a repo review, architecture review, or product planning exercise.
Improvement modes
Choose the lightest mode that solves the real problem.
Mode selection guide
Choose Mode 1 (Surgical edit) when:
- Specific failure report with reproduction steps
- Single section identified as the problem
- Skill works correctly for most cases
- Fix is isolated (changing one step, tightening one description, adding one failure case)
Choose Mode 2 (Structural refactor) when:
- Multiple anti-patterns detected (3+ from skill-anti-patterns scan)
- Section ordering contradicts logical flow
- Content is scattered across wrong sections (procedure content in Purpose, routing content in Procedure)
- Skill grew organically and needs reorganization
Choose Mode 3 (Package upgrade) when:
- Skill needs new resources (references, scripts, evals) that don't exist yet
- Skill exists as SKILL.md only but complexity justifies support files
- Existing support files are outdated or broken
When in doubt: Start with Mode 1. If Mode 1 changes touch >30% of the file, switch to Mode 2.
Mode 1 — Surgical edit
Use when the skill is broadly sound and mainly needs better trigger wording, clearer steps, stronger output contract, or removal of fluff.
Output: updated SKILL.md, concise change summary, optional eval refresh.
Change summary template:
**Skill**: [name] **Failure mode**: [category from Phase 2 table] **Evidence**: [what went wrong] ### Change **Section**: [which section changed] **Before**: [original text] **After**: [new text] **Rationale**: [why this fixes the failure mode] ### Verification - [ ] Existing evals still pass - [ ] Known good cases still work - [ ] Failure case now handled
Mode 2 — Structural refactor
Use when the skill has the right goal but poor execution: vague description, no phases, no decision rules, no failure handling, too much in one file.
Output: rewritten SKILL.md, new references/ if justified, eval stubs or updated evals.
Mode 3 — Package upgrade
Use when the skill should become a first-class reusable package: shared across projects, needs baseline comparisons, needs scripts for repeated mechanics.
Output: improved SKILL.md, manifest, evals, references, scripts only where deterministic, changelog.
Workflow
Follow these phases in order unless the user clearly wants a lighter pass.
Phase 1 — Understand the current skill
Identify:
- the skill's current purpose and intended user problem,
- the current activation boundary,
- the main failure mode,
- whether the skill is draft, project-local, or intended for reuse.
Read the existing skill package before editing. If the conversation provides enough context, extract it instead of re-asking.
Phase 2 — Diagnose the weakness
Name the primary failure mode before rewriting. Map it to a fix target:
| Failure mode | Primary fix target |
|---|---|
| undertriggering | rewrite description with concrete trigger phrases |
| overtriggering | tighten description, add "do not use" boundaries |
| prompt-blob syndrome | replace prose with concrete procedural steps |
| missing branching | add decision rules for common variants |
| wrong output format | fix output contract or output defaults |
| missing edge case | add case to workflow or operating procedure |
| scope creep | add constraints, narrow boundaries |
| bloat | remove unnecessary steps or content |
| resource mismatch | add or remove references/scripts as evidence warrants |
| no proof | add eval stubs or test prompts |
| package rot | add manifest/changelog/ownership metadata |
Score informally against: routing quality, procedural clarity, decision support, support-layer quality, evaluation readiness, maintainability. See
references/skill-quality-rubric.md for the detailed rubric.
Phase 3 — Decide what to improve
Improve in this priority order:
- Clarify purpose.
- Tighten routing description.
- Add or improve workflow phases.
- Add decision rules for common branches.
- Add output contract and failure handling.
- Decide whether references, scripts, evals, or manifest are warranted.
For support-layer decisions, see
references/resource-decision-guide.md.
For skill-type-specific improvement strategies, see
references/skill-type-playbooks.md.
Phase 4 — Rewrite with restraint
Prefer concrete decisions over generic advice. Explain why where that helps the agent adapt. Do not inflate the skill with policy text that belongs elsewhere.
If the skill spans multiple use cases, add explicit phases, a decision table, or separate mode sections — not one flat list.
Phase 5 — Add support layers only if justified
Add references when optional depth would clutter SKILL.md or the skill covers multiple variants. When extracting large reference blocks from a bloated SKILL.md, follow the classification heuristic and procedure in
references/extraction-guide.md.
Add scripts when a deterministic task recurs and a script reduces context waste.
Add evals when routing quality matters or the user wants evidence the revision helped.
Add manifest/packaging when the skill is meant to persist or be shared.
Do not add scripts or references merely to make the package feel complete.
Phase 6 — Self-review
Before presenting the result, verify:
- Description clearly states what the skill does and when it triggers.
- "Do not use" boundaries exist where confusion with adjacent skills is likely.
- Workflow is ordered and branch-aware.
- At least one stop condition, quality gate, or failure path exists.
- Support files are justified rather than ornamental.
- Original purpose is preserved.
If any check fails, revise.
Output contract
Every improvement produces:
- Updated skill files — the modified SKILL.md and any new/changed support files
- Change summary — for each change: section affected, before text, after text, rationale
- Eval prompts — 2–5 recommended test prompts to verify the improvement
- Next steps — specific follow-up actions (e.g., "run skill-evaluation to verify routing")
## Improvement Report: [skill-name] ### Mode: [Surgical edit | Structural refactor | Package upgrade] ### Primary failure mode: [from Phase 2 table] ### Changes | Section | Before | After | Rationale | |---------|--------|-------|-----------| | ... | ... | ... | ... | ### Files modified - [list of files changed/added/removed] ### Recommended eval prompts 1. [prompt that should trigger the skill] 2. [prompt that should NOT trigger] 3. [edge case prompt] ### Phase 7 — Package the result Return: - the improved files, - a short rationale for each major change, - 2–5 recommended eval prompts for the improved skill, - risks or open questions, only if something could not be settled cleanly. ### Next steps - Run `skill-evaluation` to verify routing accuracy - Run `skill-testing-harness` if no evals/ directory exists
Anti-patterns
Avoid these when improving a skill:
- Placeholder upgrades — replacing one vague paragraph with a longer vague paragraph.
- Monolith inflation — stuffing every idea into SKILL.md instead of using references/.
- Tool cosplay — adding scripts that don't solve a repeated mechanical problem.
- Silent repurposing — changing what the skill is for without telling the user.
- Unverifiable improvement — claiming the skill is better with no eval prompts or review path.
For a full structural anti-pattern catalog, use skill-anti-patterns.
Failure handling
If the skill is too incomplete to improve cleanly:
- Name what is missing.
- Preserve what can be salvaged.
- Produce the lightest viable improved draft.
- Mark assumptions explicitly.
If the user wants a quick pass rather than a full package upgrade, do that. The point is to improve the skill, not force ceremony.
Next steps
After improving a skill:
- Verify the improvement →
skill-evaluation - Build test infrastructure if none exists →
skill-testing-harness - Optimize triggers if routing changed →
skill-trigger-optimization