Agent-skills score-eval

Score the eval diff at $ARGUMENTS against the eval rubric.

install
source · Clone the upstream repo
git clone https://github.com/neondatabase/agent-skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/neondatabase/agent-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/evals/neon-postgres-egress-optimizer/.claude/skills/score-eval" ~/.claude/skills/neondatabase-agent-skills-score-eval && rm -rf "$T"
manifest: evals/neon-postgres-egress-optimizer/.claude/skills/score-eval/SKILL.md
source content

Score the eval diff at $ARGUMENTS against the eval rubric.

  1. Read the diff file at the path provided in $ARGUMENTS
  2. Read the eval rubric at eval-rubric.md
  3. Read the original fixture app in fixtures/hono-drizzle-app/ for comparison
  4. For each problem P1 through P5, answer the rubric's "Detected?" and "Fixed?" questions with yes or no
  5. Append a row to results.csv. Fill in every field you can determine from the diff and context, and leave the rest empty.
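
The final step can be sketched as a small shell helper. This is only an illustration: the real column layout of results.csv is not shown here, so the layout below (diff path, then a Detected?/Fixed? pair for each of P1 through P5) and the function name `append_result_row` are assumptions, not part of the skill.

```shell
# Hypothetical helper: append one scoring row to a results CSV.
# Assumed column order (not confirmed by the source):
#   diff_path,p1_detected,p1_fixed,...,p5_detected,p5_fixed
# Each answer must be "yes", "no", or empty (field could not be determined).
# Note: diff_path is written as-is, so it must not contain commas or quotes.
append_result_row() {
  local csv="$1" diff_path="$2"
  shift 2

  # Expect exactly ten answers: Detected? and Fixed? for P1 through P5.
  if [ "$#" -ne 10 ]; then
    echo "expected 10 answers, got $#" >&2
    return 1
  fi

  local row="$diff_path" answer
  for answer in "$@"; do
    case "$answer" in
      yes|no|"") row="$row,$answer" ;;
      *) echo "invalid answer: $answer" >&2; return 1 ;;
    esac
  done

  # Append rather than overwrite, so earlier results are preserved.
  printf '%s\n' "$row" >> "$csv"
}
```

Under those assumptions, a call such as `append_result_row results.csv run1.diff yes yes yes no no no "" "" yes yes` would record P4 as undetermined by leaving both of its fields empty, which matches the instruction to leave unknown fields blank.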