Agent-skills score-eval

Name: score-eval
Author: neondatabase

Score the eval diff at $ARGUMENTS against the eval rubric.

install

source · Clone the upstream repo

git clone https://github.com/neondatabase/agent-skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/neondatabase/agent-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/evals/neon-postgres-egress-optimizer/.claude/skills/score-eval" ~/.claude/skills/neondatabase-agent-skills-score-eval && rm -rf "$T"

manifest: evals/neon-postgres-egress-optimizer/.claude/skills/score-eval/SKILL.md

source content

Score the eval diff at $ARGUMENTS against the eval rubric.

Read the diff file at the path provided
Read the eval rubric at eval-rubric.md
Read the original fixture app in fixtures/hono-drizzle-app/ for comparison
For each problem P1-P5, answer the Detected? and Fixed? questions from the rubric as yes or no
Append a row to results.csv — fill in all fields you can determine from the diff and context. Leave fields you can't determine empty.