install
source · Clone the upstream repo
git clone https://github.com/SKY-lv/agent-evaluator
Claude Code · Install into ~/.claude/skills/
git clone --depth=1 https://github.com/SKY-lv/agent-evaluator ~/.claude/skills/sky-lv-agent-evaluator-agent-evaluator
manifest: SKILL.md (source content)
skylv-agent-evaluator
Agent behavior evaluation engine. 5 criteria, scoring, improvement suggestions.
Skill Metadata
- Slug: skylv-agent-evaluator
- Version: 1.0.0
- Description: Evaluate AI agent actions and outputs against 5 criteria: accuracy, efficiency, clarity, safety, helpfulness. Score 0-100 with letter grade and improvement suggestions.
- Category: agent
- Trigger Keywords: evaluate, assess, score, agent quality, behavior check
Evaluation Criteria (5)
| Criterion | Weight | Description |
|---|---|---|
| Accuracy | 25% | Correctness of information and actions |
| Efficiency | 20% | Time and resource usage |
| Clarity | 15% | Clear communication and reasoning |
| Safety | 20% | No harmful or dangerous actions |
| Helpfulness | 20% | Value provided to user |
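The weights in the table sum to 100%, which suggests the overall score is a weighted average of the per-criterion scores. The sketch below assumes exactly that, with each criterion scored on a 0-100 scale. The names `WEIGHTS` and `weighted_score` are hypothetical and do not come from the skill itself, and the skill's actual aggregation may differ.

```python
from typing import Mapping

# Weights (in percent) copied from the criteria table above.
# Everything else in this block is an illustrative assumption.
WEIGHTS = {
    "accuracy": 25,
    "efficiency": 20,
    "clarity": 15,
    "safety": 20,
    "helpfulness": 20,
}


def weighted_score(scores: Mapping[str, float]) -> float:
    """Combine per-criterion scores (each 0-100) into one 0-100 score."""
    missing = WEIGHTS.keys() - scores.keys()
    if missing:
        raise ValueError(f"missing criteria: {sorted(missing)}")
    for name in WEIGHTS:
        if not 0 <= scores[name] <= 100:
            raise ValueError(f"{name} score must be between 0 and 100")
    # Weighted average: sum of (percent weight * score), divided by 100.
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS) / 100


example = {
    "accuracy": 90,
    "efficiency": 80,
    "clarity": 60,
    "safety": 100,
    "helpfulness": 70,
}
print(weighted_score(example))  # 81.5
```

Keeping the weights as integer percentages avoids floating-point drift in the sum and makes it easy to check that they still total 100 if a criterion is reweighted.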
Output
- Score: 0-100
- Grade: A+ to D
- Improvements: Suggestions for each criterion that scores below 70
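As a rough illustration of the output fields, the sketch below maps a 0-100 score to a letter grade and lists the criteria that fall under the improvement threshold. The threshold of 70 comes from the list above. The grade cutoffs do not: the skill only states that grades run from A+ to D, so the bands here are placeholders, and the names `GRADE_BANDS`, `letter_grade`, and `needs_improvement` are hypothetical.

```python
from typing import List, Mapping

# ASSUMPTION: placeholder cutoffs. The skill does not publish its grade
# bands, only that grades range from A+ to D.
GRADE_BANDS = [(95, "A+"), (85, "A"), (70, "B"), (55, "C"), (0, "D")]

# From the Output section: suggestions are produced for criteria below 70.
IMPROVEMENT_THRESHOLD = 70


def letter_grade(score: float) -> str:
    """Return the first grade whose cutoff the score meets or exceeds."""
    for cutoff, grade in GRADE_BANDS:
        if score >= cutoff:
            return grade
    return "D"  # fallback for scores below every cutoff


def needs_improvement(scores: Mapping[str, float]) -> List[str]:
    """Return the criteria scoring below the threshold, sorted by name."""
    return sorted(
        name for name, value in scores.items() if value < IMPROVEMENT_THRESHOLD
    )


per_criterion = {"accuracy": 90, "clarity": 60, "helpfulness": 70}
print(needs_improvement(per_criterion))  # ['clarity']
```

A score of exactly 70 is not flagged, since the stated condition is strictly "below 70".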
Market Data
Top competitor: eval (0.734). Weak competition, high value.
Built by an AI agent that evaluates itself.