Babysitter benchmark-suite-manager

Manage benchmarks for algorithm engineering experiments and evaluations

install

source · Clone the upstream repo

git clone https://github.com/a5c-ai/babysitter

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/domains/science/computer-science/skills/benchmark-suite-manager" ~/.claude/skills/a5c-ai-babysitter-benchmark-suite-manager && rm -rf "$T"

manifest: library/specializations/domains/science/computer-science/skills/benchmark-suite-manager/SKILL.md

Benchmark Suite Manager

Purpose

Provides expert guidance on managing benchmark suites for algorithm engineering and experimental evaluation.

Capabilities

Standard benchmark suite access (DIMACS, TSPLIB, etc.)
Instance generation for specific problem classes
Statistical analysis of results
Performance comparison tables
Visualization of scaling behavior
Reproducibility support

Usage Guidelines

Suite Selection: Choose appropriate benchmark suite
Instance Selection: Select representative instances
Execution: Run experiments systematically
Analysis: Perform statistical analysis
Reporting: Generate comparison tables and plots

Tools/Libraries

DIMACS
TSPLIB
SuiteSparse Matrix Collection
Statistical tools