Babysitter benchmark-suite-manager

Manage and execute mathematical benchmark suites

Install

source · Clone the upstream repo

git clone https://github.com/a5c-ai/babysitter

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/domains/science/mathematics/skills/benchmark-suite-manager" ~/.claude/skills/a5c-ai-babysitter-benchmark-suite-manager-3b5f65 && rm -rf "$T"

manifest: library/specializations/domains/science/mathematics/skills/benchmark-suite-manager/SKILL.md

Benchmark Suite Manager

Purpose

Manages and executes mathematical benchmark suites used to validate algorithms.

Capabilities

Standard benchmark access (Matrix Market, NIST, etc.)
Custom benchmark generation
Performance profiling
Accuracy validation
Comparison against reference solutions
Statistical analysis of results
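
As a rough illustration of how custom generation, profiling, and comparison against a reference solution fit together, the sketch below builds a small linear system with a known solution, times a solver on it, and reports the relative error. All names here (make_diagonally_dominant_system, jacobi_solve, run_benchmark) are hypothetical and are not part of the skill; the Jacobi solver is only a stand-in for whatever algorithm is being validated.

```python
import random
import time


def make_diagonally_dominant_system(n, seed=0):
    """Hypothetical generator: return A, b and the known solution x_ref."""
    rng = random.Random(seed)  # seeded so the benchmark is reproducible
    a = [[rng.uniform(-1.0, 1.0) for _ in range(n)] for _ in range(n)]
    for i in range(n):
        # Strict diagonal dominance guarantees that Jacobi iteration converges.
        a[i][i] = sum(abs(v) for j, v in enumerate(a[i]) if j != i) + 1.0
    x_ref = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    b = [sum(a[i][j] * x_ref[j] for j in range(n)) for i in range(n)]
    return a, b, x_ref


def jacobi_solve(a, b, iterations=200):
    """Stand-in solver under test: plain Jacobi iteration from x = 0."""
    n = len(b)
    x = [0.0] * n
    for _ in range(iterations):
        x = [
            (b[i] - sum(a[i][j] * x[j] for j in range(n) if j != i)) / a[i][i]
            for i in range(n)
        ]
    return x


def relative_error(x, x_ref):
    """Max-norm error of x, relative to the max-norm of the reference."""
    num = max(abs(p - q) for p, q in zip(x, x_ref))
    den = max(abs(q) for q in x_ref)
    return num / den


def run_benchmark(solver, n=20, seed=0):
    """Time one solve and compare the result against the known solution."""
    a, b, x_ref = make_diagonally_dominant_system(n, seed)
    start = time.perf_counter()
    x = solver(a, b)
    elapsed = time.perf_counter() - start
    return {"n": n, "seconds": elapsed, "rel_error": relative_error(x, x_ref)}


if __name__ == "__main__":
    print(run_benchmark(jacobi_solve))
```

Generating b from a chosen x_ref means the reference solution is exact by construction, so the reported error measures only the solver.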

Usage Guidelines

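Benchmark Selection: Choose standard benchmarks that match the problem class under test
Custom Generation: Create problem-specific benchmarks when no standard suite covers the case
Validation: Compare results against known reference solutions
Statistical Analysis: Analyze performance data across repeated runs rather than single measurements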
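
The statistical analysis guideline can be sketched with the Python standard library alone. The helper names time_repeated and summarize are assumptions made for this illustration, and the choice of warm-up and repeat counts is arbitrary; the idea is simply to collect several timings and report robust summaries instead of one number.

```python
import statistics
import time


def time_repeated(func, repeats=10, warmup=2):
    """Call func several times and return one wall-clock sample per call."""
    for _ in range(warmup):
        func()  # warm-up calls are discarded (caches, lazy imports, etc.)
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        samples.append(time.perf_counter() - start)
    return samples


def summarize(samples):
    """Summary statistics for a list of timing samples, in seconds."""
    stdev = statistics.stdev(samples) if len(samples) > 1 else 0.0
    return {
        "n": len(samples),
        "min": min(samples),
        "median": statistics.median(samples),
        "mean": statistics.mean(samples),
        "stdev": stdev,  # sample standard deviation
        "sem": stdev / len(samples) ** 0.5,  # standard error of the mean
    }


if __name__ == "__main__":
    timings = time_repeated(lambda: sum(i * i for i in range(10_000)))
    print(summarize(timings))
```

The median and minimum are usually less sensitive to occasional slow runs than the mean, which is why the sketch reports all three.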

Tools/Libraries

Matrix Market
NIST Digital Library of Mathematical Functions (DLMF)
SuiteSparse Matrix Collection
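
Matrices from Matrix Market are distributed in the Matrix Market exchange format, and the SuiteSparse Matrix Collection also offers downloads in that format. The sketch below is a minimal, illustrative reader for the sparse "coordinate" variant with real or integer entries and general or symmetric storage. The function parse_matrix_market and the SAMPLE text are hypothetical and cover only this subset of the format; they are not the skill's own loader.

```python
SAMPLE = """%%MatrixMarket matrix coordinate real general
% a 2x2 example with three stored entries
2 2 3
1 1 4.0
2 1 -1.0
2 2 3.5
"""


def parse_matrix_market(text):
    """Parse coordinate-format text into (rows, cols, {(row, col): value})."""
    lines = iter(text.splitlines())
    header = next(lines).split()
    if len(header) != 5 or header[0] != "%%MatrixMarket":
        raise ValueError("missing %%MatrixMarket banner")
    _, obj, fmt, field, symmetry = (h.lower() for h in header)
    if (obj, fmt) != ("matrix", "coordinate"):
        raise ValueError("only 'matrix coordinate' files are handled here")
    if field not in ("real", "integer"):
        raise ValueError(f"unsupported field: {field}")
    if symmetry not in ("general", "symmetric"):
        raise ValueError(f"unsupported symmetry: {symmetry}")

    # Skip comment lines (starting with '%') and blank lines.
    body = [ln for ln in lines if ln.strip() and not ln.startswith("%")]
    rows, cols, nnz = (int(tok) for tok in body[0].split())

    entries = {}
    for ln in body[1:1 + nnz]:
        i, j, value = ln.split()
        r, c = int(i) - 1, int(j) - 1  # the file uses 1-based indices
        entries[(r, c)] = float(value)
        if symmetry == "symmetric" and r != c:
            entries[(c, r)] = float(value)  # mirror the stored triangle
    return rows, cols, entries


if __name__ == "__main__":
    print(parse_matrix_market(SAMPLE))
```

Where SciPy is available, scipy.io.mmread reads the same format and is the more complete option; the pure-Python version is shown only to make the file layout explicit.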