Babysitter benchmark-suite-manager
Manage and execute mathematical benchmark suites
install
source · Clone the upstream repo
git clone https://github.com/a5c-ai/babysitter
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/domains/science/mathematics/skills/benchmark-suite-manager" ~/.claude/skills/a5c-ai-babysitter-benchmark-suite-manager-3b5f65 && rm -rf "$T"
manifest:
library/specializations/domains/science/mathematics/skills/benchmark-suite-manager/SKILL.md
source content
Benchmark Suite Manager
Purpose
Manages and executes mathematical benchmark suites used to validate algorithms.
Capabilities
- Standard benchmark access (Matrix Market, NIST, etc.)
- Custom benchmark generation
- Performance profiling
- Accuracy validation
- Comparison against reference solutions
- Statistical analysis of results
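As one way to picture custom generation and accuracy validation working together, the sketch below builds a small linear system from a known ("manufactured") solution and checks a solver's output against it. The names `make_benchmark`, `jacobi`, and `relative_error` are hypothetical and not part of the skill. Jacobi iteration only stands in for whatever algorithm is under test.

```python
import random


def make_benchmark(n, seed=0):
    """Hypothetical generator: a strictly diagonally dominant system A x = b
    whose exact solution x_ref is known by construction."""
    rng = random.Random(seed)
    A = [[rng.uniform(-1.0, 1.0) for _ in range(n)] for _ in range(n)]
    for i in range(n):
        # Force strict diagonal dominance so Jacobi iteration converges.
        A[i][i] = sum(abs(v) for v in A[i]) + 1.0
    x_ref = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    b = [sum(A[i][j] * x_ref[j] for j in range(n)) for i in range(n)]
    return A, b, x_ref


def jacobi(A, b, iterations=200):
    """Stand-in for the algorithm under test: plain Jacobi iteration."""
    n = len(b)
    x = [0.0] * n
    for _ in range(iterations):
        x = [
            (b[i] - sum(A[i][j] * x[j] for j in range(n) if j != i)) / A[i][i]
            for i in range(n)
        ]
    return x


def relative_error(x, x_ref):
    """Max-norm error of x relative to the reference solution."""
    num = max(abs(a - r) for a, r in zip(x, x_ref))
    den = max(abs(r) for r in x_ref)
    return num / den


A, b, x_ref = make_benchmark(8, seed=42)
err = relative_error(jacobi(A, b), x_ref)
print(f"relative error: {err:.2e}")
```

Because the reference solution is fixed before the right-hand side is computed, the check does not depend on a second solver being correct. The seed makes the benchmark reproducible across runs.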
Usage Guidelines
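- Benchmark Selection: Choose standard benchmarks that match the problem class under test
- Custom Generation: Create problem-specific benchmarks when no standard suite covers the case
- Validation: Compare results against known reference solutions
- Statistical Analysis: Analyze performance data across repeated runs rather than single measurements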
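The statistical analysis guideline could look like the following minimal sketch, which times a callable over several runs and reports summary statistics. `time_runs` and `summarize` are hypothetical helper names, and the repeat and warm-up counts are arbitrary assumptions rather than values prescribed by the skill.

```python
import statistics
import time


def time_runs(fn, repeats=10, warmup=2):
    """Call fn repeatedly and return wall-clock durations in seconds."""
    for _ in range(warmup):
        fn()  # warm-up runs are discarded
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - start)
    return samples


def summarize(samples):
    """Summary statistics for a list of timing samples."""
    return {
        "n": len(samples),
        "min": min(samples),
        "median": statistics.median(samples),
        "mean": statistics.fmean(samples),
        "stdev": statistics.stdev(samples) if len(samples) > 1 else 0.0,
    }


# Placeholder workload; replace with the benchmark being measured.
stats = summarize(time_runs(lambda: sum(range(10_000))))
print(stats)
```

The median is reported alongside the mean because timing samples tend to be skewed by occasional slow runs, and the median is less sensitive to those outliers.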
Tools/Libraries
- Matrix Market
- NIST Digital Library of Mathematical Functions (DLMF)
- SuiteSparse Matrix Collection
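Matrices in Matrix Market form are commonly distributed as plain-text coordinate files. To show what such a file contains, here is a minimal stdlib-only sketch of a reader for the real or integer coordinate format with general or symmetric storage. `parse_matrix_market` is a hypothetical helper, not the skill's API. In practice an existing reader such as `scipy.io.mmread` would normally be used instead.

```python
def parse_matrix_market(text):
    """Parse a Matrix Market coordinate file given as a string.

    Returns (rows, cols, entries), where entries maps zero-based
    (row, col) pairs to float values. Only real/integer fields with
    general or symmetric storage are handled in this sketch.
    """
    lines = text.splitlines()
    header = lines[0].split()
    if (
        len(header) != 5
        or header[0] != "%%MatrixMarket"
        or header[1].lower() != "matrix"
        or header[2].lower() != "coordinate"
    ):
        raise ValueError("not a Matrix Market coordinate file")
    field, symmetry = header[3].lower(), header[4].lower()
    if field not in ("real", "integer"):
        raise ValueError(f"unsupported field: {field}")
    if symmetry not in ("general", "symmetric"):
        raise ValueError(f"unsupported symmetry: {symmetry}")

    # Drop comment lines (starting with %) and blank lines.
    body = [ln for ln in lines[1:] if ln.strip() and not ln.startswith("%")]
    rows, cols, nnz = (int(tok) for tok in body[0].split())

    entries = {}
    for ln in body[1 : 1 + nnz]:
        i_str, j_str, v_str = ln.split()
        i, j, v = int(i_str) - 1, int(j_str) - 1, float(v_str)  # file is 1-based
        entries[(i, j)] = v
        if symmetry == "symmetric" and i != j:
            entries[(j, i)] = v  # symmetric files store one triangle only
    return rows, cols, entries


SAMPLE = """%%MatrixMarket matrix coordinate real symmetric
% small illustrative matrix, not taken from any collection
3 3 4
1 1 2.0
2 1 -1.0
2 2 2.0
3 3 1.5
"""

rows, cols, entries = parse_matrix_market(SAMPLE)
print(rows, cols, len(entries))
```

The sample matrix is made up for illustration. Files from the collections listed above may use other variants (for example `array`, `complex`, or `pattern`), which this sketch rejects or does not cover.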