Babysitter benchmark-suite-manager

Manage and execute mathematical benchmark suites

install
source · Clone the upstream repo
git clone https://github.com/a5c-ai/babysitter
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/domains/science/mathematics/skills/benchmark-suite-manager" ~/.claude/skills/a5c-ai-babysitter-benchmark-suite-manager-3b5f65 && rm -rf "$T"
manifest: library/specializations/domains/science/mathematics/skills/benchmark-suite-manager/SKILL.md
source content

Benchmark Suite Manager

Purpose

Manages and runs mathematical benchmark suites used to validate numerical algorithms.

Capabilities

  • Standard benchmark access (Matrix Market, NIST, etc.)
  • Custom benchmark generation
  • Performance profiling
  • Accuracy validation
  • Comparison against reference solutions
  • Statistical analysis of results

Usage Guidelines

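  1. Benchmark Selection: Choose standard benchmarks that match the problem class being tested
  2. Custom Generation: Create problem-specific benchmarks where standard suites do not cover the case
  3. Validation: Compare computed results against known reference solutions
  4. Statistical Analysis: Analyze performance data over repeated runs rather than single measurements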
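
The statistical analysis step can be sketched with the Python standard library alone. The helpers `time_runs` and `summarize` below are hypothetical names, and the warm-up and repeat counts are arbitrary assumptions. The sketch only shows the idea of collecting several timings and reporting summary statistics instead of one number.

```python
import statistics
import time


def time_runs(func, repeats=20, warmup=3):
    """Call func repeatedly and return the wall-clock time of each run."""
    for _ in range(warmup):
        func()  # warm-up runs are executed but not recorded
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        samples.append(time.perf_counter() - start)
    return samples


def summarize(samples):
    """Summary statistics for a list of timings, in seconds."""
    return {
        "n": len(samples),
        "min": min(samples),
        "median": statistics.median(samples),
        "mean": statistics.mean(samples),
        # The sample standard deviation needs at least two data points.
        "stdev": statistics.stdev(samples) if len(samples) > 1 else 0.0,
    }


if __name__ == "__main__":
    timings = time_runs(lambda: sorted(range(10_000, 0, -1)))
    print(summarize(timings))
```

The median and minimum are usually less sensitive to scheduling noise than the mean, which is why the summary reports all three.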

Tools/Libraries

  • Matrix Market
  • NIST Digital Library
  • SuiteSparse Matrix Collection
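
Matrices in the Matrix Market exchange format (also offered as a download format by the SuiteSparse Matrix Collection) are plain text and can be read with very little code. The sketch below is a minimal parser written for illustration; `parse_matrix_market` is a hypothetical helper that handles only the `coordinate` format with `real` or `integer` values and `general` or `symmetric` storage. For real work, an established reader such as `scipy.io.mmread` is the safer choice.

```python
def parse_matrix_market(text):
    """Parse a coordinate-format Matrix Market string.

    Returns (rows, cols, entries), where entries maps zero-based
    (row, col) pairs to float values.
    """
    lines = text.splitlines()
    header = lines[0].split()
    if len(header) != 5 or header[0] != "%%MatrixMarket":
        raise ValueError("missing %%MatrixMarket header line")
    obj, fmt, field, symmetry = (h.lower() for h in header[1:])
    if (obj, fmt) != ("matrix", "coordinate"):
        raise ValueError("only 'matrix coordinate' files are supported")
    if field not in ("real", "integer"):
        raise ValueError("only real or integer values are supported")
    if symmetry not in ("general", "symmetric"):
        raise ValueError("only general or symmetric storage is supported")

    # Skip blank lines and '%' comment lines after the header.
    body = [ln for ln in lines[1:] if ln.strip() and not ln.startswith("%")]
    rows, cols, nnz = (int(tok) for tok in body[0].split())

    entries = {}
    for ln in body[1:1 + nnz]:
        i, j, v = ln.split()
        i, j, v = int(i) - 1, int(j) - 1, float(v)  # file indices are 1-based
        entries[(i, j)] = v
        if symmetry == "symmetric" and i != j:
            entries[(j, i)] = v  # mirror the stored triangle
    return rows, cols, entries


if __name__ == "__main__":
    sample = "\n".join([
        "%%MatrixMarket matrix coordinate real symmetric",
        "% a small illustrative matrix",
        "3 3 4",
        "1 1 2.0",
        "2 1 -1.0",
        "2 2 2.0",
        "3 3 1.5",
    ])
    print(parse_matrix_market(sample))
```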