Babysitter Cost Optimizer (Cloud Data Platforms)
Analyzes and optimizes costs for cloud data platforms
install
source · Clone the upstream repo
git clone https://github.com/a5c-ai/babysitter
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/data-engineering-analytics/skills/cost-optimizer" ~/.claude/skills/a5c-ai-babysitter-cost-optimizer-cloud-data-platforms && rm -rf "$T"
manifest:
library/specializations/data-engineering-analytics/skills/cost-optimizer/SKILL.mdsource content
Cost Optimizer (Cloud Data Platforms)
Overview
Analyzes and optimizes costs for cloud data platforms. This skill provides deep expertise in platform-specific cost structures and optimization strategies.
Capabilities
- Snowflake credit analysis and optimization
- BigQuery slot and on-demand optimization
- Redshift node sizing
- Storage cost optimization
- Query cost estimation
- Warehouse scheduling recommendations
- Data lifecycle policy recommendations
- Reserved capacity planning
Input Schema
{ "platform": "snowflake|bigquery|redshift|databricks", "usageMetrics": "object", "billingData": "object", "queryHistory": "object" }
Output Schema
{ "currentCost": "number", "optimizedCost": "number", "savings": "percentage", "recommendations": [{ "category": "string", "action": "string", "impact": "number", "effort": "low|medium|high" }] }
Target Processes
- Data Warehouse Setup
- Query Optimization
- Pipeline Migration
Usage Guidelines
- Provide platform-specific usage metrics
- Include billing data for cost baseline
- Share query history for optimization analysis
- Prioritize recommendations by impact and effort
Best Practices
- Regularly review and optimize warehouse sizes
- Implement auto-suspend and auto-resume policies
- Use clustering and partitioning to reduce scan costs
- Consider reserved capacity for predictable workloads
- Monitor and alert on cost anomalies