Babysitter Feature Engineering Optimizer
Optimizes feature engineering pipelines and feature store configurations
install
source · Clone the upstream repo
git clone https://github.com/a5c-ai/babysitter
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/data-engineering-analytics/skills/feature-engineering-optimizer" ~/.claude/skills/a5c-ai-babysitter-feature-engineering-optimizer && rm -rf "$T"
manifest:
library/specializations/data-engineering-analytics/skills/feature-engineering-optimizer/SKILL.mdsource content
Feature Engineering Optimizer
Overview
Optimizes feature engineering pipelines and feature store configurations. This skill improves ML feature quality, performance, and serving efficiency.
Capabilities
- Feature importance analysis
- Feature correlation detection
- Encoding strategy recommendations
- Feature freshness optimization
- Online/offline feature sync
- Feature versioning
- Point-in-time correctness validation
- Feature serving optimization
Input Schema
{ "features": [{ "name": "string", "definition": "string", "type": "string" }], "targetVariable": "string", "useCases": ["batch|realtime|streaming"], "performanceRequirements": "object" }
Output Schema
{ "optimizedFeatures": ["object"], "removedFeatures": ["string"], "engineeringRecommendations": ["object"], "servingConfig": "object" }
Target Processes
- Feature Store Setup
- A/B Testing Pipeline
Usage Guidelines
- Provide complete feature definitions
- Specify target variable for importance analysis
- Define use cases (batch, realtime, streaming)
- Include performance requirements for serving optimization
Best Practices
- Validate point-in-time correctness for training features
- Remove highly correlated features to reduce redundancy
- Optimize feature freshness based on actual requirements
- Version features alongside model versions
- Monitor feature drift in production