Vibe-Skills preprocessing-data-with-automated-pipelines
install
source · Clone the upstream repo
git clone https://github.com/foryourhealth111-pixel/Vibe-Skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/foryourhealth111-pixel/Vibe-Skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/bundled/skills/preprocessing-data-with-automated-pipelines" ~/.claude/skills/foryourhealth111-pixel-vibe-skills-preprocessing-data-with-automated-pipelines && rm -rf "$T"
manifest:
bundled/skills/preprocessing-data-with-automated-pipelines/SKILL.mdsource content
Data Preprocessing Pipeline
Positioning
In governed ML routing, treat this skill as a stage assistant. It is for preprocessing-heavy execution after the pack owner is chosen.
When to Use
Use this skill when:
- Prepare raw data for machine learning models.
- Automate data cleaning and transformation processes.
- Implement a robust ETL (Extract, Transform, Load) pipeline.
Not For / Boundaries
- Whole-task ML ownership: use
,scikit-learn
, orml-pipeline-workflowtraining-machine-learning-models - Leakage and prediction-time auditing: use
ml-data-leakage-guard - Grouped scientific preprocessing with stronger methodological constraints: use
scientific-data-preprocessing
Typical Outputs
- A preprocessing pipeline plan or implementation sketch
- Clear sequencing for clean, encode, transform, and validate steps
- Handoff notes for leakage review, training, or evaluation
Related Skills
before trusting fitted preprocessing stepsml-data-leakage-guard
when the next narrow problem is partition strategysplitting-datasets