Babysitter etl-pipeline-builder
Build and manage ETL pipelines for data migration with transformation, CDC, and monitoring
install
source · Clone the upstream repo
git clone https://github.com/a5c-ai/babysitter
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/code-migration-modernization/skills/etl-pipeline-builder" ~/.claude/skills/a5c-ai-babysitter-etl-pipeline-builder && rm -rf "$T"
manifest:
library/specializations/code-migration-modernization/skills/etl-pipeline-builder/SKILL.mdsource content
ETL Pipeline Builder Skill
Builds and manages ETL (Extract, Transform, Load) pipelines for data migration, supporting incremental loads, CDC, and comprehensive monitoring.
Purpose
Enable data pipeline creation for:
- Source-to-target mapping
- Transformation definition
- Incremental load setup
- CDC configuration
- Pipeline monitoring
Capabilities
1. Source-to-Target Mapping
- Define column mappings
- Handle schema differences
- Configure data type conversions
- Manage derived columns
2. Transformation Definition
- Data type transformations
- Value mappings
- Aggregations
- Lookups and enrichments
3. Incremental Load Setup
- Define watermarks
- Configure incremental columns
- Handle deletes
- Manage merge logic
4. CDC Configuration
- Log-based CDC
- Trigger-based CDC
- Timestamp-based CDC
- Full load comparison
5. Error Handling
- Define retry policies
- Configure dead letter queues
- Handle data quality issues
- Implement alerting
6. Pipeline Monitoring
- Track pipeline metrics
- Monitor data volumes
- Alert on failures
- Generate SLA reports
Tool Integrations
| Tool | Type | Integration Method |
|---|---|---|
| Apache Airflow | Orchestration | Python |
| dbt | Transformation | CLI |
| Airbyte | Data integration | API |
| Fivetran | SaaS ETL | API |
| AWS DMS | Cloud migration | CLI |
| Debezium | CDC | Config |
Output Schema
{ "pipelineId": "string", "timestamp": "ISO8601", "pipeline": { "name": "string", "source": {}, "target": {}, "mappings": [], "transformations": [], "schedule": "string" }, "artifacts": { "dagFile": "string", "configFile": "string", "sqlFiles": [] }, "deployment": { "status": "string", "url": "string" } }
Integration with Migration Processes
- database-schema-migration: Data movement
- cloud-migration: Cloud data pipelines
- data-format-migration: Format transformation
Related Skills
: Validationdata-migration-validator
: Schema mappingschema-comparator
Related Agents
: Pipeline orchestrationdatabase-migration-orchestrator
: Pipeline designdata-architect-agent