Claude-skill-registry discover-data
Automatically discover data pipeline and ETL skills when working with ETL. Activates for data development tasks.
install
source · Clone the upstream repo
git clone https://github.com/majiayu000/claude-skill-registry
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/cc-polymath-discover-data" ~/.claude/skills/majiayu000-claude-skill-registry-discover-data && rm -rf "$T"
manifest:
skills/data/cc-polymath-discover-data/SKILL.mdsource content
Data Skills Discovery
Provides automatic access to comprehensive data skills.
When This Skill Activates
This skill auto-activates when you're working with:
- ETL
- data pipelines
- batch processing
- stream processing
- data validation
- orchestration
- Airflow
- timely dataflow
- differential dataflow
- streaming aggregations
- windowing
- real-time analytics
Available Skills
Quick Reference
The Data category contains 9 skills:
- batch-processing - Orchestrating complex data pipelines with dependencies
- data-validation - Validating data schema before processing
- dataflow-coordination - Coordination patterns for distributed dataflow systems
- differential-dataflow - Differential computation for incremental updates and efficient joins
- etl-patterns - Designing data extraction from multiple sources
- pipeline-orchestration - Coordinating complex multi-step data workflows
- stream-processing - Processing real-time event streams (Kafka, Flink)
- streaming-aggregations - Windowing, sessionization, time-series aggregation
- timely-dataflow - Low-latency streaming computation with progress tracking
Load Full Category Details
For complete descriptions and workflows:
cat ~/.claude/skills/data/INDEX.md
This loads the full Data category index with:
- Detailed skill descriptions
- Usage triggers for each skill
- Common workflow combinations
- Cross-references to related skills
Load Specific Skills
Load individual skills as needed:
# Traditional ETL/Batch cat ~/.claude/skills/data/batch-processing.md cat ~/.claude/skills/data/data-validation.md cat ~/.claude/skills/data/etl-patterns.md cat ~/.claude/skills/data/pipeline-orchestration.md # Stream Processing cat ~/.claude/skills/data/stream-processing.md cat ~/.claude/skills/data/streaming-aggregations.md # Advanced Dataflow Systems cat ~/.claude/skills/data/timely-dataflow.md cat ~/.claude/skills/data/differential-dataflow.md cat ~/.claude/skills/data/dataflow-coordination.md
Common Workflow Combinations
Real-Time Analytics Pipeline
# Load these skills together: cat ~/.claude/skills/data/stream-processing.md # Kafka setup cat ~/.claude/skills/data/streaming-aggregations.md # Windowing patterns cat ~/.claude/skills/data/dataflow-coordination.md # Coordination
Incremental Computation System
# Load these skills together: cat ~/.claude/skills/data/timely-dataflow.md # Foundation cat ~/.claude/skills/data/differential-dataflow.md # Incremental updates cat ~/.claude/skills/data/dataflow-coordination.md # Distributed coordination
Hybrid Batch + Stream
# Load these skills together: cat ~/.claude/skills/data/batch-processing.md # Batch jobs cat ~/.claude/skills/data/stream-processing.md # Stream processing cat ~/.claude/skills/data/pipeline-orchestration.md # Overall coordination
Progressive Loading
This gateway skill enables progressive loading:
- Level 1: Gateway loads automatically (you're here now)
- Level 2: Load category INDEX.md for full overview
- Level 3: Load specific skills as needed
Usage Instructions
- Auto-activation: This skill loads automatically when Claude Code detects data work
- Browse skills: Run
for full category overviewcat ~/.claude/skills/data/INDEX.md - Load specific skills: Use bash commands above to load individual skills
Next Steps: Run
cat ~/.claude/skills/data/INDEX.md to see full category details.