Babysitter Data Catalog Enricher
Enriches data catalog entries with automated metadata
install
source · Clone the upstream repo
git clone https://github.com/a5c-ai/babysitter
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/data-engineering-analytics/skills/data-catalog-enricher" ~/.claude/skills/a5c-ai-babysitter-data-catalog-enricher && rm -rf "$T"
manifest:
library/specializations/data-engineering-analytics/skills/data-catalog-enricher/SKILL.mdsource content
Data Catalog Enricher
Overview
Enriches data catalog entries with automated metadata. This skill enhances data discoverability and governance through intelligent metadata augmentation.
Capabilities
- Automated tag suggestion
- Business glossary term matching
- Owner/steward recommendation
- Usage pattern analysis
- Data classification (sensitivity, PII)
- Quality score integration
- Lineage enrichment
- Search optimization
Input Schema
{ "catalogEntry": "object", "dataProfile": "object", "existingGlossary": "object", "organizationContext": "object" }
Output Schema
{ "enrichedEntry": "object", "suggestedTags": ["string"], "glossaryMatches": ["object"], "classificationResults": "object", "ownerSuggestions": ["string"] }
Target Processes
- Data Catalog
- Data Lineage Mapping
- Data Quality Framework
Usage Guidelines
- Provide existing catalog entry for enrichment
- Include data profile for classification analysis
- Supply business glossary for term matching
- Add organization context for owner recommendations
Best Practices
- Regularly update glossary matches as glossary evolves
- Validate PII classifications with data stewards
- Integrate quality scores from quality framework
- Maintain consistent tagging taxonomy
- Review and approve automated classifications