Skills data-analytics
Create data pipeline and analytics architecture diagrams using PlantUML syntax with database/analytics stencil icons. Best for ETL pipelines, data lakes, real-time streaming, data warehousing, and BI dashboard design.
install
source · Clone the upstream repo
git clone https://github.com/markdown-viewer/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/markdown-viewer/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/data-analytics" ~/.claude/skills/markdown-viewer-skills-data-analytics && rm -rf "$T"
manifest:
data-analytics/SKILL.mdsource content
Data Analytics Diagram Generator
Quick Start: Define data sources → Declare ingestion/ETL icons → Connect to storage/warehouse → Add BI/visualization → Wrap in
```plantuml fence.
⚠️ IMPORTANT: Always use
or```plantumlcode fence. NEVER use```puml— it will NOT render as a diagram.```text
Critical Rules
- Every diagram starts with
and ends with@startuml@enduml - Use
for data pipelines (Source → Ingest → Transform → Store → Visualize)left to right direction - Use
stencil syntax for analytics, database, and storage iconsmxgraph.aws4.* - Default colors are applied automatically — you do NOT need to specify
orfillColorstrokeColor - Use
orrectangle "Zone" { ... }
for grouping pipeline stagespackage "Layer" { ... } - Directed flows use
, async/streaming flows use-->
(dashed)..>
Full stencil reference: See stencils/README.md for 9500+ available icons.
Mxgraph Stencil Syntax
mxgraph.aws4.<icon> "Label" as <alias>
Analytics & ETL Stencils
| Category | Stencils | Purpose |
|---|---|---|
| Query Engine | , | Serverless SQL on S3 data |
| ETL | , , , , | Data integration & cataloging |
| Streaming | , , , , | Real-time data streaming |
| MapReduce | , , , | Big data processing (Spark, Hive) |
| Data Warehouse | , , , | Columnar analytics warehouse |
| Search | , , | Full-text search & log analytics |
| BI | | Dashboards & visualizations |
| Data Lake | , , , | Governed data lake storage |
| Catalog | , | Data governance & sharing |
| Streaming Kafka | , | Managed Kafka streaming |
Database Stencils
| Category | Stencils | Purpose |
|---|---|---|
| Relational | , , , , , | Transactional databases |
| NoSQL | , , , | Key-value & document store |
| Graph | | Graph database |
| In-Memory | , , | Cache & session store |
| Document | , | Document database |
| Ledger | | Immutable transaction log |
| Wide-Column | | Cassandra-compatible |
Connection Types
| Syntax | Meaning | Use Case |
|---|---|---|
| Solid arrow | Batch data flow / API call |
| Dashed arrow | Streaming / async / CDC |
| Solid line | Bidirectional sync |
| Labeled connection | Describe data format or volume |
Quick Example
@startuml left to right direction mxgraph.aws4.s3 "Data Lake\n(S3)" as s3 mxgraph.aws4.glue "Glue\nETL" as glue mxgraph.aws4.redshift "Redshift" as rs mxgraph.aws4.quicksight "QuickSight" as qs s3 --> glue glue --> rs rs --> qs @enduml
Data Analytics Architecture Types
| Type | Purpose | Key Stencils | Example |
|---|---|---|---|
| Data Lake | Centralized raw data store | , , , | data-lake.md |
| Real-time Streaming | Event stream processing | , , , | real-time-streaming.md |
| Data Warehouse | Star-schema analytics | , , | data-warehouse.md |
| ETL Pipeline | Extract-transform-load | , , , | etl-pipeline.md |
| Log Analytics | Centralized logging | , , | log-analytics.md |
| ML Feature Store | Feature engineering pipeline | , , , | ml-feature-pipeline.md |
| CDC Pipeline | Database change capture | , , , | cdc-pipeline.md |
| Multi-source BI | Cross-database reporting | , , , | multi-source-bi.md |