Skills data-analytics

Create data pipeline and analytics architecture diagrams using PlantUML syntax with database/analytics stencil icons. Best for ETL pipelines, data lakes, real-time streaming, data warehousing, and BI dashboard design.

install
source · Clone the upstream repo
git clone https://github.com/markdown-viewer/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/markdown-viewer/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/data-analytics" ~/.claude/skills/markdown-viewer-skills-data-analytics && rm -rf "$T"
manifest: data-analytics/SKILL.md
source content

Data Analytics Diagram Generator

Quick Start: Define data sources → Declare ingestion/ETL icons → Connect to storage/warehouse → Add BI/visualization → Wrap in

```plantuml
fence.

⚠️ IMPORTANT: Always use

```plantuml
or
```puml
code fence. NEVER use
```text
— it will NOT render as a diagram.

Critical Rules

  • Every diagram starts with
    @startuml
    and ends with
    @enduml
  • Use
    left to right direction
    for data pipelines (Source → Ingest → Transform → Store → Visualize)
  • Use
    mxgraph.aws4.*
    stencil syntax for analytics, database, and storage icons
  • Default colors are applied automatically — you do NOT need to specify
    fillColor
    or
    strokeColor
  • Use
    rectangle "Zone" { ... }
    or
    package "Layer" { ... }
    for grouping pipeline stages
  • Directed flows use
    -->
    , async/streaming flows use
    ..>
    (dashed)

Full stencil reference: See stencils/README.md for 9500+ available icons.

Mxgraph Stencil Syntax

mxgraph.aws4.<icon> "Label" as <alias>

Analytics & ETL Stencils

CategoryStencilsPurpose
Query Engine
athena
,
athena_data_source_connectors
Serverless SQL on S3 data
ETL
glue
,
glue_crawlers
,
glue_data_catalog
,
aws_glue_data_quality
,
aws_glue_for_ray
Data integration & cataloging
Streaming
kinesis
,
kinesis_data_streams
,
kinesis_data_firehose
,
kinesis_data_analytics
,
kinesis_video_streams
Real-time data streaming
MapReduce
emr
,
emr_engine
,
emr_engine_mapr_m3
,
emr_engine_mapr_m5
Big data processing (Spark, Hive)
Data Warehouse
redshift
,
redshift_ra3
,
redshift_streaming_ingestion
,
redshift_ml
Columnar analytics warehouse
Search
opensearch_service_data_node
,
opensearch_ingestion
,
cloudsearch
Full-text search & log analytics
BI
quicksight
Dashboards & visualizations
Data Lake
lake_formation
,
s3
,
glacier
,
glacier_deep_archive
Governed data lake storage
Catalog
datazone_custom_asset_type
,
data_exchange
Data governance & sharing
Streaming Kafka
msk
,
msk_connect
Managed Kafka streaming

Database Stencils

CategoryStencilsPurpose
Relational
aurora
,
aurora_instance
,
rds
,
rds_instance
,
rds_mysql_instance
,
rds_postgresql_instance
Transactional databases
NoSQL
dynamodb
,
dynamodb_table
,
dynamodb_global_secondary_index
,
dynamodb_stream
Key-value & document store
Graph
neptune
Graph database
In-Memory
elasticache
,
elasticache_for_redis
,
elasticache_for_memcached
Cache & session store
Document
documentdb
,
documentdb_with_mongodb_compatibility
Document database
Ledger
quantum_ledger_database
Immutable transaction log
Wide-Column
keyspaces
Cassandra-compatible

Connection Types

SyntaxMeaningUse Case
A --> B
Solid arrowBatch data flow / API call
A ..> B
Dashed arrowStreaming / async / CDC
A -- B
Solid lineBidirectional sync
A --> B : "label"
Labeled connectionDescribe data format or volume

Quick Example

@startuml
left to right direction
mxgraph.aws4.s3 "Data Lake\n(S3)" as s3
mxgraph.aws4.glue "Glue\nETL" as glue
mxgraph.aws4.redshift "Redshift" as rs
mxgraph.aws4.quicksight "QuickSight" as qs

s3 --> glue
glue --> rs
rs --> qs
@enduml

Data Analytics Architecture Types

TypePurposeKey StencilsExample
Data LakeCentralized raw data store
s3
,
lake_formation
,
glue
,
athena
data-lake.md
Real-time StreamingEvent stream processing
kinesis
,
msk
,
lambda_function
,
opensearch_service
real-time-streaming.md
Data WarehouseStar-schema analytics
redshift
,
glue
,
quicksight
data-warehouse.md
ETL PipelineExtract-transform-load
glue
,
glue_crawlers
,
glue_data_catalog
,
s3
etl-pipeline.md
Log AnalyticsCentralized logging
kinesis_data_firehose
,
opensearch_service
,
lambda_function
log-analytics.md
ML Feature StoreFeature engineering pipeline
glue
,
s3
,
athena
,
emr
ml-feature-pipeline.md
CDC PipelineDatabase change capture
dynamodb_streams
,
kinesis
,
lambda_function
,
redshift
cdc-pipeline.md
Multi-source BICross-database reporting
aurora
,
dynamodb
,
redshift
,
quicksight
multi-source-bi.md