Babysitter Kafka Topic Designer

Designs and optimizes Apache Kafka topics and configurations

Install
source · Clone the upstream repo
git clone https://github.com/a5c-ai/babysitter
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/data-engineering-analytics/skills/kafka-topic-designer" ~/.claude/skills/a5c-ai-babysitter-kafka-topic-designer && rm -rf "$T"
manifest: library/specializations/data-engineering-analytics/skills/kafka-topic-designer/SKILL.md

Kafka Topic Designer

Overview

This skill designs and optimizes Apache Kafka topics and their configurations. It covers topic architecture, partitioning strategies, and producer/consumer configuration, aiming for high throughput and low latency in streaming workloads.

Capabilities

  • Topic naming convention design
  • Partition strategy optimization
  • Replication factor recommendations
  • Retention policy configuration
  • Compaction strategy design
  • Schema registry integration
  • Consumer group design
  • Throughput capacity planning
  • Security configuration (ACLs, encryption)
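
For the throughput capacity planning item, a common rule of thumb sizes the partition count from the target throughput divided by the per-partition throughput of a single producer and of a single consumer, taking the larger result. The sketch below assumes those per-partition rates have been benchmarked on your own cluster. The function name and the sample figures are hypothetical and not part of the skill.

```python
import math


def estimate_partitions(
    target_mbps: float,
    producer_mbps_per_partition: float,
    consumer_mbps_per_partition: float,
    min_partitions: int = 1,
) -> int:
    """Rule-of-thumb partition count: max(t/p, t/c), rounded up."""
    if target_mbps < 0:
        raise ValueError("target_mbps must be non-negative")
    if producer_mbps_per_partition <= 0 or consumer_mbps_per_partition <= 0:
        raise ValueError("per-partition throughput must be positive")
    # Partitions needed so producers can sustain the target rate.
    for_producers = math.ceil(target_mbps / producer_mbps_per_partition)
    # Partitions needed so consumers can keep up with the target rate.
    for_consumers = math.ceil(target_mbps / consumer_mbps_per_partition)
    return max(for_producers, for_consumers, min_partitions)


# Hypothetical figures: 100 MB/s target, 10 MB/s per partition on the
# producer side, 20 MB/s per partition on the consumer side.
print(estimate_partitions(100, 10, 20))
```

Treat the result as a starting point: it ignores headroom for growth, broker limits, and key skew, all of which may justify a higher count.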

Input Schema

{
  "requirements": {
    "throughputMBps": "number",
    "messageSize": "number",
    "retentionDays": "number",
    "orderingRequirements": "string"
  },
  "existingTopics": ["object"],
  "clusterConfig": "object"
}
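
As an illustration of the input shape, the following sketch checks a request against the field names and types listed in the schema. It makes two assumptions: "existingTopics" and "clusterConfig" are optional, and messageSize is in bytes (the schema does not state a unit). The helper name and the sample values are hypothetical.

```python
from numbers import Real

# Field names and types come from the Input Schema above.
REQUIREMENT_TYPES = {
    "throughputMBps": Real,
    "messageSize": Real,
    "retentionDays": Real,
    "orderingRequirements": str,
}


def validate_request(request: dict) -> list:
    """Return a list of problems; an empty list means the shape is valid."""
    requirements = request.get("requirements")
    if not isinstance(requirements, dict):
        return ["requirements must be an object"]
    errors = []
    for field, expected in REQUIREMENT_TYPES.items():
        if field not in requirements:
            errors.append(f"requirements.{field} is missing")
            continue
        value = requirements[field]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            errors.append(f"requirements.{field} has the wrong type")
    # Assumption: the two top-level fields below may be omitted.
    topics = request.get("existingTopics", [])
    if not isinstance(topics, list) or not all(isinstance(t, dict) for t in topics):
        errors.append("existingTopics must be a list of objects")
    if not isinstance(request.get("clusterConfig", {}), dict):
        errors.append("clusterConfig must be an object")
    return errors


# Illustrative values only; keys inside existingTopics and clusterConfig
# are made up, since the schema types them only as "object".
example_request = {
    "requirements": {
        "throughputMBps": 50,
        "messageSize": 1024,
        "retentionDays": 7,
        "orderingRequirements": "per-key",
    },
    "existingTopics": [{"name": "orders.order.created.v1", "partitions": 12}],
    "clusterConfig": {"brokers": 3},
}
print(validate_request(example_request))
```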

Output Schema

{
  "topicDesign": {
    "name": "string",
    "partitions": "number",
    "replicationFactor": "number",
    "configs": "object"
  },
  "schemaDefinition": "object",
  "producerConfig": "object",
  "consumerConfig": "object"
}
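
To make the output shape concrete, here is a minimal sketch that assembles a result with the four top-level keys from the schema. The config keys (`retention.ms`, `cleanup.policy`, `min.insync.replicas`, `acks`, `enable.idempotence`, `group.id`, `auto.offset.reset`, `enable.auto.commit`) are standard Kafka settings, but the values chosen, the function name, and the placeholder schema are assumptions for illustration, not the skill's actual output.

```python
MS_PER_DAY = 24 * 60 * 60 * 1000


def build_design(name: str, partitions: int, replication_factor: int,
                 retention_days: float) -> dict:
    """Assemble a dict shaped like the Output Schema."""
    return {
        "topicDesign": {
            "name": name,
            "partitions": partitions,
            "replicationFactor": replication_factor,
            "configs": {
                # Kafka expresses time-based retention in milliseconds.
                "retention.ms": str(int(retention_days * MS_PER_DAY)),
                "cleanup.policy": "delete",
                # Assumed policy: tolerate one replica being out of sync.
                "min.insync.replicas": str(max(1, replication_factor - 1)),
            },
        },
        # Placeholder Avro-style record; the real schema depends on the data.
        "schemaDefinition": {"type": "record", "name": "Event", "fields": []},
        # Assumed durability-first producer settings.
        "producerConfig": {"acks": "all", "enable.idempotence": True},
        # Assumed consumer settings with manual offset commits.
        "consumerConfig": {
            "group.id": f"{name}-consumers",
            "auto.offset.reset": "earliest",
            "enable.auto.commit": False,
        },
    }


design = build_design("orders.order.created.v1", 12, 3, 7)
print(design["topicDesign"]["configs"])
```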

Target Processes

  • Streaming Pipeline
  • ETL/ELT Pipeline (CDC)
  • Feature Store Setup

Usage Guidelines

  1. Define throughput and latency requirements
  2. Specify message size and retention needs
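  3. Document ordering requirements (per-key, per-partition, or global); Kafka guarantees order only within a partition, so global ordering implies a single partition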
  4. Include existing topic configurations for consistency
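
The ordering guideline matters because records with the same key are routed to the same partition, and order is preserved only within a partition. The sketch below illustrates the idea with a stand-in hash. It uses CRC32 purely for demonstration, whereas Kafka's default partitioner uses murmur2, so the partition numbers will not match a real cluster. The function name and the events are hypothetical.

```python
import zlib


def partition_for(key: str, num_partitions: int) -> int:
    """Map a key to a partition with a stable hash (CRC32 stand-in)."""
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    return zlib.crc32(key.encode("utf-8")) % num_partitions


# Hypothetical events: (key, payload) pairs for two customers.
events = [
    ("customer-1", "created"),
    ("customer-2", "created"),
    ("customer-1", "paid"),
    ("customer-1", "shipped"),
]

# Append each event to the log of the partition its key hashes to.
partitions = {}
for key, payload in events:
    partitions.setdefault(partition_for(key, 6), []).append((key, payload))

# All events for one key share a partition, so their relative order holds.
customer_1 = [
    payload
    for log in partitions.values()
    for key, payload in log
    if key == "customer-1"
]
print(customer_1)
```

With a single partition every record lands in the same log, which is how global ordering is obtained at the cost of parallelism.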

Best Practices

  • Use consistent naming conventions across all topics
  • Choose the partition count based on consumer parallelism needs; within a consumer group, each partition is read by at most one consumer
  • Configure appropriate retention for compliance and replay scenarios
  • Use a schema registry to manage schema evolution
  • Restrict topic access with ACLs, granting each producer and consumer only the operations it needs
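
A naming convention is easier to keep consistent when it is checked mechanically. The sketch below first applies Kafka's own rules (ASCII letters, digits, `.`, `_`, and `-`, at most 249 characters, and not `.` or `..`), then a stricter `<domain>.<entity>.<event>.v<N>` pattern. That four-part pattern is a hypothetical house convention chosen for illustration; neither Kafka nor this skill mandates it.

```python
import re

# Characters and length that Kafka accepts in a topic name.
KAFKA_LEGAL = re.compile(r"[A-Za-z0-9._-]{1,249}")

# Hypothetical convention: lowercase dotted segments plus a version suffix.
SEGMENT = r"[a-z][a-z0-9-]*"
CONVENTION = re.compile(rf"{SEGMENT}\.{SEGMENT}\.{SEGMENT}\.v[1-9][0-9]*")


def check_topic_name(name: str) -> list:
    """Return a list of problems; an empty list means the name passes."""
    problems = []
    if name in (".", "..") or not KAFKA_LEGAL.fullmatch(name):
        problems.append("not a legal Kafka topic name")
    if not CONVENTION.fullmatch(name):
        problems.append("does not follow <domain>.<entity>.<event>.v<N>")
    return problems


for candidate in ["orders.order.created.v1", "Orders_Created", "orders created"]:
    print(candidate, check_topic_name(candidate))
```

A check like this could run in CI or in a topic-provisioning script, so that non-conforming names are rejected before a topic is created.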