Babysitter constitutional-ai-prompts

Constitutional AI and safety guardrail prompts for aligned LLM behavior

install
source · Clone the upstream repo
git clone https://github.com/a5c-ai/babysitter
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/ai-agents-conversational/skills/constitutional-ai-prompts" ~/.claude/skills/a5c-ai-babysitter-constitutional-ai-prompts && rm -rf "$T"
manifest: library/specializations/ai-agents-conversational/skills/constitutional-ai-prompts/SKILL.md

Constitutional AI Prompts Skill

Capabilities

  • Design constitutional AI principles
  • Implement self-critique and revision prompts
  • Create harmlessness guidelines
  • Design refusal patterns for unsafe requests
  • Implement red-team testing prompts
  • Create ethics-aware response frameworks

Target Processes

  • system-prompt-guardrails
  • content-moderation-safety

Implementation Details

Constitutional Patterns

  1. Critique-Revision: The model evaluates its own draft against the principles, then revises it
  2. Principle Adherence: Responses follow an explicitly defined set of ethical principles
  3. Harmlessness Focus: Prioritize safe responses
  4. Helpfulness Balance: Stay as helpful as possible within safety constraints
  5. Transparency: Acknowledge limitations
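
The critique-revision pattern can be sketched as a small loop. This is a minimal illustration under stated assumptions, not the skill's actual implementation: the `critique_and_revise` function, the prompt wording, the sample principles, and the `NO_ISSUES` sentinel are all hypothetical, and `model` stands for any callable that maps a prompt string to a completion string.

```python
from typing import Callable, List

# Hypothetical principles; a real constitution would be project-specific.
PRINCIPLES: List[str] = [
    "Do not provide instructions that facilitate serious harm.",
    "Acknowledge uncertainty and limitations honestly.",
]

# Assumed sentinel the critic returns when the draft needs no changes.
NO_ISSUES = "NO_ISSUES"


def critique_and_revise(
    model: Callable[[str], str],
    user_request: str,
    principles: List[str] = PRINCIPLES,
    max_rounds: int = 2,
) -> str:
    """Draft a response, then critique and revise it against each principle."""
    draft = model(f"Respond to the user request:\n{user_request}")
    for _ in range(max_rounds):
        revised = False
        for principle in principles:
            critique = model(
                f"Principle: {principle}\nResponse: {draft}\n"
                f"Explain any violation, or reply {NO_ISSUES}."
            )
            if critique.strip() == NO_ISSUES:
                continue
            # The critic found a problem: ask for a rewrite that addresses it.
            draft = model(
                "Rewrite the response to address the critique.\n"
                f"Critique: {critique}\nResponse: {draft}"
            )
            revised = True
        if not revised:
            break  # a full pass with no critiques means the draft is stable
    return draft
```

Capping the loop with `max_rounds` is one possible design choice: it bounds cost and avoids endless revision when the critic never returns the sentinel.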

Configuration Options

  • Constitutional principles list
  • Critique prompts
  • Revision guidelines
  • Refusal templates
  • Escalation triggers
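
One way to group these options is a single configuration object. The sketch below is an assumption about shape only: `ConstitutionConfig`, its field names, its default strings, and its helper methods are hypothetical and are not taken from the skill or from `langchain-core`.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ConstitutionConfig:
    """Hypothetical container for the configuration options listed above."""

    principles: List[str]
    critique_prompt: str = (
        "Does this response violate the principle '{principle}'?\n{response}"
    )
    revision_guidelines: str = (
        "Rewrite the response so it complies while staying helpful."
    )
    refusal_templates: Dict[str, str] = field(default_factory=dict)
    escalation_triggers: List[str] = field(default_factory=list)

    def render_critique(self, principle: str, response: str) -> str:
        """Fill the critique prompt template for one principle."""
        return self.critique_prompt.format(principle=principle, response=response)

    def refusal_for(self, category: str) -> str:
        """Pick the refusal template for a category, falling back to 'default'."""
        fallback = self.refusal_templates.get(
            "default", "I can't help with that request."
        )
        return self.refusal_templates.get(category, fallback)

    def should_escalate(self, text: str) -> bool:
        """Case-insensitive substring check against the escalation triggers."""
        lowered = text.lower()
        return any(t.lower() in lowered for t in self.escalation_triggers)


# Illustrative values only.
example_config = ConstitutionConfig(
    principles=["Avoid facilitating harm."],
    refusal_templates={
        "default": "I can't help with that.",
        "weapons": "I can't help with weapons-related requests.",
    },
    escalation_triggers=["self-harm"],
)
```

Substring matching for escalation is deliberately simple here; a production setup would more likely rely on a classifier or a moderation service.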

Best Practices

  • Define clear constitutional principles
  • Balance helpfulness and safety
  • Test with adversarial inputs
  • Document refusal patterns
  • Review principles regularly
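
Testing with adversarial inputs can be sketched as a small harness that checks both under-refusal and over-refusal. Everything here is an assumption for illustration: `run_red_team`, `looks_like_refusal`, and the `REFUSAL_MARKERS` phrases are hypothetical, and `model` is any callable from prompt string to response string. Keyword matching is a crude refusal detector and would likely need replacing with a classifier or human review.

```python
from typing import Callable, Iterable, List, Tuple

# Assumed refusal phrasing; adjust to match the documented refusal templates.
REFUSAL_MARKERS = ("i can't help", "i cannot help", "i won't assist")


def looks_like_refusal(response: str) -> bool:
    """Heuristic: treat a response as a refusal if it contains a known marker."""
    lowered = response.lower()
    return any(marker in lowered for marker in REFUSAL_MARKERS)


def run_red_team(
    model: Callable[[str], str],
    cases: Iterable[Tuple[str, bool]],
) -> List[str]:
    """Return prompts whose refusal behavior did not match expectations.

    Each case is a (prompt, should_refuse) pair, so benign prompts that get
    refused are reported alongside unsafe prompts that get answered.
    """
    failures = []
    for prompt, should_refuse in cases:
        if looks_like_refusal(model(prompt)) != should_refuse:
            failures.append(prompt)
    return failures
```

Including benign prompts with `should_refuse=False` keeps the suite honest about the helpfulness side of the balance, not only the safety side.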

Dependencies

  • langchain-core