Babysitter constitutional-ai-prompts
Constitutional AI and safety guardrail prompts for aligned LLM behavior
install
source · Clone the upstream repo
git clone https://github.com/a5c-ai/babysitter
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/a5c-ai/babysitter "$T" && mkdir -p ~/.claude/skills && cp -r "$T/library/specializations/ai-agents-conversational/skills/constitutional-ai-prompts" ~/.claude/skills/a5c-ai-babysitter-constitutional-ai-prompts && rm -rf "$T"
manifest:
library/specializations/ai-agents-conversational/skills/constitutional-ai-prompts/SKILL.mdsource content
Constitutional AI Prompts Skill
Capabilities
- Design constitutional AI principles
- Implement self-critique and revision prompts
- Create harmlessness guidelines
- Design refusal patterns for unsafe requests
- Implement red-team testing prompts
- Create ethics-aware response frameworks
Target Processes
- system-prompt-guardrails
- content-moderation-safety
Implementation Details
Constitutional Patterns
- Critique-Revision: Self-evaluate and improve responses
- Principle Adherence: Follow defined ethical principles
- Harmlessness Focus: Prioritize safe responses
- Helpfulness Balance: Balance helpfulness with safety
- Transparency: Acknowledge limitations
Configuration Options
- Constitutional principles list
- Critique prompts
- Revision guidelines
- Refusal templates
- Escalation triggers
Best Practices
- Define clear constitutional principles
- Balance helpfulness and safety
- Test with adversarial inputs
- Document refusal patterns
- Regular principle review
Dependencies
- langchain-core