Skills outlines
install
source · Clone the upstream repo
git clone https://github.com/TerminalSkills/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/TerminalSkills/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/outlines" ~/.claude/skills/terminalskills-skills-outlines && rm -rf "$T"
manifest:
skills/outlines/SKILL.mdsafety · automated scan (low risk)
This is a pattern-based risk scan, not a security review. Our crawler flagged:
- pip install
Always read a skill's source content before installing. Patterns alone don't mean the skill is malicious — but they warrant attention.
source content
Outlines — Structured Text Generation
You are an expert in Outlines, the Python library for reliable structured text generation with LLMs. You help developers generate guaranteed-valid JSON, regex-matching text, and grammar-constrained output from open-source models — using finite state machine guided generation that constrains the token sampling process to produce only valid output on the first try.
Core Capabilities
Structured Generation
import outlines from pydantic import BaseModel, Field from enum import Enum # Load model model = outlines.models.transformers("meta-llama/Llama-3.1-8B-Instruct") # JSON generation with Pydantic schema class Sentiment(str, Enum): positive = "positive" negative = "negative" neutral = "neutral" class ReviewAnalysis(BaseModel): sentiment: Sentiment score: float = Field(ge=0, le=1) topics: list[str] = Field(min_length=1, max_length=5) summary: str = Field(max_length=200) generator = outlines.generate.json(model, ReviewAnalysis) result = generator( "Analyze this review: 'Great product, fast shipping, but packaging could be better'" ) # result is a validated ReviewAnalysis instance — guaranteed to match schema print(result.sentiment) # Sentiment.positive print(result.score) # 0.85 print(result.topics) # ["product quality", "shipping", "packaging"] # Regex-constrained generation phone_gen = outlines.generate.regex(model, r"\(\d{3}\) \d{3}-\d{4}") phone = phone_gen("Generate a US phone number:") # phone = "(415) 555-0123" — always matches the regex # Choice (classification) classifier = outlines.generate.choice(model, ["spam", "ham", "uncertain"]) result = classifier("Is this spam? 'You won $1000000!!!'") # result = "spam" # Format-constrained (date, number, etc.) date_gen = outlines.generate.format(model, datetime.date) date = date_gen("When was Python created?") # date = datetime.date(1991, 2, 20) — always a valid date object
Batch Processing
# Batch inference for throughput generator = outlines.generate.json(model, ReviewAnalysis) reviews = [ "Amazing quality, will buy again!", "Terrible customer service, never ordering here.", "It's okay, nothing special.", ] prompts = [f"Analyze: '{r}'" for r in reviews] results = generator(prompts, max_tokens=200) # results is a list of ReviewAnalysis objects — all guaranteed valid
Grammar-Constrained
# Custom grammar (CFG) arithmetic_grammar = r""" ?start: expression ?expression: term (("+" | "-") term)* ?term: factor (("*" | "/") factor)* ?factor: NUMBER | "(" expression ")" NUMBER: /[0-9]+(\.[0-9]+)?/ """ calc_gen = outlines.generate.cfg(model, arithmetic_grammar) expr = calc_gen("Generate a math expression that equals 42:") # expr = "(6 * 7)" — always valid arithmetic
With vLLM
# Use with vLLM for production throughput model = outlines.models.vllm("meta-llama/Llama-3.1-8B-Instruct", tensor_parallel_size=1, gpu_memory_utilization=0.9) generator = outlines.generate.json(model, ReviewAnalysis) # Combines Outlines' constrained generation with vLLM's batching + PagedAttention
Installation
pip install outlines
Best Practices
- Pydantic schemas — Define output with Pydantic models; Outlines compiles to FSM for guaranteed compliance
- Regex for patterns — Use
for dates, emails, IDs; output always matches the patterngenerate.regex() - Choice for classification — Use
instead of free text; constrained to exact optionsgenerate.choice() - vLLM for production — Combine with vLLM backend for high-throughput constrained generation
- Batch for efficiency — Pass lists of prompts; Outlines batches efficiently with the model
- Field constraints — Use Pydantic's
,ge
,le
,min_length
; further constrains outputmax_length - Grammar for DSLs — Use CFG grammars for domain-specific output (SQL, code, formulas)
- First-try guarantee — Unlike retry-based approaches, Outlines gets valid output on the first generation