Flow operability

Operations - deployment, rollback, feature flags. Use for ops tooling.

install

source · Clone the upstream repo

git clone https://github.com/SylphxAI/flow

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/SylphxAI/flow "$T" && mkdir -p ~/.claude/skills && cp -r "$T/.claude/skills/operability" ~/.claude/skills/sylphxai-flow-operability && rm -rf "$T"

manifest: .claude/skills/operability/SKILL.md

source content

Operability Guideline

Tech Stack

Workflows: Upstash Workflows + QStash
Cache: Upstash Redis
Platform: Vercel

Non-Negotiables

Dead-letter handling must exist and be operable (visible, replayable)
Side-effects (email, billing, ledger) must be idempotent or safely re-entrant
Drift alerts must have remediation playbooks

Context

Operability is about running the system in production — not just building it. Systems fail. Jobs get stuck. State drifts. The question is: when something goes wrong, can an operator fix it without deploying code?

Consider the operator experience during an incident. What tools do they have? What runbooks exist? Can they safely retry failed jobs? Can they detect and fix drift?

Driving Questions

What happens when a job fails permanently?
How would an operator know something is stuck?
Can failed workflows be safely replayed without duplicating side-effects?
What drift can occur between systems, and how would we detect it?
What's the rollback plan if a deploy breaks something critical?
What runbooks exist, and what runbooks should exist but don't?