Flow operability

Operations - deployment, rollback, feature flags. Use for ops tooling.

install
source · Clone the upstream repo
git clone https://github.com/SylphxAI/flow
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/SylphxAI/flow "$T" && mkdir -p ~/.claude/skills && cp -r "$T/.claude/skills/operability" ~/.claude/skills/sylphxai-flow-operability && rm -rf "$T"
manifest: .claude/skills/operability/SKILL.md
source content

Operability Guideline

Tech Stack

  • Workflows: Upstash Workflows + QStash
  • Cache: Upstash Redis
  • Platform: Vercel

Non-Negotiables

  • Dead-letter handling must exist and be operable (visible, replayable)
  • Side-effects (email, billing, ledger) must be idempotent or safely re-entrant
  • Drift alerts must have remediation playbooks

Context

Operability is about running the system in production — not just building it. Systems fail. Jobs get stuck. State drifts. The question is: when something goes wrong, can an operator fix it without deploying code?

Consider the operator experience during an incident. What tools do they have? What runbooks exist? Can they safely retry failed jobs? Can they detect and fix drift?

Driving Questions

  • What happens when a job fails permanently?
  • How would an operator know something is stuck?
  • Can failed workflows be safely replayed without duplicating side-effects?
  • What drift can occur between systems, and how would we detect it?
  • What's the rollback plan if a deploy breaks something critical?
  • What runbooks exist, and what runbooks should exist but don't?