Flow trust-safety

Trust and safety: abuse prevention and rate limiting. Use when defending a platform against bad actors.

install

source · Clone the upstream repo

git clone https://github.com/SylphxAI/flow

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/SylphxAI/flow "$T" && mkdir -p ~/.claude/skills && cp -r "$T/.claude/skills/trust-safety" ~/.claude/skills/sylphxai-flow-trust-safety && rm -rf "$T"

manifest: .claude/skills/trust-safety/SKILL.md

source content

Trust Safety Guideline

Tech Stack

Analytics: PostHog
Database: Neon (Postgres)
Workflows: Upstash Workflow + QStash

Non-Negotiables

All enforcement actions must be auditable (who/when/why)
Appeals process must exist for affected users
Graduated response levels must be defined (warn → restrict → suspend → ban)
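
As a rough illustration of how the audit and graduated-response requirements above could fit together, here is a minimal TypeScript sketch. Everything in it is an assumption made for illustration and not part of Flow: the names `EnforcementLevel`, `AuditRecord`, `NEXT_LEVEL`, and `enforce`, and the rule that each new action moves a user exactly one step up the ladder. The log is an in-memory array; a real system would presumably persist it (for example in the Postgres database listed in the tech stack). Appeals are not modeled here.

```typescript
// Hypothetical sketch: all names and shapes are illustrative assumptions.

type EnforcementLevel = "warn" | "restrict" | "suspend" | "ban";

// The ladder from the guideline: warn -> restrict -> suspend -> ban.
// "ban" is terminal, so it maps to itself.
const NEXT_LEVEL: Record<EnforcementLevel, EnforcementLevel> = {
  warn: "restrict",
  restrict: "suspend",
  suspend: "ban",
  ban: "ban",
};

// One audit entry per enforcement action, covering who/when/why.
interface AuditRecord {
  userId: string;          // who was acted on
  actorId: string;         // who acted (a moderator id, or "system" for automation)
  level: EnforcementLevel; // what was applied
  reason: string;          // why
  at: string;              // when, as an ISO 8601 timestamp
}

// Apply the next enforcement level for a user and append an audit record.
// Assumes `log` is append-only and ordered oldest to newest.
function enforce(
  log: AuditRecord[],
  userId: string,
  actorId: string,
  reason: string,
  now: Date = new Date(),
): AuditRecord {
  if (reason.trim() === "") {
    // An action without a stated reason would not be auditable.
    throw new Error("enforcement reason is required");
  }

  // Find the most recent level applied to this user, if any.
  let last: EnforcementLevel | null = null;
  for (const entry of log) {
    if (entry.userId === userId) last = entry.level;
  }

  const record: AuditRecord = {
    userId,
    actorId,
    level: last === null ? "warn" : NEXT_LEVEL[last],
    reason,
    at: now.toISOString(),
  };
  log.push(record);
  return record;
}
```

Calling `enforce(log, "u1", "mod-7", "spam links")` on an empty log yields a `warn` record; a second call for the same user yields `restrict`. One-step escalation is the simplest policy that respects the ladder. A real policy might skip levels for severe abuse or let old actions expire, and either choice should still leave a record in the log.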

Context

Trust & safety is about protecting users — from each other and from malicious actors. Every platform eventually attracts abuse. The question is whether you're prepared for it or scrambling to react.

Consider: what would a bad actor try to do? How would we detect it? How would we respond? What about the false positives — innocent users caught by automated systems? A good T&S system is effective against abuse AND fair to legitimate users.

Driving Questions

What would a motivated bad actor try to do on this platform?
How would we detect coordinated abuse or bot networks?
What happens when automated moderation gets it wrong?
How do affected users appeal decisions, and is it fair?
What abuse patterns exist that we haven't addressed?
What would make users trust that we're protecting them?