Skills muapi-seedance-2
Expert Cinema Director skill for Seedance 2.0 (ByteDance) — high-fidelity video generation using technical camera grammar and multimodal references. Supports text-to-video, image-to-video, and video extension.
git clone https://github.com/openclaw/skills
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/anil-matcha/muapi-seedance-2" ~/.claude/skills/clawdbot-skills-muapi-seedance-2 && rm -rf "$T"
skills/anil-matcha/muapi-seedance-2/SKILL.md
🎬 Seedance 2.0 Cinema Expert
The definitive skill for "Director-Level" AI video orchestration. Seedance 2.0 is not a descriptive model; it is an instructional model. It responds best to technical cinematography, physics directives, and precise camera grammar.
Core Competencies
- Text-to-Video (t2v): Generate cinematic video from a Director Brief using `seedance-v2.0-t2v`.
- Image-to-Video (i2v): Animate 1–9 reference images into a video using `seedance-v2.0-i2v`.
- Video Extension (extend): Seamlessly continue an existing Seedance 2.0 video using `seedance-v2.0-extend`.
- Multimodal Referencing: Utilize the `@tag` system (`@image1`, `@video1`) for style, motion, and rhythm locking.
- Audio-Visual Sync: Native high-fidelity sound generation synchronized with visual motion.
- Temporal Consistency: Maintain character, clothing, and environment stability across shots.
🏗️ Technical Specification: The Director Brief
To get professional results, ALWAYS structure the prompt using this hierarchy:
| Component | Instruction Type | Example |
|---|---|---|
| Scene | Environment + Lighting | "A rain-soaked cyberpunk street, magenta neon reflections on wet asphalt." |
| Subject | Identity + Detail | "A woman in a black trenchcoat, determined focus, cinematic skin textures." |
| Action | Fluid Interaction | "Walking forward through the crowd, coat billowing slightly in the wind." |
| Camera | Movement + Lens | "Medium tracking shot, 35mm lens, slow dolly backward. Subtle handheld jitter." |
| Style | Mood + Intent | "Cinematic epic, warm color grade, shallow DOF, rack focus to subject's face." |
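As a minimal sketch of the hierarchy above, the five components can be joined in order into a single prompt string. The `build_brief` function is a hypothetical helper for illustration only; it is not part of the skill's scripts.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the skill): joins the five Director Brief
# components in the order Scene, Subject, Action, Camera, Style.
build_brief() {
  if [ "$#" -ne 5 ]; then
    echo "usage: build_brief SCENE SUBJECT ACTION CAMERA STYLE" >&2
    return 1
  fi
  local part
  for part in "$@"; do
    if [ -z "$part" ]; then
      echo "build_brief: every component must be non-empty" >&2
      return 1
    fi
  done
  # The format string consumes the five arguments in order.
  printf '%s %s %s %s %s\n' "$@"
}

build_brief \
  "A rain-soaked cyberpunk street, magenta neon reflections on wet asphalt." \
  "A woman in a black trenchcoat, determined focus, cinematic skin textures." \
  "Walking forward through the crowd, coat billowing slightly in the wind." \
  "Medium tracking shot, 35mm lens, slow dolly backward. Subtle handheld jitter." \
  "Cinematic epic, warm color grade, shallow DOF, rack focus to subject's face."
```

Rejecting a missing or empty component keeps an incomplete brief from reaching the model unnoticed.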
🧠 Prompt Optimization Protocol
The Agent MUST transform user intent into a technical "Director Brief" before execution.
- Technical Grammar: Use camera terms: Dolly In/Out, Crane Shot, Whip Pan, Tracking Shot, Anamorphic Lens, Shallow Depth of Field.
- Physics Directives: Use "caustic patterns," "volumetric rays," or "subsurface scattering" instead of "good lighting."
- Timecode Notation: For multi-beat scenes, use the `[00:00-00:05s]` format to specify timing.
- Tag References: If files are provided, use: "Replicate the camera movement of @video1 while maintaining the visual style of @image1."
- ORDER MATTERS: Tokens at the start define composition; tokens at the end define texture and micro-motion.
- Multi-Image i2v: Provide up to 9 reference images. The model blends aspects (style, identity, environment) across all inputs.
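The timecode notation can be generated rather than typed by hand. The sketch below assumes beats are given as whole-second start and end offsets; `beat` is a hypothetical helper name, and the beat descriptions are invented examples.

```bash
#!/usr/bin/env bash
# Hypothetical helper: format one beat as "[MM:SS-MM:SSs] description".
# Arguments: start offset in seconds, end offset in seconds, description.
beat() {
  local start="$1" end="$2" text="$3"
  printf '[%02d:%02d-%02d:%02ds] %s\n' \
    $((start / 60)) $((start % 60)) $((end / 60)) $((end % 60)) "$text"
}

# Two beats of a 10-second shot, the second one using a tag reference.
beat 0 5 "Slow dolly in on the temple gate, volumetric rays through mist."
beat 5 10 "Crane shot rises above the canopy, replicating the camera movement of @video1."
```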
🚀 Protocol: Using Seedance 2
Mode 1: Text-to-Video (t2v)
```bash
# Epic reveal shot
bash scripts/generate-seedance.sh \
  --subject "a hidden temple in the Andes, mist rolling through the canopy" \
  --intent "epic" \
  --aspect "16:9" \
  --duration 10 \
  --quality high \
  --view

# Tense close-up, vertical for social
bash scripts/generate-seedance.sh \
  --subject "a detective examines a cryptic clue under harsh lamp light" \
  --intent "tense" \
  --aspect "9:16" \
  --duration 5
```
Mode 2: Image-to-Video (i2v)
Animate one or more reference images. Up to 9 images can be supplied — the model synthesizes motion across all of them.
```bash
# Animate a single local image
bash scripts/generate-seedance.sh \
  --mode i2v \
  --file hero.jpg \
  --subject "hero strides forward, coat billowing in slow motion" \
  --intent "epic" \
  --aspect "16:9" \
  --view

# Animate from a URL
bash scripts/generate-seedance.sh \
  --mode i2v \
  --image "https://example.com/scene.jpg" \
  --subject "camera slowly pulls back to reveal the full landscape" \
  --intent "reveal" \
  --duration 10

# Multi-image blending (character + environment + style reference)
bash scripts/generate-seedance.sh \
  --mode i2v \
  --file character.jpg \
  --file environment.jpg \
  --image "https://example.com/style.jpg" \
  --subject "character walks through the environment in cinematic style" \
  --quality high
```
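When the reference list is built dynamically, it may help to check the 1–9 limit and pick `--file` or `--image` per reference before calling the script. The sketch below is one way to do that; `i2v_ref_args` is a hypothetical helper, and it assumes any reference starting with `http://` or `https://` is a URL and everything else is a local path.

```bash
#!/usr/bin/env bash
# Hypothetical helper: turn a list of references into --file/--image flags,
# enforcing the 1-9 image limit. Prints one flag or value per line.
i2v_ref_args() {
  if [ "$#" -lt 1 ] || [ "$#" -gt 9 ]; then
    echo "i2v expects 1 to 9 reference images, got $#" >&2
    return 1
  fi
  local ref
  for ref in "$@"; do
    case "$ref" in
      http://*|https://*) printf '%s\n%s\n' --image "$ref" ;;  # remote URL
      *)                  printf '%s\n%s\n' --file "$ref" ;;   # local path
    esac
  done
}

# Print the flags that would be passed for three mixed references.
i2v_ref_args character.jpg environment.jpg "https://example.com/style.jpg"
```

The printed lines can be read into an array and appended to a `generate-seedance.sh --mode i2v` call. Note that the one-item-per-line output assumes references contain no newlines.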
Mode 3: Extend Video
Continue an existing Seedance 2.0 video seamlessly, preserving visual style, motion, and audio.
```bash
# Extend with no new prompt (model continues naturally)
bash scripts/generate-seedance.sh \
  --mode extend \
  --request-id "abc-123-def-456" \
  --duration 10

# Extend with directional prompt
bash scripts/generate-seedance.sh \
  --mode extend \
  --request-id "abc-123-def-456" \
  --subject "camera continues to pull back, revealing the vast city below" \
  --intent "reveal" \
  --duration 10 \
  --quality high \
  --view
```
Async Pattern (for long jobs)
```bash
# Submit and get request_id immediately
RESULT=$(bash scripts/generate-seedance.sh --mode i2v --file photo.jpg --async --json)
REQUEST_ID=$(echo "$RESULT" | jq -r '.request_id')

# Check later
bash ../../../../core/media/generate-video.sh --result "$REQUEST_ID"
```
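"Check later" usually ends up as a polling loop. The sketch below shows a generic one under stated assumptions: the output format of `generate-video.sh --result` is not documented here, so the loop takes any status command and treats the strings `completed` and `failed` as terminal states. Those strings, `poll_until_done`, and `fake_status` are all hypothetical and would need adapting to the real output.

```bash
#!/usr/bin/env bash
# Hypothetical helper: re-run a status command until it prints a terminal state.
# Usage: poll_until_done MAX_ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
# The "completed"/"failed" strings are assumptions, not documented output.
poll_until_done() {
  local max_attempts="$1" delay="$2"
  shift 2
  local attempt status
  for ((attempt = 1; attempt <= max_attempts; attempt++)); do
    status=$("$@")
    case "$status" in
      completed) echo "completed after $attempt attempt(s)"; return 0 ;;
      failed)    echo "job failed" >&2; return 1 ;;
    esac
    sleep "$delay"
  done
  echo "timed out after $max_attempts attempts" >&2
  return 2
}

# Stand-in for a real status check: reports "processing" twice, then "completed".
# State lives in a temp file because $(...) runs the command in a subshell.
counter_file=$(mktemp)
echo 0 > "$counter_file"
fake_status() {
  local n
  n=$(( $(cat "$counter_file") + 1 ))
  echo "$n" > "$counter_file"
  if [ "$n" -ge 3 ]; then echo completed; else echo processing; fi
}

poll_until_done 5 0 fake_status
rm -f "$counter_file"
```

In real use, the status command would wrap the `--result "$REQUEST_ID"` call and extract a status field from its output, with a delay of several seconds between attempts.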
⚠️ Constraints & Guardrails
- No Keyword Soup: DO NOT use "8k, masterpiece, trending." Use technical descriptions: "High-fidelity production grade, 24fps, cinematic grain."
- Continuous Action: Describe one fluid motion. Avoid "The man runs and then he stops." Use "The man gradually transitions from a sprint to a sudden stop, chest heaving."
- Face Stability: For consistent characters: "Maintain high character consistency, zero facial flicker, persistent clothing details."
- Extension Only Works on v2.0: `--mode extend` requires a `request_id` from a previous `seedance-v2.0-t2v` or `seedance-v2.0-i2v` job.
- Aspect Ratios: 16:9, 9:16, 4:3, 3:4 (Seedance 2.0 supports all four).
- Duration: 5, 10, or 15 seconds.
- Quality: `basic` (faster) or `high` (higher fidelity).
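The aspect ratio, duration, and quality limits above are easy to check before submitting a job. The following is a minimal sketch of such a pre-flight check; `validate_seedance_args` is a hypothetical helper, and whether `generate-seedance.sh` performs its own validation is not stated here.

```bash
#!/usr/bin/env bash
# Hypothetical pre-flight check for the documented limits.
# Usage: validate_seedance_args ASPECT DURATION QUALITY
validate_seedance_args() {
  local aspect="$1" duration="$2" quality="$3"
  case "$aspect" in
    16:9|9:16|4:3|3:4) ;;
    *) echo "unsupported aspect ratio: $aspect" >&2; return 1 ;;
  esac
  case "$duration" in
    5|10|15) ;;
    *) echo "unsupported duration: $duration (use 5, 10, or 15)" >&2; return 1 ;;
  esac
  case "$quality" in
    basic|high) ;;
    *) echo "unsupported quality: $quality (use basic or high)" >&2; return 1 ;;
  esac
}

validate_seedance_args "16:9" 10 high && echo "ok"
validate_seedance_args "21:9" 10 high || echo "rejected"
```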
🎭 Prompt Templates (from awesome-seedance community)
Cinematic Film Styles
```
[SCENE] Rain-soaked cyberpunk alley, neon signs reflected on wet cobblestones.
[SUBJECT] A lone figure in a weathered trench coat, face obscured by a wide-brim hat.
[ACTION] Walking slowly, each step splashing neon color into the puddles.
[CAMERA] Low-angle tracking shot, anamorphic lens, slow dolly in. Rack focus to face.
[STYLE] Denis Villeneuve aesthetic, high contrast, desaturated blues and magentas. 24fps.
```
Advertising / Product Motion
```
[SCENE] Minimalist white studio, single product on a rotating pedestal.
[ACTION] Subtle 360° rotation, product details catching specular highlights.
[CAMERA] Tight medium shot, macro lens pass over surface texture, slow orbit.
[STYLE] Commercial grade, perfect exposure, zero background distraction.
```
Action / Physics
```
[SCENE] Desert canyon at sunrise, sandy terrain, long shadows.
[SUBJECT] High-performance sports car accelerating through a turn.
[ACTION] Rear wheels spinning with dust plume, chassis flexing under g-force.
[CAMERA] Low hero angle dolly tracking alongside, then whip pan to lead car.
[STYLE] Hollywood racing film, warm golden grade, motion blur on wheels. 24fps.
```
Character Consistency (Martial Arts / Action)
```
[SUBJECT] Same fighter throughout: young woman, white gi, black belt, determined expression.
[ACTION] Fluid kata sequence: rising block, stepping side kick, spinning back fist.
[CAMERA] Full-body wide shot, then cut to close-up of fist impact in slow motion.
[STYLE] Maintain identical lighting, clothing, and facial features in every frame. Zero flicker.
```
⚙️ Implementation Details
| Model | Endpoint | Use Case |
|---|---|---|
| `seedance-v2.0-t2v` | Text-to-Video | Generate from Director Brief |
| `seedance-v2.0-i2v` | Image-to-Video | Animate 1–9 reference images |
| `seedance-v2.0-extend` | Extend Video | Continue a v2.0 generated video |
This skill acts as a Cinematographic Wrapper that translates high-level creative intent into high-fidelity technical instructions for the muapi core.
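For reference, the correspondence between the `--mode` values used in the examples and the three model identifiers can be written as a small lookup. This is only an illustrative sketch: `model_for_mode` is a hypothetical function, and how `generate-seedance.sh` resolves modes internally is not shown in this document.

```bash
#!/usr/bin/env bash
# Hypothetical lookup: map a --mode value to the model identifier named above.
model_for_mode() {
  case "$1" in
    t2v)    echo "seedance-v2.0-t2v" ;;
    i2v)    echo "seedance-v2.0-i2v" ;;
    extend) echo "seedance-v2.0-extend" ;;
    *)      echo "unknown mode: $1" >&2; return 1 ;;
  esac
}

# Print the mapping for all three modes.
for mode in t2v i2v extend; do
  printf '%s -> %s\n' "$mode" "$(model_for_mode "$mode")"
done
```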