muapi-seedance-2

Expert Cinema Director skill for Seedance 2.0 (ByteDance) — high-fidelity video generation across Chinese, Global, and VIP tiers. Supports text-to-video, image-to-video, first-last-frame, omni reference, character training, omni-reference training, video editing, and watermark removal.

What this skill does


# 🎬 Seedance 2.0 Cinema Expert

**The definitive skill for "Director-Level" AI video orchestration.**
Seedance 2.0 is not a descriptive model; it is an *instructional* model. It responds best to technical cinematography, physics directives, and precise camera grammar.

## Core Competencies

1.  **Text-to-Video (t2v)**: Generate cinematic video from a Director Brief — Chinese, Global, or VIP tier.
2.  **Image-to-Video (i2v)**: Animate 1–9 reference images — Chinese, Global (smart mode), or VIP tier.
3.  **Video Extension (extend)**: Seamlessly continue an existing Seedance 2.0 video (Chinese tier).
4.  **First & Last Frame (first-last)**: Interpolate a fluid video between a start image and end image (Global/VIP).
5.  **Omni Reference (omni)**: Full multimodal reference with images + audio + character refs (all tiers).
6.  **Omni Reference Training (omni-train)**: Train a custom persistent character for identity-consistent generation.
7.  **Character Sheet (character)**: Build a reusable character from 1–3 images (Chinese tier).
8.  **Video Edit (video-edit)**: Edit an existing video with a prompt + optional reference images (Chinese tier).
9.  **Watermark Removal (watermark-remove)**: Strip Seedance 2.0 watermarks (basic or Pro).

---

## 🏷️ Tiers

| Tier | Flag | Censorship | Aspect Ratios | Duration | Quality param |
|:---|:---|:---|:---|:---|:---|
| **Chinese** (default) | `--tier chinese` | Low | 16:9, 9:16, 4:3, 3:4 | 5 / 10 / 15 s | Yes (basic/high) |
| **Global** | `--tier global` | Standard | Chinese-tier ratios + 21:9, 1:1 | Any 4–15 s | No |
| **VIP** | `--tier vip` | Low | Chinese-tier ratios + 21:9, 1:1 | Any 4–15 s | No |

Add `--fast` to any Global or VIP call to use the fast-queue variant (lower latency, same quality).
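
The tier table can be read as a small set of pre-flight checks. The sketch below is illustrative only: `check_tier_request` is a hypothetical helper, not part of the skill, and it assumes that "+ 21:9, 1:1" means the Chinese-tier ratios plus those two, that durations are whole seconds, and that `--fast` is rejected on the Chinese tier because the text documents it for Global and VIP only.

```python
from __future__ import annotations

# Ratios listed for the Chinese tier; Global/VIP are assumed to add two more.
BASE_RATIOS = {"16:9", "9:16", "4:3", "3:4"}
WIDE_RATIOS = BASE_RATIOS | {"21:9", "1:1"}


def check_tier_request(tier: str, aspect_ratio: str, duration: int,
                       fast: bool = False) -> list[str]:
    """Return a list of problems; an empty list means the request fits the table."""
    if tier == "chinese":
        ratios = BASE_RATIOS
        duration_ok = duration in (5, 10, 15)
    elif tier in ("global", "vip"):
        ratios = WIDE_RATIOS
        duration_ok = 4 <= duration <= 15
    else:
        return [f"unknown tier: {tier}"]

    problems = []
    if aspect_ratio not in ratios:
        problems.append(f"{aspect_ratio} is not available on the {tier} tier")
    if not duration_ok:
        problems.append(f"{duration}s is not an allowed duration on the {tier} tier")
    if fast and tier == "chinese":
        # Assumption: the source only documents --fast for Global and VIP.
        problems.append("--fast is documented for Global and VIP calls only")
    return problems


if __name__ == "__main__":
    print(check_tier_request("chinese", "21:9", 7))
    print(check_tier_request("global", "21:9", 7, fast=True))
```

Returning a list of problems rather than raising on the first one lets a caller report every mismatch in a single pass.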

---

## 📥 Input Limits

| Input Type | Chinese i2v/omni | Global/VIP i2v/omni | Formats | Max Size |
|:---|:---|:---|:---|:---|
| Images | ≤ 9 | ≤ 9 | jpeg, png, webp | 30 MB each |
| Videos | ≤ 3 (omni only) | Not supported | mp4, mov | 50 MB each |
| Audio | ≤ 3 | ≤ 3 | mp3, wav | 15 MB each |
| **First-Last** | — | 1–2 images | jpeg, png, webp | 30 MB each |
| **Video Edit** | 1 video + ≤ 9 images | — | mp4 (video) | 10 MB, ≤ 15 s (video) |

**Output**: 4–15 seconds, auto-generated sound, 480p–720p.
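
A minimal sketch of checking uploads against the i2v/omni rows of the table before submitting a job. `check_assets` and `max_count` are hypothetical helpers, not part of the skill. The sketch assumes "MB" means 1024 × 1024 bytes and that `.jpg` is accepted as an alias for jpeg; neither is stated in the source. The First-Last and Video Edit rows are not covered.

```python
from __future__ import annotations

import os
from collections import Counter

MB = 1024 * 1024  # assumption: the table's "MB" is read as mebibytes

# kind -> (accepted extensions, max bytes per file)
FORMATS = {
    "image": ({"jpeg", "jpg", "png", "webp"}, 30 * MB),  # "jpg" alias is an assumption
    "video": ({"mp4", "mov"}, 50 * MB),
    "audio": ({"mp3", "wav"}, 15 * MB),
}


def max_count(kind: str, tier: str, mode: str) -> int:
    """Per-request file limit for one asset kind (i2v/omni rows only)."""
    if kind == "image":
        return 9
    if kind == "audio":
        return 3
    if kind == "video":
        # Video references: Chinese tier, omni mode only.
        return 3 if (tier == "chinese" and mode == "omni") else 0
    raise ValueError(f"unknown asset kind: {kind}")


def check_assets(tier: str, mode: str,
                 assets: list[tuple[str, str, int]]) -> list[str]:
    """assets is a list of (kind, filename, size_in_bytes) tuples."""
    problems = []
    counts = Counter(kind for kind, _, _ in assets)
    for kind, n in counts.items():
        limit = max_count(kind, tier, mode)
        if n > limit:
            problems.append(f"{n} {kind} file(s), limit is {limit} for {tier}/{mode}")
    for kind, name, size in assets:
        extensions, max_bytes = FORMATS[kind]
        ext = os.path.splitext(name)[1].lstrip(".").lower()
        if ext not in extensions:
            problems.append(f"{name}: .{ext} is not an accepted {kind} format")
        if size > max_bytes:
            problems.append(f"{name}: larger than {max_bytes // MB} MB")
    return problems


if __name__ == "__main__":
    print(check_assets("global", "omni", [("video", "ref.mp4", 10 * MB)]))
```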

---

## ⚠️ Restrictions

- **No realistic human faces** in uploaded images/videos (except character/omni-train modes).
- `--mode extend` requires a `request_id` from a prior `seedance-v2.0-t2v` or `seedance-v2.0-i2v` job.
- `--mode first-last` requires `--tier global` or `--tier vip`.
- Global/VIP omni does **not** support video references (images + audio only).
- `--quality` applies to Chinese tier only.
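
The flag-level restrictions above can be expressed as a single check. This is a sketch under stated assumptions: `check_restrictions` and its `video_refs` parameter are hypothetical names, the mode strings are taken from the Core Competencies list, and the face restriction is left out because it cannot be verified from flags alone.

```python
from __future__ import annotations


def check_restrictions(mode: str, tier: str = "chinese", quality: str | None = None,
                       request_id: str | None = None, video_refs: int = 0) -> list[str]:
    """Return the restrictions a planned call would violate (empty list if none)."""
    errors = []
    if mode == "extend" and not request_id:
        errors.append("extend needs a request_id from a prior t2v or i2v job")
    if mode == "first-last" and tier not in ("global", "vip"):
        errors.append("first-last needs --tier global or --tier vip")
    if mode == "omni" and tier in ("global", "vip") and video_refs > 0:
        errors.append("Global/VIP omni accepts images and audio only")
    if quality is not None and tier != "chinese":
        errors.append("--quality applies to the Chinese tier only")
    return errors


if __name__ == "__main__":
    # first-last on the default (Chinese) tier is rejected.
    print(check_restrictions("first-last"))
    print(check_restrictions("omni", tier="vip", video_refs=1, quality="high"))
```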

---

## 🔗 Core Syntax: The @ Reference System

Assign explicit roles to each uploaded asset. Tags differ by mode.

### Chinese Tier (i2v, omni)
```
@image1  @image2  ...  @image9    (images_list order)
@video1  @video2  @video3         (video_files order)
@audio1  @audio2  @audio3         (audio_files order)
```

### Global/VIP Omni (omni-reference-no-video / vip-omni-reference)
```
@image1  @image2  ...  @image9    (images_list order)
@audio1  @audio2  @audio3         (audio_files order)
```

### Character References (all tiers)
```
@character:<request_id>            — from seedance-2-character or completed t2v/i2v job
@omni-character:<character_id>     — from seedance-2-omni-reference-train output
```

### Role Assignment Table

| Purpose | Example Syntax |
|:---|:---|
| First frame | `@Image1 as the first frame` |
| Last frame | `@Image2 as the last frame` |
| Character appearance | `@Image1's character as the subject` |
| Scene / background | `scene references @Image3` |
| Camera movement | `reference @Video1's camera movement` |
| Action / motion | `reference @Video1's action choreography` |
| Visual effects | `completely reference @Video1's effects and transitions` |
| Rhythm / tempo | `video rhythm references @Video1` |
| Voice / tone | `narration voice references @Video1` |
| Background music | `BGM references @Audio1` |
| Sound effects | `sound effects reference @Video3's audio` |
| Outfit / clothing | `wearing the outfit from @Image2` |
| Product appearance | `product details reference @Image3` |

### Multi-Reference Combination
```
@Image1's character as the subject, reference @Video1's camera movement
and action choreography, BGM references @Audio1, scene references @Image2
```
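
Because each tag is resolved by upload order, a prompt that mentions `@Video1` with no video attached has nothing to point at. The sketch below scans a prompt for numbered tags and reports those without a matching upload. `unresolved_tags` is a hypothetical helper, and matching tags case-insensitively is an assumption (the source writes both `@image1` and `@Image1`). Character tags such as `@character:<request_id>` are ignored.

```python
from __future__ import annotations

import re

# Matches @image1, @Video2, @audio3, ... but not @character:<id> tags.
TAG = re.compile(r"@(image|video|audio)(\d+)", re.IGNORECASE)


def unresolved_tags(prompt: str, images: int = 0, videos: int = 0,
                    audio: int = 0) -> list[str]:
    """Return numbered tags in the prompt that exceed the uploaded asset counts."""
    available = {"image": images, "video": videos, "audio": audio}
    missing = []
    for match in TAG.finditer(prompt):
        kind = match.group(1).lower()
        index = int(match.group(2))
        if not 1 <= index <= available[kind]:
            tag = f"@{kind}{index}"
            if tag not in missing:  # report each tag once, in order of appearance
                missing.append(tag)
    return missing


if __name__ == "__main__":
    prompt = (
        "@Image1's character as the subject, reference @Video1's camera movement "
        "and action choreography, BGM references @Audio1, scene references @Image2"
    )
    print(unresolved_tags(prompt, images=2, videos=0, audio=1))
```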

---

## 🏗️ Technical Specification: The Director Brief

Structure prompts using this six-component hierarchy. Order matters — composition first, texture and micro-motion last:

| Component | Instruction Type | Example |
|:---|:---|:---|
| **Scene** | Environment + Lighting | "A rain-soaked cyberpunk street, magenta neon reflections on wet asphalt." |
| **Subject** | Identity + Detail | "A woman in a black trenchcoat, determined focus, cinematic skin textures." |
| **Action** | Fluid Interaction | "Walking forward through the crowd, coat billowing slightly in the wind." |
| **Camera** | Movement + Lens + Speed | "Medium tracking shot, 35mm lens, slow dolly backward over 6s. Subtle handheld jitter." |
| **Audio** | Music + SFX + Ambience | "Low ambient hum, distant traffic, single piano note at 5s. No dialogue." |
| **Pacing/Style** | Timing + Mood + Grade | "Cinematic epic, warm color grade, shallow DOF. Slow build — single action only, no scene cuts." |

> **Seedance 2.0 generates audio natively.** Always include an Audio directive, even a single sentence. Without one, the model generates arbitrary ambient sound that may not match your scene.
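
One way to keep the six components in order and avoid a missing Audio directive is to assemble the brief from named fields. This is a minimal sketch: `DirectorBrief` is a hypothetical class, and joining the components with a single space is an assumption, since the source does not specify a separator.

```python
from dataclasses import dataclass, fields


@dataclass
class DirectorBrief:
    # Field order mirrors the component table: Scene through Pacing/Style.
    scene: str
    subject: str
    action: str
    camera: str
    audio: str
    pacing_style: str

    def render(self) -> str:
        """Join the components in table order; reject any empty component."""
        parts = []
        for field in fields(self):  # dataclass fields keep definition order
            text = getattr(self, field.name).strip()
            if not text:
                raise ValueError(f"missing {field.name} directive")
            parts.append(text)
        return " ".join(parts)


if __name__ == "__main__":
    brief = DirectorBrief(
        scene="A rain-soaked cyberpunk street, magenta neon reflections on wet asphalt.",
        subject="A woman in a black trenchcoat, determined focus.",
        action="Walking forward through the crowd, coat billowing slightly in the wind.",
        camera="Medium tracking shot, 35mm lens, slow dolly backward over 6s.",
        audio="Low ambient hum, distant traffic. No dialogue.",
        pacing_style="Warm color grade, shallow DOF. Single action only, no scene cuts.",
    )
    print(brief.render())
```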

### Time-Segmented Prompts (Recommended for 10s+ videos)
Break prompts into timed segments for precise control:
```
0–3s: [opening scene, camera move, establishing action]
3–6s: [mid-section development, subject in motion]
6–10s: [climax or key action beat]
10–15s: [resolution, brand/product hold, text/tagline fade in]
```

> **Single-beat rule:** Each segment should contain one action. A 4–7 s clip holds one beat; a 10–15 s clip holds 3–4 beats at most. Overloading a segment with multiple narrative changes degrades output quality.
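
The segment format and the single-beat rule can be combined in a small formatter. This is a sketch, not the skill's own tooling: `render_segments` and `max_beats` are hypothetical names, segment times are assumed to be whole seconds, and the 4-beat ceiling applied to 8–9 s clips is an assumption because the rule does not cover that range.

```python
from __future__ import annotations


def max_beats(total_seconds: int) -> int:
    """Beat ceiling: one beat up to 7 s, otherwise at most four.

    Assumption: the rule does not cover 8-9 s, so the 4-beat ceiling is used there.
    """
    return 1 if total_seconds <= 7 else 4


def render_segments(segments: list[tuple[int, int, str]]) -> str:
    """Format (start, end, description) tuples as a time-segmented prompt."""
    if not segments:
        raise ValueError("at least one segment is required")
    if segments[0][0] != 0:
        raise ValueError("the first segment must start at 0s")
    for start, end, _ in segments:
        if end <= start:
            raise ValueError(f"segment {start}-{end}s has no duration")
    for (_, end, _), (next_start, _, _) in zip(segments, segments[1:]):
        if end != next_start:
            raise ValueError("segments must be contiguous")
    total = segments[-1][1]
    if not 4 <= total <= 15:
        raise ValueError("total duration must be within 4-15 s")
    if len(segments) > max_beats(total):
        raise ValueError(f"too many beats for a {total}s clip")
    # \u2013 is the en dash used in the segment labels above.
    return "\n".join(f"{start}\u2013{end}s: {text}" for start, end, text in segments)


if __name__ == "__main__":
    print(render_segments([
        (0, 3, "Wide establishing shot, slow push in."),
        (3, 6, "Subject enters frame, tracking shot."),
        (6, 10, "Key action beat, orbit around subject."),
        (10, 15, "Product hold, tagline fades in."),
    ]))
```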

### Negative Prompting

Seedance 2.0 supports appending negative guidance directly in the prompt. Use plain language at the end:

```
[your director brief above]
Avoid: camera shake, jump cuts, lens distortion, overexposure, watermarks, text overlays.
```

Common negative additions:
- `Avoid: abrupt cuts, scene changes, multiple locations.` (for single-take shots)
- `Avoid: human faces, realistic people.` (for product-only content)
- `Avoid: fast motion, blur, unstable framing.` (for smooth product reveals)
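
Appending the negative line programmatically keeps it at the end of the prompt, where the section says it belongs. A minimal sketch, with `with_negatives` as a hypothetical helper name:

```python
from __future__ import annotations


def with_negatives(brief: str, avoid: list[str]) -> str:
    """Append a plain-language 'Avoid:' line to the end of a director brief."""
    items = [item.strip() for item in avoid if item.strip()]
    if not items:
        return brief  # nothing to avoid, leave the brief unchanged
    return f"{brief.rstrip()}\nAvoid: {', '.join(items)}."


if __name__ == "__main__":
    print(with_negatives(
        "Slow push in on a perfume bottle, soft studio light. Low ambient hum.",
        ["fast motion", "blur", "unstable framing"],
    ))
```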

---

## 🎥 Camera Language Reference

### Basic Movements
| Term | Description |
|:---|:---|
| Push in / Slow push | Camera moves toward subject |
| Pull back / Pull away | Camera moves away from subject |
| Pan left/right | Camera rotates horizontally |
| Tilt up/down | Camera rotates vertically |
| Track / Follow shot | Camera follows subject movement |
| Orbit / Revolve | Camera circles around subject |
| One-take / Oner | Continuous shot with no cuts |

### Advanced Techniques
| Term | Description |
|:---|:---|
| Hitchcock zoom (dolly zoom) | Push in + zoom out — creates vertigo effect |
| Fisheye lens | Ultra-wide distorted lens |
| Low angle / High angle | Camera below/above subject |
| Bird's eye / Overhead | Top-down view |
| First-person POV (FPV) | Immersive subjective camera from character/object's eyes — GoPro-style wide angle, forward motion, no cuts |
| Drone flythrough | Cinematic aerial descent — gimbal-stabilized, sweeping lateral arc, DJI Inspire aesthetic |
| Architectural flythrough | Ground-level continuous dolly through connected spaces — one-take, practical lighting |
| Whip pan | Very fast horizontal pan with motion blur |
| Crane shot | Vertical movement like a crane arm |

### Shot Sizes
| Term | Description |
|:---|:---|
| Extreme close-up | Eyes, mouth, or small detail only |
| Close-up | Face fills frame |
| Medium close-up | Head and shoulders |
| Medium shot | Waist up |
| Full shot | Entire body |
| Wide / Est

Files: 2

Size: 58.6 KB

Complexity: 56/100

Category: Image & Video

Source: https://github.com/samuraigpt/generative-media-skills/tree/main/library/motion/seedance-2
