muapi-seedance-2
Expert Cinema Director skill for Seedance 2.0 (ByteDance) โ high-fidelity video generation across Chinese, Global, and VIP tiers. Supports text-to-video, image-to-video, first-last-frame, omni reference, character training, omni-reference training, video editing, and watermark removal.
What this skill does
# ๐ฌ Seedance 2.0 Cinema Expert **The definitive skill for "Director-Level" AI video orchestration.** Seedance 2.0 is not a descriptive model; it is an *instructional* model. It responds best to technical cinematography, physics directives, and precise camera grammar. ## Core Competencies 1. **Text-to-Video (t2v)**: Generate cinematic video from a Director Brief โ Chinese, Global, or VIP tier. 2. **Image-to-Video (i2v)**: Animate 1โ9 reference images โ Chinese, Global (smart mode), or VIP tier. 3. **Video Extension (extend)**: Seamlessly continue an existing Seedance 2.0 video (Chinese tier). 4. **First & Last Frame (first-last)**: Interpolate a fluid video between a start image and end image (Global/VIP). 5. **Omni Reference (omni)**: Full multimodal reference with images + audio + character refs (all tiers). 6. **Omni Reference Training (omni-train)**: Train a custom persistent character for identity-consistent generation. 7. **Character Sheet (character)**: Build a reusable character from 1โ3 images (Chinese tier). 8. **Video Edit (video-edit)**: Edit an existing video with a prompt + optional reference images (Chinese tier). 9. **Watermark Removal (watermark-remove)**: Strip Seedance 2.0 watermarks (basic or Pro). --- ## ๐ท๏ธ Tiers | Tier | Flag | Censorship | Aspect Ratios | Duration | Quality param | |:---|:---|:---|:---|:---|:---| | **Chinese** (default) | `--tier chinese` | Low | 16:9, 9:16, 4:3, 3:4 | 5 / 10 / 15 s | Yes (basic/high) | | **Global** | `--tier global` | Standard | + 21:9, 1:1 | Any 4โ15 s | No | | **VIP** | `--tier vip` | Low | + 21:9, 1:1 | Any 4โ15 s | No | Add `--fast` to any Global or VIP call to use the fast-queue variant (lower latency, same quality). --- ## ๐ฅ Input Limits | Input Type | Chinese i2v/omni | Global/VIP i2v/omni | Formats | Max Size | |:---|:---|:---|:---|:---| | Images | โค 9 | โค 9 | jpeg, png, webp | 30 MB each | | Videos | โค 3 (omni only) | Not supported | mp4, mov | 50 MB each | | Audio | โค 3 | โค 3 | mp3, wav | 15 MB each | | **First-Last** | โ | 1โ2 images | jpeg, png, webp | 30 MB each | | **Video Edit** | 1 video + โค 9 imgs | โ | mp4 โค 10 MB / 15s | โ | **Output**: 4โ15 seconds, auto-generated sound, 480pโ720p. --- ## โ ๏ธ Restrictions - **No realistic human faces** in uploaded images/videos (except character/omni-train modes). - `--mode extend` requires a `request_id` from a prior `seedance-v2.0-t2v` or `seedance-v2.0-i2v` job. - `--mode first-last` requires `--tier global` or `--tier vip`. - Global/VIP omni does **not** support video references (images + audio only). - `--quality` applies to Chinese tier only. --- ## ๐ Core Syntax: The @ Reference System Assign explicit roles to each uploaded asset. Tags differ by mode. ### Chinese Tier (i2v, omni) ``` @image1 @image2 ... @image9 (images_list order) @video1 @video2 @video3 (video_files order) @audio1 @audio2 @audio3 (audio_files order) ``` ### Global/VIP Omni (omni-reference-no-video / vip-omni-reference) ``` @image1 @image2 ... @image9 (images_list order) @audio1 @audio2 @audio3 (audio_files order) ``` ### Character References (all tiers) ``` @character:<request_id> โ from seedance-2-character or completed t2v/i2v job @omni-character:<character_id> โ from seedance-2-omni-reference-train output ``` ### Role Assignment Table | Purpose | Example Syntax | |:---|:---| | First frame | `@Image1 as the first frame` | | Last frame | `@Image2 as the last frame` | | Character appearance | `@Image1's character as the subject` | | Scene / background | `scene references @Image3` | | Camera movement | `reference @Video1's camera movement` | | Action / motion | `reference @Video1's action choreography` | | Visual effects | `completely reference @Video1's effects and transitions` | | Rhythm / tempo | `video rhythm references @Video1` | | Voice / tone | `narration voice references @Video1` | | Background music | `BGM references @Audio1` | | Sound effects | `sound effects reference @Video3's audio` | | Outfit / clothing | `wearing the outfit from @Image2` | | Product appearance | `product details reference @Image3` | ### Multi-Reference Combination ``` @Image1's character as the subject, reference @Video1's camera movement and action choreography, BGM references @Audio1, scene references @Image2 ``` --- ## ๐๏ธ Technical Specification: The Director Brief Structure prompts using this six-component hierarchy. Order matters โ composition first, texture and micro-motion last: | Component | Instruction Type | Example | |:---|:---|:---| | **Scene** | Environment + Lighting | "A rain-soaked cyberpunk street, magenta neon reflections on wet asphalt." | | **Subject** | Identity + Detail | "A woman in a black trenchcoat, determined focus, cinematic skin textures." | | **Action** | Fluid Interaction | "Walking forward through the crowd, coat billowing slightly in the wind." | | **Camera** | Movement + Lens + Speed | "Medium tracking shot, 35mm lens, slow dolly backward over 6s. Subtle handheld jitter." | | **Audio** | Music + SFX + Ambience | "Low ambient hum, distant traffic, single piano note at 5s. No dialogue." | | **Pacing/Style** | Timing + Mood + Grade | "Cinematic epic, warm color grade, shallow DOF. Slow build โ single action only, no scene cuts." | > **Seedance 2.0 generates audio natively.** Always include an Audio directive โ even one sentence. Without it the model generates random ambient sound that may not match your scene. ### Time-Segmented Prompts (Recommended for 10s+ videos) Break prompts into timed segments for precise control: ``` 0โ3s: [opening scene, camera move, establishing action] 3โ6s: [mid-section development, subject in motion] 6โ10s: [climax or key action beat] 10โ15s: [resolution, brand/product hold, text/tagline fade in] ``` > **Single-beat rule:** Each segment should contain one action. 4โ7s = one beat. 10โ15s = 3โ4 beats maximum. Overloading a segment with multiple narrative changes degrades output quality. ### Negative Prompting Seedance 2.0 supports appending negative guidance directly in the prompt. Use plain language at the end: ``` [your director brief above] Avoid: camera shake, jump cuts, lens distortion, overexposure, watermarks, text overlays. ``` Common negative additions: - `Avoid: abrupt cuts, scene changes, multiple locations.` (for single-take shots) - `Avoid: human faces, realistic people.` (for product-only content) - `Avoid: fast motion, blur, unstable framing.` (for smooth product reveals) --- ## ๐ฅ Camera Language Reference ### Basic Movements | Term | Description | |:---|:---| | Push in / Slow push | Camera moves toward subject | | Pull back / Pull away | Camera moves away from subject | | Pan left/right | Camera rotates horizontally | | Tilt up/down | Camera rotates vertically | | Track / Follow shot | Camera follows subject movement | | Orbit / Revolve | Camera circles around subject | | One-take / Oner | Continuous shot with no cuts | ### Advanced Techniques | Term | Description | |:---|:---| | Hitchcock zoom (dolly zoom) | Push in + zoom out โ creates vertigo effect | | Fisheye lens | Ultra-wide distorted lens | | Low angle / High angle | Camera below/above subject | | Bird's eye / Overhead | Top-down view | | First-person POV (FPV) | Immersive subjective camera from character/object's eyes โ GoPro-style wide angle, forward motion, no cuts | | Drone flythrough | Cinematic aerial descent โ gimbal-stabilized, sweeping lateral arc, DJI Inspire aesthetic | | Architectural flythrough | Ground-level continuous dolly through connected spaces โ one-take, practical lighting | | Whip pan | Very fast horizontal pan with motion blur | | Crane shot | Vertical movement like a crane arm | ### Shot Sizes | Term | Description | |:---|:---| | Extreme close-up | Eyes, mouth, or small detail only | | Close-up | Face fills frame | | Medium close-up | Head and shoulders | | Medium shot | Waist up | | Full shot | Entire body | | Wide / Est
Related in Image & Video
watch
IncludedWatch a video (URL or local path). Downloads with yt-dlp, extracts auto-scaled frames with ffmpeg, pulls the transcript from captions (or Whisper API fallback), and hands the result to Claude so it can answer questions about what's in the video.
physical-ai-defect-image-generation
IncludedUse when the user wants to orchestrate defect image generation, run associated setup, or handle outputs on OSMO. The Day 0 path handles cold-start with USD-to-ROI, image-edit augmentation, and AnomalyGen to create initial PCBA datasets. The Day 1 path performs inference and labeling on real images. This skill helps with first-time asset setup, creation of finetuning checkpoints, and configuring deployment. Trigger keywords: defect image generation, dig workflow, dig pipeline, defect image detection workflow, aoi pipeline, aoi anomalygen, usd2roi anomalygen, day 0 pcba, day 1 pcba, day 1 real-photo alignment, day 1 manual roi, metal surface anomaly, glass defect, anomalygen finetune, setup_pcb, setup_metal, setup_glass, setup_pretrained, dig setup, dig datasets, dig pretrained checkpoint, dig image-edit endpoint.
accelint-react-best-practices
IncludedReact performance optimization and best practices. ALWAYS use this skill when working with any React code - writing components, hooks, JSX; refactoring; optimizing re-renders, memoization, state management; reviewing for performance; fixing hydration mismatches; debugging infinite re-renders, stale closures, input focus loss, animations restarting; preventing remounting; implementing transitions, lazy initialization, effect dependencies. Even simple React tasks benefit from these patterns. Covers React 19+ (useEffectEvent, Activity, ref props). Triggers - useEffect, useState, useMemo, useCallback, memo, inline components, nested components, components inside components, re-render, performance, hydration, SSR, Next.js, useDeferredValue, combined hooks.
elevenlabs-agents
IncludedBuild conversational AI voice agents with ElevenLabs Platform using React, JavaScript, React Native, or Swift SDKs. Configure agents, tools (client/server/MCP), RAG knowledge bases, multi-voice, and Scribe real-time STT. Use when: building voice chat interfaces, implementing AI phone agents with Twilio, configuring agent workflows or tools, adding RAG knowledge bases, testing with CLI "agents as code", or troubleshooting deprecated @11labs packages, Android audio cutoff, CSP violations, dynamic variables, or WebRTC config. Keywords: ElevenLabs Agents, ElevenLabs voice agents, AI voice agents, conversational AI, @elevenlabs/react, @elevenlabs/client, @elevenlabs/react-native, @elevenlabs/elevenlabs-js, @elevenlabs/agents-cli, elevenlabs SDK, voice AI, TTS, text-to-speech, ASR, speech recognition, turn-taking model, WebRTC voice, WebSocket voice, ElevenLabs conversation, agent system prompt, agent tools, agent knowledge base, RAG voice agents, multi-voice agents, pronunciation dictionary, voice speed control, elevenlabs scribe, @11labs deprecated, Android audio cutoff, CSP violation elevenlabs, dynamic variables elevenlabs, case-sensitive tool names, webhook authentication
humanizer
IncludedHumanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 28 pattern detectors, 560+ AI vocabulary terms across 3 tiers, and statistical analysis (burstiness, type-token ratio, readability) for comprehensive detection. Use when asked to humanize text, de-AI writing, make content sound more natural/human, review writing for AI patterns, score text for AI detection, or improve AI-generated drafts. Covers content, language, style, communication, and filler categories.
generating-mermaid-diagrams
IncludedSalesforce architecture diagrams using Mermaid with ASCII fallback. Use this skill when generating text-based diagrams for Salesforce architecture, OAuth flows, ERDs, integration sequences, or Agentforce structure. TRIGGER when: user says "diagram", "visualize", "ERD", or asks for sequence diagrams, flowcharts, class diagrams, or architecture visualizations in Mermaid. DO NOT TRIGGER when: user wants PNG/SVG image output (use generating-visual-diagrams), or asks about non-Salesforce systems.