remotion-video-toolkit
Complete toolkit for programmatic video creation with Remotion + React. Covers animations, timing, rendering (CLI/Node.js/Lambda/Cloud Run), captions, 3D, charts, text effects, transitions, and media handling. Use when writing Remotion code, building video generation pipelines, or creating data-driven video templates.
What this skill does
# Remotion Video Toolkit
Write React components, get real MP4 videos. This skill teaches your AI agent how to build with Remotion — from a first animation to a production rendering pipeline.
29 rules. Every major Remotion feature covered.
---
## What you can build with this
**Personalized video at scale**
Feed user data as JSON props, render a unique video per user. Think Spotify Wrapped, GitHub Unwrapped, onboarding walkthroughs — one template, thousands of outputs.
**Automated social media clips**
Pull live data (stats, leaderboards, product metrics) and render daily or weekly video posts without anyone touching a timeline editor.
**Dynamic ads and marketing videos**
Swap in customer name, product image, pricing. Same template, infinite variations. Render server-side via API or Lambda.
**Animated data visualizations**
Turn dashboards and KPI reports into shareable video clips with animated charts and transitions.
**TikTok and Reels captions**
Transcribe audio, display word-by-word highlighted subtitles, export ready for upload.
**Product showcase videos**
Auto-generate from your database — images, specs, pricing — straight to MP4.
**Educational and explainer content**
Animated course materials, certificate videos, step-by-step walkthroughs — all driven by code.
**Video generation as a service**
Expose rendering as an HTTP endpoint. Your app sends JSON, gets back a video file.
---
## Requirements
- **Node.js** 18+
- **React** 18+ (Remotion renders React components frame-by-frame)
- **Remotion** — scaffold with `npx create-video@latest`
- **FFmpeg** — ships with `@remotion/renderer`, no separate install needed
- For serverless rendering: **AWS** account (Lambda) or **GCP** account (Cloud Run)
---
## What's inside
### Core
| Rule | Description |
|------|-------------|
| [Compositions](rules/compositions.md) | Define videos, stills, folders, default props, dynamic metadata |
| [Rendering](rules/rendering.md) | CLI, Node.js API, AWS Lambda, Cloud Run, Express server patterns |
| [Calculate metadata](rules/calculate-metadata.md) | Set duration, dimensions, and props dynamically at render time |
### Animation and timing
| Rule | Description |
|------|-------------|
| [Animations](rules/animations.md) | Fade, scale, rotate, slide |
| [Timing](rules/timing.md) | Interpolation curves, easing, spring physics |
| [Sequencing](rules/sequencing.md) | Delay, chain, and orchestrate scenes |
| [Transitions](rules/transitions.md) | Scene-to-scene transitions |
| [Trimming](rules/trimming.md) | Cut the start or end of any animation |
### Text and typography
| Rule | Description |
|------|-------------|
| [Text animations](rules/text-animations.md) | Typewriter, word highlight, reveal effects |
| [Fonts](rules/fonts.md) | Google Fonts and local font loading |
| [Measuring text](rules/measuring-text.md) | Fit text to containers, detect overflow |
### Media
| Rule | Description |
|------|-------------|
| [Videos](rules/videos.md) | Embed, trim, speed, volume, loop, pitch shift |
| [Audio](rules/audio.md) | Import, trim, fade, volume and speed control |
| [Images](rules/images.md) | The Img component |
| [GIFs](rules/gifs.md) | Timeline-synced GIF playback |
| [Assets](rules/assets.md) | Importing any media into compositions |
| [Decode check](rules/can-decode.md) | Validate browser compatibility |
### Captions and subtitles
| Rule | Description |
|------|-------------|
| [Transcribe captions](rules/transcribe-captions.md) | Audio to captions via Whisper, Deepgram, or AssemblyAI |
| [Display captions](rules/display-captions.md) | TikTok-style word-by-word highlighting |
| [Import SRT](rules/import-srt-captions.md) | Load existing .srt files |
### Data visualization
| Rule | Description |
|------|-------------|
| [Charts](rules/charts.md) | Animated bar charts, line graphs, data-driven visuals |
### Advanced
| Rule | Description |
|------|-------------|
| [3D content](rules/3d.md) | Three.js and React Three Fiber |
| [Lottie](rules/lottie.md) | After Effects animations via Lottie |
| [TailwindCSS](rules/tailwind.md) | Style compositions with Tailwind |
| [DOM measurement](rules/measuring-dom-nodes.md) | Measure element dimensions at render time |
### Media utilities
| Rule | Description |
|------|-------------|
| [Video duration](rules/get-video-duration.md) | Get length in seconds |
| [Video dimensions](rules/get-video-dimensions.md) | Get width and height |
| [Audio duration](rules/get-audio-duration.md) | Get audio length |
| [Extract frames](rules/extract-frames.md) | Pull frames at specific timestamps |
---
## Quick start
```bash
# Scaffold a project
npx create-video@latest my-video
# Preview in browser
cd my-video && npm start
# Render to MP4
npx remotion render src/index.ts MyComposition out/video.mp4
# Pass dynamic data
npx remotion render src/index.ts MyComposition out.mp4 --props '{"title": "Hello"}'
```
---
## Contribute
**Source:** [github.com/shreefentsar/remotion-video-toolkit](https://github.com/shreefentsar/remotion-video-toolkit)
Missing something? Found a better approach? Open a PR — new rules, improved examples, bug fixes all welcome.
Built by [Zone 99](https://99.zone)
Related in Image & Video
watch
IncludedWatch a video (URL or local path). Downloads with yt-dlp, extracts auto-scaled frames with ffmpeg, pulls the transcript from captions (or Whisper API fallback), and hands the result to Claude so it can answer questions about what's in the video.
physical-ai-defect-image-generation
IncludedUse when the user wants to orchestrate defect image generation, run associated setup, or handle outputs on OSMO. The Day 0 path handles cold-start with USD-to-ROI, image-edit augmentation, and AnomalyGen to create initial PCBA datasets. The Day 1 path performs inference and labeling on real images. This skill helps with first-time asset setup, creation of finetuning checkpoints, and configuring deployment. Trigger keywords: defect image generation, dig workflow, dig pipeline, defect image detection workflow, aoi pipeline, aoi anomalygen, usd2roi anomalygen, day 0 pcba, day 1 pcba, day 1 real-photo alignment, day 1 manual roi, metal surface anomaly, glass defect, anomalygen finetune, setup_pcb, setup_metal, setup_glass, setup_pretrained, dig setup, dig datasets, dig pretrained checkpoint, dig image-edit endpoint.
accelint-react-best-practices
IncludedReact performance optimization and best practices. ALWAYS use this skill when working with any React code - writing components, hooks, JSX; refactoring; optimizing re-renders, memoization, state management; reviewing for performance; fixing hydration mismatches; debugging infinite re-renders, stale closures, input focus loss, animations restarting; preventing remounting; implementing transitions, lazy initialization, effect dependencies. Even simple React tasks benefit from these patterns. Covers React 19+ (useEffectEvent, Activity, ref props). Triggers - useEffect, useState, useMemo, useCallback, memo, inline components, nested components, components inside components, re-render, performance, hydration, SSR, Next.js, useDeferredValue, combined hooks.
elevenlabs-agents
IncludedBuild conversational AI voice agents with ElevenLabs Platform using React, JavaScript, React Native, or Swift SDKs. Configure agents, tools (client/server/MCP), RAG knowledge bases, multi-voice, and Scribe real-time STT. Use when: building voice chat interfaces, implementing AI phone agents with Twilio, configuring agent workflows or tools, adding RAG knowledge bases, testing with CLI "agents as code", or troubleshooting deprecated @11labs packages, Android audio cutoff, CSP violations, dynamic variables, or WebRTC config. Keywords: ElevenLabs Agents, ElevenLabs voice agents, AI voice agents, conversational AI, @elevenlabs/react, @elevenlabs/client, @elevenlabs/react-native, @elevenlabs/elevenlabs-js, @elevenlabs/agents-cli, elevenlabs SDK, voice AI, TTS, text-to-speech, ASR, speech recognition, turn-taking model, WebRTC voice, WebSocket voice, ElevenLabs conversation, agent system prompt, agent tools, agent knowledge base, RAG voice agents, multi-voice agents, pronunciation dictionary, voice speed control, elevenlabs scribe, @11labs deprecated, Android audio cutoff, CSP violation elevenlabs, dynamic variables elevenlabs, case-sensitive tool names, webhook authentication
humanizer
IncludedHumanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 28 pattern detectors, 560+ AI vocabulary terms across 3 tiers, and statistical analysis (burstiness, type-token ratio, readability) for comprehensive detection. Use when asked to humanize text, de-AI writing, make content sound more natural/human, review writing for AI patterns, score text for AI detection, or improve AI-generated drafts. Covers content, language, style, communication, and filler categories.
generating-mermaid-diagrams
IncludedSalesforce architecture diagrams using Mermaid with ASCII fallback. Use this skill when generating text-based diagrams for Salesforce architecture, OAuth flows, ERDs, integration sequences, or Agentforce structure. TRIGGER when: user says "diagram", "visualize", "ERD", or asks for sequence diagrams, flowcharts, class diagrams, or architecture visualizations in Mermaid. DO NOT TRIGGER when: user wants PNG/SVG image output (use generating-visual-diagrams), or asks about non-Salesforce systems.