HeyGen Automation
Automate AI video generation, avatar browsing, template-based video creation, and video status tracking through HeyGen's platform via Composio
What this skill does
# HeyGen Automation Automate AI-powered video creation workflows -- browse avatars and templates, generate personalized videos from templates, track processing status, and retrieve shareable URLs -- all orchestrated through the Composio MCP integration. **Toolkit docs:** [composio.dev/toolkits/heygen](https://composio.dev/toolkits/heygen) --- ## Setup 1. Connect your HeyGen account through the Composio MCP server at `https://rube.app/mcp` 2. The agent will prompt you with an authentication link if no active connection exists 3. Once connected, all `HEYGEN_*` tools become available for execution --- ## Core Workflows ### 1. Browse Available Templates Retrieve pre-designed avatar templates from the HeyGen library for video creation. **Tool:** `HEYGEN_V2_TEMPLATES` ``` No parameters required -- returns template IDs, names, preview images, and customization options. ``` Use this to discover templates before generating videos. Each template has a unique ID needed for generation. --- ### 2. Inspect Template Details & Variables Retrieve the full structure of a template including all variables, scene mappings, and customization options. **Tool:** `HEYGEN_RETRIEVE_TEMPLATE_DETAILS_V3` | Parameter | Type | Required | Description | |-----------|------|----------|-------------| | `template_id` | string | Yes | Template ID from `HEYGEN_V2_TEMPLATES` or the HeyGen dashboard | This is a **required prerequisite** before generating videos -- it tells you exactly which variables must be populated. --- ### 3. Browse Available Avatars List all realistic avatars available for video creation, useful for mapping characters to scenes. **Tool:** `HEYGEN_V2_AVATARS` ``` No parameters required -- returns avatar identifiers, names, and preview images. The list may be paginated; additional calls might be needed for the full set. ``` --- ### 4. Generate Video from Template Create a customized video by populating a template with your variables and content. **Tool:** `HEYGEN_V2_TEMPLATE_GENERATE` Provide a video title and all template variables discovered via `HEYGEN_RETRIEVE_TEMPLATE_DETAILS_V3`. Every required variable must be supplied with exact names and types. --- ### 5. Check Video Processing Status Poll the processing status and retrieve metadata for a video being generated. **Tool:** `HEYGEN_RETRIEVE_VIDEO_STATUS_DETAILS` | Parameter | Type | Required | Description | |-----------|------|----------|-------------| | `video_id` | string | Yes | Video ID returned from the generation API | Returned URLs expire after 7 days but can be regenerated by calling this endpoint again. --- ### 6. Get Shareable Video URL Generate a public, shareable link that allows anyone to view the video without authentication. **Tool:** `HEYGEN_RETRIEVE_SHARABLE_VIDEO_URL` Provide the `video_id` to receive a public URL for distribution. --- ## Known Pitfalls | Pitfall | Details | |---------|---------| | **All template variables required** | `HEYGEN_V2_TEMPLATE_GENERATE` demands every variable from `HEYGEN_RETRIEVE_TEMPLATE_DETAILS_V3` with exact names and types -- missing or misnamed fields break scenes or fail generation | | **Always inspect templates first** | Never skip `HEYGEN_RETRIEVE_TEMPLATE_DETAILS_V3` before generation -- multi-scene templates have complex variable structures | | **Video URLs expire** | URLs from `HEYGEN_RETRIEVE_VIDEO_STATUS_DETAILS` expire after 7 days; regenerate by calling the endpoint again | | **Paginated avatar lists** | `HEYGEN_V2_AVATARS` may return paginated results -- additional calls may be needed for the full library | | **Processing time varies** | Video generation is asynchronous; poll `HEYGEN_RETRIEVE_VIDEO_STATUS_DETAILS` until status is complete before using output URLs | --- ## Quick Reference | Tool Slug | Purpose | |-----------|---------| | `HEYGEN_V2_TEMPLATES` | List available video templates | | `HEYGEN_RETRIEVE_TEMPLATE_DETAILS_V3` | Get template variables and scene structure | | `HEYGEN_V2_AVATARS` | Browse available AI avatars | | `HEYGEN_V2_TEMPLATE_GENERATE` | Generate video from a template | | `HEYGEN_RETRIEVE_VIDEO_STATUS_DETAILS` | Check video processing status and get URLs | | `HEYGEN_RETRIEVE_SHARABLE_VIDEO_URL` | Get a public shareable video link | --- *Powered by [Composio](https://composio.dev)*
Related in Image & Video
watch
IncludedWatch a video (URL or local path). Downloads with yt-dlp, extracts auto-scaled frames with ffmpeg, pulls the transcript from captions (or Whisper API fallback), and hands the result to Claude so it can answer questions about what's in the video.
physical-ai-defect-image-generation
IncludedUse when the user wants to orchestrate defect image generation, run associated setup, or handle outputs on OSMO. The Day 0 path handles cold-start with USD-to-ROI, image-edit augmentation, and AnomalyGen to create initial PCBA datasets. The Day 1 path performs inference and labeling on real images. This skill helps with first-time asset setup, creation of finetuning checkpoints, and configuring deployment. Trigger keywords: defect image generation, dig workflow, dig pipeline, defect image detection workflow, aoi pipeline, aoi anomalygen, usd2roi anomalygen, day 0 pcba, day 1 pcba, day 1 real-photo alignment, day 1 manual roi, metal surface anomaly, glass defect, anomalygen finetune, setup_pcb, setup_metal, setup_glass, setup_pretrained, dig setup, dig datasets, dig pretrained checkpoint, dig image-edit endpoint.
accelint-react-best-practices
IncludedReact performance optimization and best practices. ALWAYS use this skill when working with any React code - writing components, hooks, JSX; refactoring; optimizing re-renders, memoization, state management; reviewing for performance; fixing hydration mismatches; debugging infinite re-renders, stale closures, input focus loss, animations restarting; preventing remounting; implementing transitions, lazy initialization, effect dependencies. Even simple React tasks benefit from these patterns. Covers React 19+ (useEffectEvent, Activity, ref props). Triggers - useEffect, useState, useMemo, useCallback, memo, inline components, nested components, components inside components, re-render, performance, hydration, SSR, Next.js, useDeferredValue, combined hooks.
elevenlabs-agents
IncludedBuild conversational AI voice agents with ElevenLabs Platform using React, JavaScript, React Native, or Swift SDKs. Configure agents, tools (client/server/MCP), RAG knowledge bases, multi-voice, and Scribe real-time STT. Use when: building voice chat interfaces, implementing AI phone agents with Twilio, configuring agent workflows or tools, adding RAG knowledge bases, testing with CLI "agents as code", or troubleshooting deprecated @11labs packages, Android audio cutoff, CSP violations, dynamic variables, or WebRTC config. Keywords: ElevenLabs Agents, ElevenLabs voice agents, AI voice agents, conversational AI, @elevenlabs/react, @elevenlabs/client, @elevenlabs/react-native, @elevenlabs/elevenlabs-js, @elevenlabs/agents-cli, elevenlabs SDK, voice AI, TTS, text-to-speech, ASR, speech recognition, turn-taking model, WebRTC voice, WebSocket voice, ElevenLabs conversation, agent system prompt, agent tools, agent knowledge base, RAG voice agents, multi-voice agents, pronunciation dictionary, voice speed control, elevenlabs scribe, @11labs deprecated, Android audio cutoff, CSP violation elevenlabs, dynamic variables elevenlabs, case-sensitive tool names, webhook authentication
humanizer
IncludedHumanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 28 pattern detectors, 560+ AI vocabulary terms across 3 tiers, and statistical analysis (burstiness, type-token ratio, readability) for comprehensive detection. Use when asked to humanize text, de-AI writing, make content sound more natural/human, review writing for AI patterns, score text for AI detection, or improve AI-generated drafts. Covers content, language, style, communication, and filler categories.
generating-mermaid-diagrams
IncludedSalesforce architecture diagrams using Mermaid with ASCII fallback. Use this skill when generating text-based diagrams for Salesforce architecture, OAuth flows, ERDs, integration sequences, or Agentforce structure. TRIGGER when: user says "diagram", "visualize", "ERD", or asks for sequence diagrams, flowcharts, class diagrams, or architecture visualizations in Mermaid. DO NOT TRIGGER when: user wants PNG/SVG image output (use generating-visual-diagrams), or asks about non-Salesforce systems.