# Stability AI: Professional Image Generator
## Overview
Image generation via Stability AI (SD3.5, Ultra, Core). Supports text-to-image, img2img, inpainting, upscale, remove-bg, and search-replace, with 15 artistic styles.
## When to Use This Skill
- When the user mentions "stability ai" or related topics
- When the user mentions "stable diffusion" or related topics
- When the user mentions "sd3.5" or related topics
- When the user mentions "gerar arte" (Portuguese for "generate art") or related topics
- When the user mentions "gerar ilustracao" (Portuguese for "generate illustration") or related topics
- When the user mentions "image to image" or related topics
## Do Not Use This Skill When
- The task is unrelated to Stability AI
- A simpler, more specific tool can handle the request
- The user needs general-purpose assistance without domain expertise
## How It Works
Skill for generating artistic and photorealistic images with the Stability AI API.
**Free** under the Community License (no limit for personal use or small businesses).
## When to Use This Skill vs ai-studio-image
| Scenario | Recommended skill |
|----------|-------------------|
| Humanized photo for Instagram/social media | ai-studio-image |
| Digital art, illustration, concept art | **stability-ai** |
| Phone-camera photo (casual realism) | ai-studio-image |
| Cinematic photorealism (8K, detailed) | **stability-ai** |
| Educational material with a professional look | ai-studio-image |
| Poster, wallpaper, book cover, game asset | **stability-ai** |
| Inpainting (edit part of an image) | **stability-ai** |
| Upscale (increase resolution) | **stability-ai** |
| Remove the background of an image | **stability-ai** |
| Search & Replace (swap an object in an image) | **stability-ai** |
| Erase an element from an image | **stability-ai** |
## Quick Setup
1. Create an account at **platform.stability.ai** (free)
2. Copy the API key from the dashboard
3. Paste it into `.env`: `STABILITY_API_KEY=sk-sua-chave-aqui` (replace the placeholder with your own key)
4. `pip install -r scripts/requirements.txt`
Full details are in `references/setup-guide.md`.
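As a rough illustration of step 3, the sketch below reads `STABILITY_API_KEY` from the process environment and falls back to `.env`-style text. The helper names `parse_env_text` and `get_api_key` are hypothetical; the actual loading logic in `scripts/generate.py` is not shown in this document and may differ (for example, it may use a dotenv library).

```python
import os


def parse_env_text(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_api_key(env_text: str = "") -> str:
    """Prefer the process environment, then fall back to .env-style text."""
    key = os.environ.get("STABILITY_API_KEY") or parse_env_text(env_text).get(
        "STABILITY_API_KEY", ""
    )
    if not key:
        raise RuntimeError("STABILITY_API_KEY is not set")
    return key
```

Failing early with a clear error when the key is missing tends to be easier to debug than an authentication failure on the first request.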
## 1. Operation Modes
| Command | What it does | Endpoint |
|---------|--------------|----------|
| `--mode generate` | Text to image (SD3.5) | `/generate/sd3` |
| `--mode ultra` | Premium text to image | `/generate/ultra` |
| `--mode core` | Fast text to image | `/generate/core` |
| `--mode img2img` | Image + text to new image | `/generate/sd3` |
| `--mode upscale` | Increase resolution (conservative) | `/upscale/conservative` |
| `--mode upscale-creative` | Increase resolution and add detail | `/upscale/creative` |
| `--mode remove-bg` | Remove background (transparent PNG) | `/edit/remove-background` |
| `--mode inpaint` | Edit part of the image (mask) | `/edit/inpaint` |
| `--mode search-replace` | Replace an object by description | `/edit/search-and-replace` |
| `--mode erase` | Erase part of the image | `/edit/erase` |
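The table above can be read as a lookup from mode to endpoint path. The sketch below encodes it that way. The `BASE_URL` value is an assumption based on Stability AI's v2beta Stable Image API and is not stated in this document, and `endpoint_url` is a hypothetical helper, not necessarily how `scripts/generate.py` resolves endpoints.

```python
# Assumption: endpoint paths are relative to the v2beta Stable Image API root.
BASE_URL = "https://api.stability.ai/v2beta/stable-image"

# Mode-to-path mapping copied from the operation modes table.
MODE_ENDPOINTS = {
    "generate": "/generate/sd3",
    "ultra": "/generate/ultra",
    "core": "/generate/core",
    "img2img": "/generate/sd3",  # same endpoint as "generate"
    "upscale": "/upscale/conservative",
    "upscale-creative": "/upscale/creative",
    "remove-bg": "/edit/remove-background",
    "inpaint": "/edit/inpaint",
    "search-replace": "/edit/search-and-replace",
    "erase": "/edit/erase",
}


def endpoint_url(mode: str) -> str:
    """Return the full URL for a mode, or raise ValueError if it is unknown."""
    try:
        return BASE_URL + MODE_ENDPOINTS[mode]
    except KeyError:
        raise ValueError(f"unknown mode: {mode!r}") from None
```

Note that `generate` and `img2img` share one endpoint, so the difference between them must come from the request parameters (such as the input image and strength) rather than from the URL.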
## 2. Usage Examples
```bash
# Basic generation (SD 3.5 Large)
python scripts/generate.py --prompt "a serene mountain landscape at sunset" --mode generate
# Maximum quality (Ultra)
python scripts/generate.py --prompt "cinematic portrait, dramatic lighting" --mode ultra --aspect-ratio 16:9
# Fast, for iteration (Core)
python scripts/generate.py --prompt "cute cat ninja" --mode core --style anime
# Image-to-image
python scripts/generate.py --prompt "watercolor style" --mode img2img --image foto.jpg --strength 0.7
# Conservative upscale
python scripts/generate.py --prompt "landscape photo" --mode upscale --image foto_pequena.jpg
# Remove background
python scripts/generate.py --mode remove-bg --image produto.jpg
# Inpainting with a mask
python scripts/generate.py --prompt "red roses" --mode inpaint --image jardim.jpg --mask mascara.png
# Search & Replace
python scripts/generate.py --prompt "a golden retriever" --mode search-replace --image parque.jpg --search "the cat"
# Erase an object
python scripts/generate.py --mode erase --image foto.jpg --mask area.png
# List models
python scripts/generate.py --list-models
# List styles
python scripts/generate.py --list-styles
# Analyze a prompt (automatic suggestions)
python scripts/generate.py --prompt "anime warrior girl, widescreen" --analyze --json
```
## 3. Aspect Ratios
| Name | Ratio | Aliases | Typical use |
|------|-------|---------|-------------|
| square | 1:1 | ig, instagram, quadrado | Instagram feed |
| portrait | 2:3 | retrato, pinterest | Portrait, poster |
| landscape | 3:2 | paisagem, horizontal | Landscape, banner |
| photo | 4:5 | ig-feed | Optimized Instagram feed |
| wide | 16:9 | widescreen, youtube, cinema, wallpaper | Cinema, YouTube |
| ultrawide | 21:9 | (none) | Ultrawide monitor |
| stories | 9:16 | vertical, tiktok, ig-stories | Stories, Reels |
| phone | 9:21 | (none) | Phone wallpaper |
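One way to resolve the names and aliases in the table to a ratio string is a simple lookup, sketched below. The function name `resolve_aspect_ratio` and the case-insensitive matching are assumptions; the script's real resolution rules are not described in this document.

```python
# Name -> (ratio, aliases), copied from the aspect ratio table.
ASPECT_RATIOS = {
    "square": ("1:1", ["ig", "instagram", "quadrado"]),
    "portrait": ("2:3", ["retrato", "pinterest"]),
    "landscape": ("3:2", ["paisagem", "horizontal"]),
    "photo": ("4:5", ["ig-feed"]),
    "wide": ("16:9", ["widescreen", "youtube", "cinema", "wallpaper"]),
    "ultrawide": ("21:9", []),
    "stories": ("9:16", ["vertical", "tiktok", "ig-stories"]),
    "phone": ("9:21", []),
}


def resolve_aspect_ratio(value: str) -> str:
    """Accept a name, an alias, or a raw ratio and return the ratio string."""
    needle = value.strip().lower()
    for name, (ratio, aliases) in ASPECT_RATIOS.items():
        if needle in (name, ratio, *aliases):
            return ratio
    raise ValueError(f"unknown aspect ratio: {value!r}")
```

Accepting the raw ratio as well as the name keeps calls such as `--aspect-ratio 16:9` from the usage examples valid.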
## 4. Styles (15 Presets)
Each style automatically appends qualifiers to the prompt:
| Style | Description | Best for |
|-------|-------------|----------|
| photorealistic | Cinematic photorealism | Portraits, scenes |
| anime | Japanese anime/manga | Characters, scenes |
| digital-art | Detailed digital art | General illustration |
| oil-painting | Classical oil painting | Classical art |
| watercolor | Fluid watercolor | Delicate art |
| pixel-art | Retro 8/16-bit pixel art | Retro games |
| 3d-render | Photorealistic 3D render | Products, 3D scenes |
| concept-art | Professional concept art | Games, films |
| comic | Stylized comics | Comic books |
| minimalist | Clean minimalism | Design, logos |
| fantasy | Epic fantasy art | RPG, medieval |
| sci-fi | Futuristic sci-fi | Cyberpunk, space |
| sketch | Pencil/charcoal drawing | Studies, sketches |
| pop-art | Vibrant pop art | Modern art |
| noir | Dramatic film noir | Dark atmosphere |
## 5. Output
Images are saved to `data/outputs/` using the naming pattern `{mode}_{style}_{timestamp}_{index}.png`.
Metadata is saved to a `.meta.json` file containing the original prompt, final prompt, model, aspect ratio, seed, time, and size.
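To make the naming pattern and metadata fields concrete, here is a minimal sketch. The timestamp layout, the JSON key names, and the units of `seconds` and `size` are assumptions; the document names the fields but not their exact format.

```python
import json
from datetime import datetime


def output_name(mode: str, style: str, index: int, when: datetime) -> str:
    """Build a file name following {mode}_{style}_{timestamp}_{index}.png."""
    # Assumption: the document does not specify the timestamp layout.
    timestamp = when.strftime("%Y%m%d_%H%M%S")
    return f"{mode}_{style}_{timestamp}_{index}.png"


def metadata_json(original_prompt, final_prompt, model, aspect_ratio,
                  seed, seconds, size) -> str:
    """Serialize the documented metadata fields (key names are hypothetical)."""
    record = {
        "original_prompt": original_prompt,
        "final_prompt": final_prompt,
        "model": model,
        "aspect_ratio": aspect_ratio,
        "seed": seed,
        "time_seconds": seconds,
        "size": size,
    }
    return json.dumps(record, indent=2, ensure_ascii=False)
```

Storing the seed alongside the final prompt is what makes a result reproducible later, so those two fields are the ones most worth checking in the saved metadata.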
## Integration With Other Skills
- **ai-studio-image**: Complementary; use Stability AI for art and Gemini for humanized photos
- **instagram**: Generate art → publish to Instagram
- **telegram**: Generate image → send via bot
## Rate Limits & Safety
- **Community License**: 150 requests per 10 seconds
- **Daily limit**: 100 images/day (configurable via `SAFETY_MAX_IMAGES_PER_DAY`)
- **Automatic retry** with exponential backoff on HTTP 429
- **API key fallback** (primary + backups)
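The retry behavior listed above can be sketched as follows. This is an illustration of exponential backoff on HTTP 429 under assumed parameters (five attempts, a one-second base delay), not the script's actual implementation. `call_with_backoff` and its `send` callback are hypothetical names.

```python
import time
from typing import Callable, Tuple


def call_with_backoff(
    send: Callable[[], Tuple[int, bytes]],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, bytes]:
    """Call send() until it returns a status other than 429.

    send() performs one request and returns (status_code, body).
    Delays double after each 429: base_delay, 2x, 4x, and so on.
    """
    status, body = 0, b""
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt < max_attempts - 1:
            sleep(base_delay * 2 ** attempt)
    # Attempts exhausted: return the last 429 response to the caller.
    return status, body
```

Passing `sleep` as a parameter keeps the function testable without real waiting. In production use, adding random jitter to each delay is a common refinement.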
## File Reference
| File | When to consult |
|------|-----------------|
| `references/setup-guide.md` | Initial setup, API key, troubleshooting |
| `references/prompt-engineering.md` | Advanced prompt techniques |
| `references/api-reference.md` | Endpoints, parameters, responses, errors |
## Best Practices
- Provide clear, specific context about your project and requirements
- Review all suggestions before applying them to production code
- Combine with other complementary skills for comprehensive analysis
## Common Pitfalls
- Using this skill for tasks outside its domain expertise
- Applying recommendations without understanding your specific context
- Not providing enough project context for accurate analysis
## Related Skills
- `ai-studio-image` - Complementary skill for enhanced analysis
- `comfyui-gateway` - Complementary skill for enhanced analysis
- `image-studio` - Complementary skill for enhanced analysis
## Limitations
- Use this skill only when the task clearly matches the scope described above.
- Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.