stability-ai

Included with Lifetime

$97 forever

Geracao de imagens via Stability AI (SD3.5, Ultra, Core). Text-to-image, img2img, inpainting, upscale, remove-bg, search-replace. 15 estilos artisticos.

Image & Videoimage-generationstable-diffusionai-artapiscripts

What this skill does


# Stability AI — Gerador de Imagens Profissional

## Overview

Geracao de imagens via Stability AI (SD3.5, Ultra, Core). Text-to-image, img2img, inpainting, upscale, remove-bg, search-replace. 15 estilos artisticos.

## When to Use This Skill

- When the user mentions "stability ai" or related topics
- When the user mentions "stable diffusion" or related topics
- When the user mentions "sd3.5" or related topics
- When the user mentions "gerar arte" or related topics
- When the user mentions "gerar ilustracao" or related topics
- When the user mentions "image to image" or related topics

## Do Not Use This Skill When

- The task is unrelated to stability ai
- A simpler, more specific tool can handle the request
- The user needs general-purpose assistance without domain expertise

## How It Works

Skill para gerar imagens artisticas e fotorrealistas usando a Stability AI API.
**Gratuito** com Community License (sem limite para uso pessoal/pequenas empresas).

## Quando Usar Esta Skill Vs Ai-Studio-Image

| Cenario | Skill recomendada |
|---------|-------------------|
| Foto humanizada para Instagram/redes sociais | ai-studio-image |
| Arte digital, ilustracao, concept art | **stability-ai** |
| Foto com camera de celular (realismo casual) | ai-studio-image |
| Fotorrealismo cinematografico (8K, detalhado) | **stability-ai** |
| Material educacional com visual profissional | ai-studio-image |
| Poster, wallpaper, book cover, game asset | **stability-ai** |
| Inpainting (editar parte de uma imagem) | **stability-ai** |
| Upscale (aumentar resolucao) | **stability-ai** |
| Remover fundo de imagem | **stability-ai** |
| Search & Replace (trocar objeto em imagem) | **stability-ai** |
| Apagar elemento de uma imagem | **stability-ai** |

## Setup Rapido

1. Criar conta em **platform.stability.ai** (gratuito)
2. Copiar API Key do dashboard
3. Colar no `.env`: `STABILITY_API_KEY=sk-sua-chave-aqui`
4. `pip install -r scripts/requirements.txt`

Detalhes completos em `references/setup-guide.md`.

## 1. Modos De Operacao

| Comando | O que faz | Endpoint |
|---------|-----------|----------|
| `--mode generate` | Texto para imagem (SD3.5) | `/generate/sd3` |
| `--mode ultra` | Texto para imagem premium | `/generate/ultra` |
| `--mode core` | Texto para imagem rapido | `/generate/core` |
| `--mode img2img` | Imagem + texto para nova imagem | `/generate/sd3` |
| `--mode upscale` | Aumentar resolucao (conservativo) | `/upscale/conservative` |
| `--mode upscale-creative` | Aumentar resolucao com detalhes | `/upscale/creative` |
| `--mode remove-bg` | Remover fundo (PNG transparente) | `/edit/remove-background` |
| `--mode inpaint` | Editar parte da imagem (mascara) | `/edit/inpaint` |
| `--mode search-replace` | Trocar objeto por descricao | `/edit/search-and-replace` |
| `--mode erase` | Apagar parte da imagem | `/edit/erase` |

## 2. Exemplos De Uso

```bash

## Geracao Basica (Sd 3.5 Large)

python scripts/generate.py --prompt "a serene mountain landscape at sunset" --mode generate

## Qualidade Maxima (Ultra)

python scripts/generate.py --prompt "cinematic portrait, dramatic lighting" --mode ultra --aspect-ratio 16:9

## Rapido Para Iteracao (Core)

python scripts/generate.py --prompt "cute cat ninja" --mode core --style anime

## Image-To-Image

python scripts/generate.py --prompt "watercolor style" --mode img2img --image foto.jpg --strength 0.7

## Upscale Conservativo

python scripts/generate.py --prompt "landscape photo" --mode upscale --image foto_pequena.jpg

## Remover Fundo

python scripts/generate.py --mode remove-bg --image produto.jpg

## Inpainting Com Mascara

python scripts/generate.py --prompt "red roses" --mode inpaint --image jardim.jpg --mask mascara.png

## Search & Replace

python scripts/generate.py --prompt "a golden retriever" --mode search-replace --image parque.jpg --search "the cat"

## Apagar Objeto

python scripts/generate.py --mode erase --image foto.jpg --mask area.png

## Listar Modelos

python scripts/generate.py --list-models

## Listar Estilos

python scripts/generate.py --list-styles

## Analisar Prompt (Sugestoes Automaticas)

python scripts/generate.py --prompt "anime warrior girl, widescreen" --analyze --json
```

## 3. Aspect Ratios

| Nome | Ratio | Aliases | Uso tipico |
|------|-------|---------|-----------|
| square | 1:1 | ig, instagram, quadrado | Feed Instagram |
| portrait | 2:3 | retrato, pinterest | Retrato, poster |
| landscape | 3:2 | paisagem, horizontal | Paisagem, banner |
| photo | 4:5 | ig-feed | Instagram feed otimizado |
| wide | 16:9 | widescreen, youtube, cinema, wallpaper | Cinema, YT |
| ultrawide | 21:9 | — | Monitor ultrawide |
| stories | 9:16 | vertical, tiktok, ig-stories | Stories, Reels |
| phone | 9:21 | — | Wallpaper celular |

## 4. Estilos (15 Presets)

Cada estilo adiciona qualificadores automaticamente ao prompt:

| Estilo | Descricao | Ideal para |
|--------|-----------|-----------|
| photorealistic | Fotorrealismo cinematografico | Retratos, cenas |
| anime | Anime/Manga japones | Personagens, cenas |
| digital-art | Arte digital detalhada | Ilustracoes gerais |
| oil-painting | Pintura a oleo classica | Arte classica |
| watercolor | Aquarela fluida | Arte delicada |
| pixel-art | Pixel art retro 8/16-bit | Games retro |
| 3d-render | Render 3D fotorrealista | Produtos, cenas 3D |
| concept-art | Concept art profissional | Games, filmes |
| comic | Comics/HQ estilizado | Quadrinhos |
| minimalist | Minimalista limpo | Design, logos |
| fantasy | Fantasy art epico | RPG, medieval |
| sci-fi | Sci-fi futurista | Cyberpunk, espaco |
| sketch | Desenho a lapis/carvao | Estudos, rascunhos |
| pop-art | Pop art vibrante | Arte moderna |
| noir | Film noir dramatico | Atmosfera sombria |

## 5. Output

Imagens salvas em `data/outputs/` com naming: `{mode}_{style}_{timestamp}_{index}.png`

Metadados salvos em `.meta.json` com: prompt original, prompt final, modelo, aspect ratio, seed, tempo, tamanho.

## Integracao Com Outras Skills

- **ai-studio-image**: Complementar — Stability AI para arte, Gemini para fotos humanizadas
- **instagram**: Gerar arte → publicar no Instagram
- **telegram**: Gerar imagem → enviar via bot

## Rate Limits & Seguranca

- **Community License**: 150 requests/10 segundos
- **Limite diario**: 100 imagens/dia (configuravel via `SAFETY_MAX_IMAGES_PER_DAY`)
- **Retry automatico** com backoff exponencial em caso de 429
- **Fallback de API keys** (primaria + backups)

## Referencia De Arquivos

| Arquivo | Quando consultar |
|---------|-----------------|
| `references/setup-guide.md` | Setup inicial, API key, troubleshooting |
| `references/prompt-engineering.md` | Tecnicas avancadas de prompt |
| `references/api-reference.md` | Endpoints, parametros, respostas, erros |

## Best Practices

- Provide clear, specific context about your project and requirements
- Review all suggestions before applying them to production code
- Combine with other complementary skills for comprehensive analysis

## Common Pitfalls

- Using this skill for tasks outside its domain expertise
- Applying recommendations without understanding your specific context
- Not providing enough project context for accurate analysis

## Related Skills

- `ai-studio-image` - Complementary skill for enhanced analysis
- `comfyui-gateway` - Complementary skill for enhanced analysis
- `image-studio` - Complementary skill for enhanced analysis

## Limitations
- Use this skill only when the task clearly matches the scope described above.
- Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.

Files: 8

Size: 64.2 KB

Complexity: 70/100

Category: Image & Video

Source: https://github.com/sickn33/antigravity-awesome-skills/tree/main/plugins/antigravity-awesome-skills/skills/stability-ai

Related in Image & Video

watch

Included

Watch a video (URL or local path). Downloads with yt-dlp, extracts auto-scaled frames with ffmpeg, pulls the transcript from captions (or Whisper API fallback), and hands the result to Claude so it can answer questions about what's in the video.

Image & Videoscriptsfeatured

physical-ai-defect-image-generation

Included

Use when the user wants to orchestrate defect image generation, run associated setup, or handle outputs on OSMO. The Day 0 path handles cold-start with USD-to-ROI, image-edit augmentation, and AnomalyGen to create initial PCBA datasets. The Day 1 path performs inference and labeling on real images. This skill helps with first-time asset setup, creation of finetuning checkpoints, and configuring deployment. Trigger keywords: defect image generation, dig workflow, dig pipeline, defect image detection workflow, aoi pipeline, aoi anomalygen, usd2roi anomalygen, day 0 pcba, day 1 pcba, day 1 real-photo alignment, day 1 manual roi, metal surface anomaly, glass defect, anomalygen finetune, setup_pcb, setup_metal, setup_glass, setup_pretrained, dig setup, dig datasets, dig pretrained checkpoint, dig image-edit endpoint.

Image & Videoscripts

accelint-react-best-practices

Included

React performance optimization and best practices. ALWAYS use this skill when working with any React code - writing components, hooks, JSX; refactoring; optimizing re-renders, memoization, state management; reviewing for performance; fixing hydration mismatches; debugging infinite re-renders, stale closures, input focus loss, animations restarting; preventing remounting; implementing transitions, lazy initialization, effect dependencies. Even simple React tasks benefit from these patterns. Covers React 19+ (useEffectEvent, Activity, ref props). Triggers - useEffect, useState, useMemo, useCallback, memo, inline components, nested components, components inside components, re-render, performance, hydration, SSR, Next.js, useDeferredValue, combined hooks.

Image & Videoscripts

elevenlabs-agents

Included

Build conversational AI voice agents with ElevenLabs Platform using React, JavaScript, React Native, or Swift SDKs. Configure agents, tools (client/server/MCP), RAG knowledge bases, multi-voice, and Scribe real-time STT. Use when: building voice chat interfaces, implementing AI phone agents with Twilio, configuring agent workflows or tools, adding RAG knowledge bases, testing with CLI "agents as code", or troubleshooting deprecated @11labs packages, Android audio cutoff, CSP violations, dynamic variables, or WebRTC config. Keywords: ElevenLabs Agents, ElevenLabs voice agents, AI voice agents, conversational AI, @elevenlabs/react, @elevenlabs/client, @elevenlabs/react-native, @elevenlabs/elevenlabs-js, @elevenlabs/agents-cli, elevenlabs SDK, voice AI, TTS, text-to-speech, ASR, speech recognition, turn-taking model, WebRTC voice, WebSocket voice, ElevenLabs conversation, agent system prompt, agent tools, agent knowledge base, RAG voice agents, multi-voice agents, pronunciation dictionary, voice speed control, elevenlabs scribe, @11labs deprecated, Android audio cutoff, CSP violation elevenlabs, dynamic variables elevenlabs, case-sensitive tool names, webhook authentication

Image & Videoscripts

humanizer

Included

Humanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 28 pattern detectors, 560+ AI vocabulary terms across 3 tiers, and statistical analysis (burstiness, type-token ratio, readability) for comprehensive detection. Use when asked to humanize text, de-AI writing, make content sound more natural/human, review writing for AI patterns, score text for AI detection, or improve AI-generated drafts. Covers content, language, style, communication, and filler categories.

Image & Videoscripts

generating-mermaid-diagrams

Included

Salesforce architecture diagrams using Mermaid with ASCII fallback. Use this skill when generating text-based diagrams for Salesforce architecture, OAuth flows, ERDs, integration sequences, or Agentforce structure. TRIGGER when: user says "diagram", "visualize", "ERD", or asks for sequence diagrams, flowcharts, class diagrams, or architecture visualizations in Mermaid. DO NOT TRIGGER when: user wants PNG/SVG image output (use generating-visual-diagrams), or asks about non-Salesforce systems.

Image & Videoscripts