scientific-schematics
Create publication-quality scientific diagrams using Nano Banana Pro AI with smart iterative refinement. Uses Gemini 3 Pro for quality review. Only regenerates if quality is below threshold for your document type. Specialized in neural network architectures, system diagrams, flowcharts, biological pathways, and complex scientific visualizations.
What this skill does
# Scientific Schematics and Diagrams ## Overview Scientific schematics and diagrams transform complex concepts into clear visual representations for publication. **This skill uses Nano Banana Pro AI for diagram generation with Gemini 3 Pro quality review.** **How it works:** - Describe your diagram in natural language - Nano Banana Pro generates publication-quality images automatically - **Gemini 3 Pro reviews quality** against document-type thresholds - **Smart iteration**: Only regenerates if quality is below threshold - Publication-ready output in minutes - No coding, templates, or manual drawing required **Quality Thresholds by Document Type:** | Document Type | Threshold | Description | |---------------|-----------|-------------| | journal | 8.5/10 | Nature, Science, peer-reviewed journals | | conference | 8.0/10 | Conference papers | | thesis | 8.0/10 | Dissertations, theses | | grant | 8.0/10 | Grant proposals | | preprint | 7.5/10 | arXiv, bioRxiv, etc. | | report | 7.5/10 | Technical reports | | poster | 7.0/10 | Academic posters | | presentation | 6.5/10 | Slides, talks | | default | 7.5/10 | General purpose | **Simply describe what you want, and Nano Banana Pro creates it.** All diagrams are stored in the figures/ subfolder and referenced in papers/posters. ## Quick Start: Generate Any Diagram Create any scientific diagram by simply describing it. Nano Banana Pro handles everything automatically with **smart iteration**: ```bash # Generate for journal paper (highest quality threshold: 8.5/10) python scripts/generate_schematic.py "CONSORT participant flow diagram with 500 screened, 150 excluded, 350 randomized" -o figures/consort.png --doc-type journal # Generate for presentation (lower threshold: 6.5/10 - faster) python scripts/generate_schematic.py "Transformer encoder-decoder architecture showing multi-head attention" -o figures/transformer.png --doc-type presentation # Generate for poster (moderate threshold: 7.0/10) python scripts/generate_schematic.py "MAPK signaling pathway from EGFR to gene transcription" -o figures/mapk_pathway.png --doc-type poster # Custom max iterations (max 2) python scripts/generate_schematic.py "Complex circuit diagram with op-amp, resistors, and capacitors" -o figures/circuit.png --iterations 2 --doc-type journal ``` **What happens behind the scenes:** 1. **Generation 1**: Nano Banana Pro creates initial image following scientific diagram best practices 2. **Review 1**: **Gemini 3 Pro** evaluates quality against document-type threshold 3. **Decision**: If quality >= threshold → **DONE** (no more iterations needed!) 4. **If below threshold**: Improved prompt based on critique, regenerate 5. **Repeat**: Until quality meets threshold OR max iterations reached **Smart Iteration Benefits:** - ✅ Saves API calls if first generation is good enough - ✅ Higher quality standards for journal papers - ✅ Faster turnaround for presentations/posters - ✅ Appropriate quality for each use case **Output**: Versioned images plus a detailed review log with quality scores, critiques, and early-stop information. ### Configuration Set your OpenRouter API key: ```bash export OPENROUTER_API_KEY='your_api_key_here' ``` Get an API key at: https://openrouter.ai/keys ### AI Generation Best Practices **Effective Prompts for Scientific Diagrams:** ✓ **Good prompts** (specific, detailed): - "CONSORT flowchart showing participant flow from screening (n=500) through randomization to final analysis" - "Transformer neural network architecture with encoder stack on left, decoder stack on right, showing multi-head attention and cross-attention connections" - "Biological signaling cascade: EGFR receptor → RAS → RAF → MEK → ERK → nucleus, with phosphorylation steps labeled" - "Block diagram of IoT system: sensors → microcontroller → WiFi module → cloud server → mobile app" ✗ **Avoid vague prompts**: - "Make a flowchart" (too generic) - "Neural network" (which type? what components?) - "Pathway diagram" (which pathway? what molecules?) **Key elements to include:** - **Type**: Flowchart, architecture diagram, pathway, circuit, etc. - **Components**: Specific elements to include - **Flow/Direction**: How elements connect (left-to-right, top-to-bottom) - **Labels**: Key annotations or text to include - **Style**: Any specific visual requirements **Scientific Quality Guidelines** (automatically applied): - Clean white/light background - High contrast for readability - Clear, readable labels (minimum 10pt) - Professional typography (sans-serif fonts) - Colorblind-friendly colors (Okabe-Ito palette) - Proper spacing to prevent crowding - Scale bars, legends, axes where appropriate ## When to Use This Skill This skill should be used when: - Creating neural network architecture diagrams (Transformers, CNNs, RNNs, etc.) - Illustrating system architectures and data flow diagrams - Drawing methodology flowcharts for study design (CONSORT, PRISMA) - Visualizing algorithm workflows and processing pipelines - Creating circuit diagrams and electrical schematics - Depicting biological pathways and molecular interactions - Generating network topologies and hierarchical structures - Illustrating conceptual frameworks and theoretical models - Designing block diagrams for technical papers ## How to Use This Skill **Simply describe your diagram in natural language.** Nano Banana Pro generates it automatically: ```bash python scripts/generate_schematic.py "your diagram description" -o output.png ``` **That's it!** The AI handles: - ✓ Layout and composition - ✓ Labels and annotations - ✓ Colors and styling - ✓ Quality review and refinement - ✓ Publication-ready output **Works for all diagram types:** - Flowcharts (CONSORT, PRISMA, etc.) - Neural network architectures - Biological pathways - Circuit diagrams - System architectures - Block diagrams - Any scientific visualization **No coding, no templates, no manual drawing required.** --- # AI Generation Mode (Nano Banana Pro + Gemini 3 Pro Review) ## Smart Iterative Refinement Workflow The AI generation system uses **smart iteration** - it only regenerates if quality is below the threshold for your document type: ### How Smart Iteration Works ``` ┌─────────────────────────────────────────────────────┐ │ 1. Generate image with Nano Banana Pro │ │ ↓ │ │ 2. Review quality with Gemini 3 Pro │ │ ↓ │ │ 3. Score >= threshold? │ │ YES → DONE! (early stop) │ │ NO → Improve prompt, go to step 1 │ │ ↓ │ │ 4. Repeat until quality met OR max iterations │ └─────────────────────────────────────────────────────┘ ``` ### Iteration 1: Initial Generation **Prompt Construction:** ``` Scientific diagram guidelines + User request ``` **Output:** `diagram_v1.png` ### Quality Review by Gemini 3 Pro Gemini 3 Pro evaluates the diagram on: 1. **Scientific Accuracy** (0-2 points) - Correct concepts, notation, relationships 2. **Clarity and Readability** (0-2 points) - Easy to understand, clear hierarchy 3. **Label Quality** (0-2 points) - Complete, readable, consistent labels 4. **Layout and Composition** (0-2 points) - Logical flow, balanced, no overlaps 5. **Professional Appearance** (0-2 points) - Publication-ready quality **Example Review Output:** ``` SCORE: 8.0 STRENGTHS: - Clear flow from top to bottom - All phases properly labeled - Professional typography ISSUES: - Participant counts slightly small - Minor overlap on exclusion box VERDICT: ACCEPTABLE (for poster, threshold 7.0) ``` ### Decision Point: Continue or Stop? | If Score... | Action | |-------------|--------| | >= threshold | **STOP** - Quality is good enough for this document type | | < threshold | Continue to next iteration with improved prompt | **Example:** - For a **poster**
Related in Image & Video
watch
IncludedWatch a video (URL or local path). Downloads with yt-dlp, extracts auto-scaled frames with ffmpeg, pulls the transcript from captions (or Whisper API fallback), and hands the result to Claude so it can answer questions about what's in the video.
physical-ai-defect-image-generation
IncludedUse when the user wants to orchestrate defect image generation, run associated setup, or handle outputs on OSMO. The Day 0 path handles cold-start with USD-to-ROI, image-edit augmentation, and AnomalyGen to create initial PCBA datasets. The Day 1 path performs inference and labeling on real images. This skill helps with first-time asset setup, creation of finetuning checkpoints, and configuring deployment. Trigger keywords: defect image generation, dig workflow, dig pipeline, defect image detection workflow, aoi pipeline, aoi anomalygen, usd2roi anomalygen, day 0 pcba, day 1 pcba, day 1 real-photo alignment, day 1 manual roi, metal surface anomaly, glass defect, anomalygen finetune, setup_pcb, setup_metal, setup_glass, setup_pretrained, dig setup, dig datasets, dig pretrained checkpoint, dig image-edit endpoint.
accelint-react-best-practices
IncludedReact performance optimization and best practices. ALWAYS use this skill when working with any React code - writing components, hooks, JSX; refactoring; optimizing re-renders, memoization, state management; reviewing for performance; fixing hydration mismatches; debugging infinite re-renders, stale closures, input focus loss, animations restarting; preventing remounting; implementing transitions, lazy initialization, effect dependencies. Even simple React tasks benefit from these patterns. Covers React 19+ (useEffectEvent, Activity, ref props). Triggers - useEffect, useState, useMemo, useCallback, memo, inline components, nested components, components inside components, re-render, performance, hydration, SSR, Next.js, useDeferredValue, combined hooks.
elevenlabs-agents
IncludedBuild conversational AI voice agents with ElevenLabs Platform using React, JavaScript, React Native, or Swift SDKs. Configure agents, tools (client/server/MCP), RAG knowledge bases, multi-voice, and Scribe real-time STT. Use when: building voice chat interfaces, implementing AI phone agents with Twilio, configuring agent workflows or tools, adding RAG knowledge bases, testing with CLI "agents as code", or troubleshooting deprecated @11labs packages, Android audio cutoff, CSP violations, dynamic variables, or WebRTC config. Keywords: ElevenLabs Agents, ElevenLabs voice agents, AI voice agents, conversational AI, @elevenlabs/react, @elevenlabs/client, @elevenlabs/react-native, @elevenlabs/elevenlabs-js, @elevenlabs/agents-cli, elevenlabs SDK, voice AI, TTS, text-to-speech, ASR, speech recognition, turn-taking model, WebRTC voice, WebSocket voice, ElevenLabs conversation, agent system prompt, agent tools, agent knowledge base, RAG voice agents, multi-voice agents, pronunciation dictionary, voice speed control, elevenlabs scribe, @11labs deprecated, Android audio cutoff, CSP violation elevenlabs, dynamic variables elevenlabs, case-sensitive tool names, webhook authentication
humanizer
IncludedHumanize AI-generated text by detecting and removing patterns typical of LLM output. Rewrites text to sound natural, specific, and human. Uses 28 pattern detectors, 560+ AI vocabulary terms across 3 tiers, and statistical analysis (burstiness, type-token ratio, readability) for comprehensive detection. Use when asked to humanize text, de-AI writing, make content sound more natural/human, review writing for AI patterns, score text for AI detection, or improve AI-generated drafts. Covers content, language, style, communication, and filler categories.
generating-mermaid-diagrams
IncludedSalesforce architecture diagrams using Mermaid with ASCII fallback. Use this skill when generating text-based diagrams for Salesforce architecture, OAuth flows, ERDs, integration sequences, or Agentforce structure. TRIGGER when: user says "diagram", "visualize", "ERD", or asks for sequence diagrams, flowcharts, class diagrams, or architecture visualizations in Mermaid. DO NOT TRIGGER when: user wants PNG/SVG image output (use generating-visual-diagrams), or asks about non-Salesforce systems.