Documents
Read, write, convert, and analyze documents โ routes to PDF, DOCX, XLSX, PPTX sub-skills for creation, editing, extraction, and format conversion. USE WHEN document, process file, create document, convert format, extract text, PDF, DOCX, XLSX, PPTX, Word, Excel, spreadsheet, PowerPoint, presentation, slides, consulting report, large PDF, merge PDF, fill form, tracked changes, redlining.
What this skill does
## Customization
**Before executing, check for user customizations at:**
`~/.claude/PAI/USER/SKILLCUSTOMIZATIONS/Documents/`
If this directory exists, load and apply any PREFERENCES.md, configurations, or resources found there. These override default behavior. If the directory does not exist, proceed with skill defaults.
## ๐จ MANDATORY: Voice Notification (REQUIRED BEFORE ANY ACTION)
**You MUST send this notification BEFORE doing anything else when this skill is invoked.**
1. **Send voice notification**:
```bash
curl -s -X POST http://localhost:8888/notify \
-H "Content-Type: application/json" \
-d '{"message": "Running the WORKFLOWNAME workflow in the Documents skill to ACTION"}' \
> /dev/null 2>&1 &
```
2. **Output text notification**:
```
Running the **WorkflowName** workflow in the **Documents** skill to ACTION...
```
**This is not optional. Execute this curl command immediately upon skill invocation.**
# Documents Skill
## ๐ฏ Load Full PAI Context
**Before starting any task with this skill, load complete PAI context:**
`read ~/.claude/PAI/SKILL.md`
## When to Activate This Skill
### Word Documents (DOCX)
- User wants to create, edit, or analyze Word documents
- User mentions "tracked changes", "redlining", "document review"
- User needs to convert documents to other formats
- User wants to work with document structure, comments, or formatting
### PDF Files
- User wants to create, merge, split, or manipulate PDFs
- User mentions "extract text from PDF", "PDF tables", "fill PDF form"
- User needs to convert PDFs to/from other formats
- User wants to add watermarks, passwords, or extract images
### PowerPoint Presentations (PPTX)
- User wants to create or edit presentations
- User mentions "slides", "presentation template", "speaker notes"
- User needs to convert presentations to other formats
- User wants to work with slide layouts or design elements
### Excel Spreadsheets (XLSX)
- User wants to create or edit spreadsheets
- User mentions "formulas", "financial model", "data analysis"
- User needs to work with Excel tables, charts, or pivot tables
- User wants to convert spreadsheets to/from other formats
## Workflow Routing
| Request Pattern | Route To |
|---|---|
| Consulting report, McKinsey report, assessment report, professional PDF | `Workflows/ConsultingReport.md` |
| Large PDF, process big PDF, Gemini PDF | `Workflows/ProcessLargePdfGemini3.md` |
| Word document, DOCX, create docx, edit docx, tracked changes, redlining | `Docx/SKILL.md` |
| PDF, create PDF, merge PDF, split PDF, extract text from PDF, fill form | `Pdf/SKILL.md` |
| Presentation, PPTX, slides, PowerPoint, speaker notes | `Pptx/SKILL.md` |
| Spreadsheet, XLSX, Excel, formulas, financial model, data analysis | `Xlsx/SKILL.md` |
## Document Type Details
This skill organizes document processing across 4 document types plus specialized workflows:
### Consulting Reports (HTML + Playwright PDF)
**Reference Documentation:**
- `Workflows/ConsultingReport.md` - Complete consulting report generation workflow
**Routing Logic:**
- "Create consulting report", "generate report PDF" โ ConsultingReport workflow
- "Build assessment report", "strategic assessment" โ ConsultingReport workflow
- "McKinsey-style report", "professional report PDF" โ ConsultingReport workflow
**Pipeline:** Report Artifacts โ Structured HTML โ Playwright PDF
**Key Capabilities:**
- Parse report directories with mixed content (markdown, TypeScript data, images)
- Professional CSS typography (Georgia serif body, Inter sans headings)
- Color-coded callout boxes (red/amber/green) and severity badges
- Auto-generated linked Table of Contents
- Cover page with classification marking
- Headers/footers with CONFIDENTIAL and page numbers
- Image compression pipeline (PNG โ JPEG, max 1200px)
- A4 format with Playwright for pixel-perfect PDF output
**Reference Implementation:** Quorum Cyber assessment at `/path/to/reference/generate-pdf.mjs`
### Word Documents (DOCX)
**Reference Documentation:**
- `docx/SKILL.md` - Complete DOCX processing guide
- `docx/docx-js.md` - Creating new documents with JavaScript
- `docx/ooxml.md` - Editing existing documents with OOXML
**Routing Logic:**
- "Create Word document", "new docx" โ Create workflow (docx-js)
- "Edit Word document", "tracked changes", "redlining" โ Edit workflow (OOXML)
- "Read Word document", "extract text from docx" โ Read workflow (pandoc)
- "Document review", "track changes" โ Redlining workflow
**Supporting Resources:**
- Scripts: `~/.claude/skills/Utilities/Documents/Docx/Scripts/`
- OOXML tools: `~/.claude/skills/Utilities/Documents/Docx/ooxml/`
- License: `~/.claude/skills/Utilities/Documents/Docx/LICENSE.txt`
**Key Capabilities:**
- Create professional documents with docx-js
- Edit with tracked changes (redlining workflow)
- Extract text/comments with pandoc
- Convert to images for visual inspection
- Work with raw OOXML for advanced features
### PDF Processing
**Reference Documentation:**
- `pdf/SKILL.md` - Complete PDF processing guide
- `pdf/forms.md` - Filling PDF forms
- `pdf/reference.md` - Advanced features and troubleshooting
**Routing Logic:**
- "Create PDF" โ Creation workflow (reportlab)
- "Merge PDFs", "split PDF" โ Manipulation workflow (pypdf)
- "Extract text from PDF" โ Extraction workflow (pdfplumber)
- "Fill PDF form" โ Forms workflow (pdf-lib or pypdf)
- "Extract tables from PDF" โ Table extraction (pdfplumber + pandas)
**Supporting Resources:**
- Scripts: `~/.claude/skills/Utilities/Documents/Pdf/Scripts/`
- License: `~/.claude/skills/Utilities/Documents/Pdf/LICENSE.txt`
**Key Capabilities:**
- Create PDFs with reportlab
- Extract text/tables with pdfplumber
- Merge/split with pypdf or qpdf
- Fill forms programmatically
- Add watermarks and password protection
- Extract images from PDFs
### PowerPoint Presentations (PPTX)
**Reference Documentation:**
- `pptx/SKILL.md` - Complete PPTX processing guide
- `pptx/html2pptx.md` - Creating presentations from HTML
- `pptx/ooxml.md` - Editing existing presentations
**Routing Logic:**
- "Create presentation", "new slides" โ Creation workflow (html2pptx)
- "Edit presentation", "modify slides" โ Edit workflow (OOXML)
- "Use presentation template" โ Template workflow
- "Extract slide text" โ Read workflow (markitdown)
- "Create thumbnail grid" โ Thumbnail workflow
**Supporting Resources:**
- Scripts: `~/.claude/skills/Utilities/Documents/Pptx/Scripts/`
- OOXML tools: `~/.claude/skills/Utilities/Documents/Pptx/ooxml/`
- License: `~/.claude/skills/Utilities/Documents/Pptx/LICENSE.txt`
**Key Capabilities:**
- Create presentations with html2pptx (HTML โ PPTX)
- Professional design with color palettes and layouts
- Edit with OOXML for advanced features
- Work with templates (rearrange, inventory, replace)
- Generate thumbnail grids for visual analysis
- Convert to images for inspection
### Excel Spreadsheets (XLSX)
**Reference Documentation:**
- `xlsx/SKILL.md` - Complete XLSX processing guide
- `xlsx/recalc.py` - Formula recalculation script
**Routing Logic:**
- "Create spreadsheet", "new Excel file" โ Creation workflow (openpyxl)
- "Edit spreadsheet", "modify Excel" โ Edit workflow (openpyxl)
- "Analyze data", "read Excel" โ Analysis workflow (pandas)
- "Financial model", "formulas" โ Financial modeling workflow
- "Recalculate formulas" โ Recalculation workflow (recalc.py)
**Supporting Resources:**
- Recalc script: `~/.claude/skills/Utilities/Documents/Xlsx/recalc.py`
- License: `~/.claude/skills/Utilities/Documents/Xlsx/LICENSE.txt`
**Key Capabilities:**
- Create spreadsheets with formulas (openpyxl)
- Data analysis with pandas
- Financial modeling with color coding standards
- Formula recalculation with LibreOffice
- Error detection and validation
- Preserve formatting and formulas when editing
## ๐ Document Processing Principles
### DOCX Best Practices
1. **Tracked Changes** - Use redlining workflow for professional documRelated in Ads & Marketing
ads
IncludedMulti-platform paid advertising audit and optimization skill. Analyzes Google, Meta, YouTube, LinkedIn, TikTok, Microsoft, and Apple Ads. 250+ checks with scoring, parallel agents, industry templates, and AI creative generation.
banana
IncludedAI image generation Creative Director powered by Google Gemini Nano Banana models. Use this skill for ANY request involving image creation, editing, visual asset production, or creative direction. Triggers on: generate an image, create a photo, edit this picture, design a logo, make a banner, visual for my anything, and all /banana commands. Handles text-to-image, image editing, multi-turn creative sessions, batch workflows, and brand presets.
rpg-migration-analyzer
IncludedAnalyzes legacy RPG (Report Program Generator) programs from AS/400 and IBM i systems for migration to modern Java applications. Extracts business logic from RPG III/IV/ILE source code, identifies data structures (D-specs), file operations (F-specs), program dependencies (CALLB/CALLP), and converts RPG constructs to Java equivalents. Generates migration reports, complexity estimates, and Java implementation strategies with POJO classes, JPA entities, and service methods. Use when modernizing AS/400 or IBM i legacy systems, analyzing RPG source files (.rpg, .rpgle, .RPGLE), converting RPG to Java, mapping data specifications to Java classes, planning legacy system migration, or when user mentions RPG analysis, Report Program Generator, RPG III/IV/ILE, AS/400 modernization, IBM i migration, packed decimal conversion, or mainframe application rewrite.
brand-library-architect
IncludedBuild a complete brand library for a product โ visual asset render pipeline, brand documentation set (BRAND, COPY, MANIFESTO, BIOS, FAQ, GLOSSARY, TONE, PRICING), open-source convention files (README, CONTRIBUTING, SECURITY, CODE_OF_CONDUCT), and a self-contained press kit. This skill should be used when the user asks to "build a brand library / brand kit / press kit / brand assets" for a product, "set up a brand library workflow," "create a positioning manifesto plus visual identity," or any combination of brand documentation + visual asset pipeline. Apply phase-by-phase or run end-to-end. Templates are product-agnostic and use {{TOKEN}} placeholders the skill prompts the user to fill.
writing-tech-post
IncludedAuthors engineering blog posts end-to-end: launch deep-dives, incident postmortems, architecture migrations, performance case studies, tutorials, AI/agent system writeups, security disclosures, and research-to-product translations. Picks the correct archetype, plans the abstraction ladder, enforces an evidence cadence (diagrams, benchmarks, profiles, traces, code, ablations), tunes voice against publisher house styles (Datadog, Vercel, GitHub, AWS, Meta, Cloudflare, Jane Street), and runs a pre-publish gate for narrative momentum and disclosure ethics. Use when drafting a new engineering post, restructuring a draft that feels flat, deciding which evidence form belongs where, validating that depth and product context are balanced, or preparing a postmortem, migration, or performance narrative for external publication. Do not use for API reference documentation, README authoring, marketing copy, release notes, generic SEO content, ghost-written executive thought leadership, or non-engineering long-form essays.
blog-google
IncludedGoogle API integration for blog performance: PageSpeed Insights, CrUX Core Web Vitals with 25-week history, Search Console performance, URL Inspection, Indexing API, GA4 organic traffic, NLP entity analysis for E-E-A-T, YouTube video search for embedding, and Google Ads Keyword Planner. Progressive feature availability based on credential tier (API key, OAuth/service account, GA4, Ads). Shares config with claude-seo at ~/.config/claude-seo/google-api.json. Use when user says "google data", "page speed", "core web vitals", "search console", "indexation", "GA4", "keyword research", "nlp entities", "blog performance", "youtube search", "google api setup".