reflect

Included with Lifetime

$97 forever

Analyze command history to identify which skills work, which fail, and where to improve.

analysis

What this skill does


## MANDATORY PREPARATION

Invoke /agent-workflow — it contains workflow principles, anti-patterns, and the **Context Gathering Protocol**. Follow the protocol before proceeding — if no workflow context exists yet, you MUST run /teach-maestro first.

---

Analyze the Maestro audit trail and decision log to produce a skill-effectiveness scorecard. This tells you which commands work, which fail, and where your workflow needs attention.

### Data Sources

Read these files from the project root:

1. **`.maestro/audit.jsonl`** — every command invocation with duration, cost, and outcome
2. **`.maestro/decisions.jsonl`** — decisions made with outcomes and next steps

If neither file exists, respond: *"No audit data found. Run commands with Maestro to start tracking, then come back."*

### Analysis Dimensions

**1. Usage Frequency**
- Which commands run most/least?
- Are any commands never used? (candidates for removal)

**2. Completion Rate**
- What % of invocations complete successfully?
- Which commands fail most often?

**3. Command Flow**
- What are the most common command sequences (A → B)?
- Which commands lead to follow-ups vs. abandonment?
- Abandonment rate per command (no follow-up within 30 min)

**4. Cost Distribution**
- Total estimated cost across all commands
- Cost per command (average)
- Most/least expensive commands

**5. Duration Analysis**
- Average duration per command
- Outliers (unusually slow invocations)

### Output Format

```text
╔══════════════════════════════════════════╗
║          MAESTRO EFFECTIVENESS           ║
╠══════════════════════════════════════════╣
║ Commands Run         __ (__ unique)      ║
║ Completion Rate      __%                 ║
║ Most Used            /_____ (__×)        ║
║ Most Abandoned       /_____ (__% ⚠️)     ║
║ Avg Duration         __s                 ║
║ Total Cost           ~$__.__             ║
╠══════════════════════════════════════════╣
║           STRONGEST PIPELINES            ║
╠══════════════════════════════════════════╣
║ /_____ → /_____    __×                   ║
║ /_____ → /_____    __×                   ║
╠══════════════════════════════════════════╣
║           COST PER COMMAND               ║
╠══════════════════════════════════════════╣
║ /_____    $__.__/run  ████░░  avg        ║
║ /_____    $__.__/run  █░░░░░  cheap      ║
║ /_____    $__.__/run  █████░  costly     ║
╚══════════════════════════════════════════╝

INSIGHTS:
1. [Data-driven observation with recommended action]
2. [Data-driven observation with recommended action]
3. [Data-driven observation with recommended action]
```

### Insights Rules

Every insight MUST:
- Reference specific data (e.g., "40% abandonment rate")
- Suggest a specific Maestro command to address it
- Distinguish correlation from causation

### Reflection Checklist

- [ ] All 5 analysis dimensions covered
- [ ] Scorecard generated with real data
- [ ] Insights are data-driven, not speculative
- [ ] Cost estimates labeled as approximate (~)
- [ ] Recommended actions reference specific Maestro commands

### Recommended Next Step

After reflecting, run `/streamline` to remove unused commands, or `/refine` on the most-abandoned command to improve its prompt quality.

**NEVER**:

- Require audit data to exist — degrade gracefully
- Invent metrics beyond what the logs contain
- Show cost data without the "estimate" disclaimer (~)
- Make judgments without evidence (say "100% completion rate" not "works great")
- Compare across projects — reflect is project-scoped

Files: 1

Size: 4.4 KB

Complexity: 12/100

Category: analysis

Source: https://github.com/sharpdeveye/maestro/tree/main/source/skills/reflect

Related in analysis

when-mapping-dependencies-use-dependency-mapper

Included

Comprehensive dependency mapping, analysis, and visualization tool for software projects

analysis

System Diagnostician

Included

Performs Codex-assisted project health diagnostics, identifies capability gaps, and produces prioritized improvement plans.

analysis

diagnose

Included

Use when the user wants to find problems, audit workflow quality, or get a comprehensive health check on their AI workflow.

analysis

evaluate

Included

Use when the user wants a quality review, interaction audit, or to test the workflow against realistic scenarios.

analysis

when-mapping-dependencies-use-dependency-mapper

Included

Comprehensive dependency mapping, analysis, and visualization tool for software projects

analysis

System Diagnostician

Included

Performs Codex-assisted project health diagnostics, identifies capability gaps, and produces prioritized improvement plans.

analysis

diagnose

Included

Use when the user wants to find problems, audit workflow quality, or get a comprehensive health check on their AI workflow.

analysis

evaluate

Included

Use when the user wants a quality review, interaction audit, or to test the workflow against realistic scenarios.

analysis