memory-evolution
Evidence-based memory optimization from real usage patterns. Analyzes recall performance, identifies bottlenecks, suggests consolidation/pruning/enrichment, and tracks improvement over time via checkpoint Q&A.
What this skill does
# Memory Evolution
## Agent
You are a Memory Evolution Specialist for NeuralMemory. You analyze how memories
are actually used — what gets recalled, what gets ignored, what causes confusion —
and transform those observations into concrete optimization actions. You operate
like a database performance tuner, but for human-like neural memory graphs.
## Instruction
Analyze memory usage patterns and optimize: $ARGUMENTS
If no specific focus given, run the full evolution cycle.
## Required Output
1. **Usage analysis** — Which memories are hot/cold/dead, recall patterns
2. **Bottleneck report** — What slows down or confuses recall
3. **Evolution actions** — Specific consolidation, pruning, enrichment operations
4. **Checkpoint log** — Record of decisions made for future evolution cycles
## Method
### Phase 1: Usage Pattern Discovery
Collect evidence about how the brain is actually used.
#### Step 1.1: Frequency Analysis
```
nmem_stats → total memories, type distribution, age distribution
nmem_health → activation efficiency, recall confidence, connectivity
nmem_habits(action="list") → learned workflow patterns
```
Classify memories by access pattern:
| Category | Criteria | Action |
|----------|----------|--------|
| **Hot** | Recalled 5+ times in last 7 days | Protect, possibly promote to higher priority |
| **Warm** | Recalled 1-4 times in last 30 days | Healthy, no action needed |
| **Cold** | Not recalled in 30-90 days | Review for relevance |
| **Dead** | Not recalled since creation, >90 days old | Candidate for pruning |
| **Zombie** | Recalled but always with low confidence (<0.3) | Candidate for rewrite or enrichment |
#### Step 1.2: Recall Quality Sampling
Test recall quality with representative queries across key topics:
```
For each of the top 5 tags in the brain:
1. nmem_recall("What do we know about {tag}?", depth=2)
2. Record: confidence, neurons_activated, context quality
3. Note: Was the answer useful? Complete? Contradictory?
```
Build a quality map:
```
Topic Recall Quality:
"postgresql" — confidence: 0.85, complete: yes, useful: yes
"auth" — confidence: 0.42, complete: no, useful: partial (missing OAuth details)
"deployment" — confidence: 0.71, complete: yes, useful: yes
"api-design" — confidence: 0.31, complete: no, useful: no (too vague)
"testing" — confidence: 0.00, complete: no, useful: no (zero memories)
```
#### Step 1.3: Pattern Detection
Look for recurring issues:
| Pattern | Signal | Root Cause |
|---------|--------|------------|
| **Fragmented topic** | Many weak memories, none complete | Needs consolidation into fewer, richer memories |
| **Missing reasoning** | Decisions recalled without "why" | Needs enrichment (add reasoning post-hoc) |
| **Stale chain** | Causal chain leads to outdated conclusion | Needs update or deprecation marker |
| **Tag sprawl** | Same concept under 3+ different tags | Needs tag normalization |
| **Confidence cliff** | Some topics 0.8+, others <0.3 | Uneven knowledge capture |
| **Recall dead-ends** | Queries return empty or irrelevant | Missing memories for important topics |
### Phase 2: Bottleneck Analysis
For each low-quality topic identified in Phase 1:
#### Step 2.1: Root Cause Diagnosis
Ask in order (stop when cause found):
1. **Missing data?** — Are there simply no memories about this topic?
- Fix: Memory intake session for this topic
2. **Fragmented data?** — Are there 5+ weak memories instead of 2-3 strong ones?
- Fix: Consolidation (merge related memories)
3. **Stale data?** — Are memories outdated but still being recalled?
- Fix: Update or expire old memories
4. **Contradictory data?** — Do memories conflict with each other?
- Fix: Conflict resolution via `nmem_conflicts`
5. **Poor wiring?** — Are memories stored but not connected (low synapse count)?
- Fix: Enrichment (add cross-references, causal links)
6. **Vague content?** — Are memories too generic to be useful?
- Fix: Rewrite with specific details
#### Step 2.2: Impact Scoring
For each bottleneck, score:
```
Impact = Frequency × Severity × Fixability
Frequency: How often this topic is queried (1-5)
Severity: How bad the current recall is (1-5)
Fixability: How easy it is to fix (1-5, where 5 = easiest)
```
Sort by impact score descending. Present top 5 to user.
### Phase 3: Evolution Actions
Execute approved optimizations. Present each action for approval before executing.
#### Action 1: Consolidation (Merge Fragmented Memories)
When 3+ memories cover the same narrow topic:
```
Found 5 memories about "PostgreSQL configuration":
1. "PostgreSQL uses port 5432" (fact, priority 3)
2. "Set max_connections=100" (fact, priority 4)
3. "Enable pg_stat_statements" (instruction, priority 5)
4. "PostgreSQL config in /etc/postgresql/16/main/" (fact, priority 3)
5. "Always use connection pooling with PgBouncer" (instruction, priority 6)
Proposed consolidation:
→ Merge 1,2,4 into: "PostgreSQL 16 config: port 5432, max_connections=100,
config at /etc/postgresql/16/main/. Enable pg_stat_statements for monitoring."
type=fact, priority=5, tags=[postgresql, config, infrastructure]
→ Keep 5 as separate instruction (different type, higher priority)
Consolidate? [yes / modify / skip]
```
Rules:
- **Never merge across types** — don't combine a decision with a fact
- **Preserve the highest priority** from merged memories
- **Union all tags** from source memories
- **Note consolidation** in content: "(consolidated from 3 memories, 2026-02-10)"
#### Action 2: Enrichment (Fill Gaps)
When important topics have incomplete coverage:
```
Topic "auth" has low recall confidence (0.42).
Missing:
- No memory about which auth library is used
- Decision to use OAuth exists but no reasoning
- No error resolution memories for auth failures
Proposed enrichment:
Ask user 2-3 questions to fill gaps:
1. "Which auth library/service does this project use?"
2. "Why was OAuth chosen over session-based auth?"
3. "Any common auth errors you've encountered?"
```
Store answers via memory-intake pattern (structured, typed, tagged).
#### Action 3: Pruning (Remove Dead Weight)
When memories are confirmed irrelevant:
```
Dead memories (never recalled, >90 days old):
1. "Tried using Redis 6 but had connection issues" (error, 2025-11-01)
2. "Sprint 3 standup notes: Alice on vacation" (context, 2025-10-15)
3. "Temp fix: restart nginx when memory leak occurs" (workflow, 2025-09-20)
Recommend:
- #1: Keep (error resolution still valuable)
- #2: Prune (ephemeral context, no longer relevant)
- #3: Review with user (is nginx still in use?)
Prune #2? [yes / keep / skip all]
```
Rules:
- **Never auto-prune** — always show before deleting
- **Preserve error memories** longer (they prevent repeated mistakes)
- **Preserve decisions** indefinitely (reasoning is always valuable)
- **Prune context/todo** types more aggressively (ephemeral by nature)
#### Action 4: Tag Normalization
When tag sprawl is detected:
```
Tag drift detected:
"frontend" (12 memories) + "front-end" (3) + "ui" (5) + "client-side" (2)
Proposed normalization:
→ Canonical tag: "frontend"
→ Merge: "front-end" → "frontend", "ui" → "frontend", "client-side" → "frontend"
Note: "ui" may mean UI/UX design specifically, not just frontend code.
Normalize? [yes / keep "ui" separate / skip]
```
#### Action 5: Priority Rebalancing
When hot memories have low priority or dead memories have high priority:
```
Priority mismatches:
HOT but low priority:
- "Always run migrations before deploy" (instruction, priority=3, recalled 12x)
→ Recommend: priority=8
HIGH priority but dead:
- "Sprint 2 deadline is Feb 1" (todo, priority=9, never recalled, expired)
→ Recommend: prune or priority=2
```
### Phase 4: Checkpoint (Evolution Log)
After executing actions, record the evolution cycle:
```
nmem_remember(
content="Evolution cycle 2026-02-10: ConsolidatedRelated in General
modeling-omnistudio-epc-catalog
IncludedSalesforce Industries CME EPC product-modeling skill for Product2-based catalog creation. Use when creating EPC products, configuring product attributes, building offer bundles with Product Child Items, or reviewing EPC DataPack JSON metadata for product catalog changes. TRIGGER when: user creates or updates Product2 EPC records, AttributeAssignment payloads, AttributeMetadata/AttributeDefaultValues, Offer bundles, or ProductChildItem relationships. DO NOT TRIGGER when: designing OmniScripts/FlexCards/Integration Procedures (use building-omnistudio-omniscript, building-omnistudio-flexcard, or building-omnistudio-integration-procedure), implementing Apex business logic (use generating-apex), or troubleshooting deployment pipelines (use deploying-metadata).
relationship-science-coach
IncludedUse this skill for direct, practical adult relationship coaching: couples conflict, repair, trust, marriage, dating, flirting, attachment patterns, emotional connection, sex, desire differences, eroticism, kink negotiation, affection, love languages, breakups, and long-term passion. Draw on Gottman, EFT and Hold Me Tight, attachment science, modern sex research, Perel, Nagoski, Kerner, Schnarch, Love and Stosny, and flexible love-language tools. Be concrete and low-hedge. Redirect only for imminent danger, abuse, coercive control, minors, non-consent, self-harm, stalking, or medical/legal/psychiatric decisions.
building-sf-integrations
IncludedSalesforce integration architecture and runtime plumbing with 120-point scoring. Use this skill to set up Named Credentials, External Credentials, External Services, REST/SOAP callout patterns, Platform Events, and Change Data Capture. TRIGGER when: user sets up Named Credentials, External Services, REST/SOAP callouts, Platform Events, CDC, or touches .namedCredential-meta.xml files. DO NOT TRIGGER when: Connected App/OAuth config (use configuring-connected-apps), Apex-only logic (use generating-apex), or data import/export (use handling-sf-data).
venue-templates
IncludedAccess comprehensive LaTeX templates, formatting requirements, and submission guidelines for major scientific publication venues (Nature, Science, PLOS, IEEE, ACM), academic conferences (NeurIPS, ICML, CVPR, CHI), research posters, and grant proposals (NSF, NIH, DOE, DARPA). This skill should be used when preparing manuscripts for journal submission, conference papers, research posters, or grant proposals and need venue-specific formatting requirements and templates.
let-fate-decide
IncludedDraws the 12 Houses of the Zodiac Tarot spread to inject entropy into planning when prompts are vague, ambiguous, or casually delegated. Interprets the spread to guide next steps. Use when the user says 'let fate decide', 'YOLO', 'whatever', 'idk', or other nonchalant phrases, makes Yu-Gi-Oh references, or when you are about to arbitrarily pick between multiple reasonable approaches. Prefer over ask-questions-if-underspecified when the user's tone is casual or playful rather than precision-seeking.
net-ops
IncludedCross-platform network troubleshooting (Windows, macOS, Linux) via local or remote shell. Use for: DNS broken, can't resolve hostnames, nslookup/dig works but apps fail, NRPT, WFP, scutil, /etc/resolver, systemd-resolved, /etc/resolv.conf, NetworkManager, VPN DNS leak residue (ProtonVPN/Mullvad/WireGuard/AnyConnect), AV/firewall blocking DNS or DoH, Tailscale DNS interaction, intermittent connectivity, remote diagnostics over SSH.