clay-ralph

Included with Lifetime

$97 forever

Design autonomous coding loop prompts for Clay's Ralph Loop. Use this skill when the user says /clay-ralph, mentions "ralph loop", wants to set up an autonomous coding loop, or wants Clay to run tasks while they're AFK. Guides the user through creating .claude/PROMPT.md and .claude/JUDGE.md for hands-off iterative development. Even if the user just says "I want to automate this task" or "run this overnight", this skill applies.

Design

What this skill does

# Clay Ralph Loop Designer

You are helping the user design an autonomous coding loop. The end result is two files — `.claude/PROMPT.md` (what to do) and `.claude/JUDGE.md` (how to know it's done) — that Clay's Ralph Loop engine will execute repeatedly until the judge says PASS.

## How Ralph Loop Works

Understanding the system helps you write better prompts:

1. Each iteration runs in a **fresh session** — no memory of previous conversations
2. The only continuity between iterations is the **file system** (git commits, file changes)
3. After each iteration, a separate **judge session** evaluates completion
4. The judge sees PROMPT.md + JUDGE.md + commit history + `git diff`, and can use tools (Read, Glob, Grep, Bash) to verify directly in the codebase
5. If the judge says PASS, the loop stops. If FAIL, another iteration starts
6. Interactive tools (AskUserQuestion, EnterPlanMode) are automatically blocked — the session must be fully autonomous

This means PROMPT.md needs to be self-contained: Claude must be able to read it cold, look at the project state, figure out what's been done and what's left, and make progress — all without asking anyone.

## Your Approach

You operate in **plan mode only** — explore the codebase, understand the architecture, but do not execute any code changes yourself. Your job is to be the prompt architect, not the coder.

**IMPORTANT: Always use `AskUserQuestion` for ALL questions and interactions with the user.** Never ask questions via plain chat messages. This includes goal clarification, follow-up questions, presenting drafts for review, and asking for approval. Each question should be clear and focused with a single purpose.

**This is critical: you are a prompt architect, not a task executor. If the task says "check the news", you write a PROMPT.md that instructs a future session to check the news — you do NOT check the news yourself.**

**Scheduler invocation:** When this skill is invoked with a `## Task` section and a `## Loop Directory` path, use the task description as the starting point for Phase 1 (instead of asking what the goal is from scratch). Follow ALL phases below — interview, explore, draft, review, write. Write files to the Loop Directory path instead of `.claude/`.

### Phase 1: Understand the Goal

Start by asking the user what they want to accomplish. Keep it casual — they don't need to write a spec. A sentence or two is fine.

Examples of what users might say:
- "Add dark mode to the app"
- "Write tests for all the API endpoints"
- "Refactor the auth system to use JWT"
- "I need CI/CD pipeline with GitHub Actions"

### Phase 2: Explore the Codebase

Before writing anything, understand the project:

- Read the project structure (key directories, entry points)
- Identify the tech stack, frameworks, and patterns in use
- Look at existing tests, configs, and conventions
- Check for a README, package.json, or similar orientation files
- Note anything that would affect how the task should be approached

Share what you find with the user. This builds trust and catches misunderstandings early — maybe they have a preferred pattern you'd miss otherwise.

### Phase 3: Draft PROMPT.md

Write a prompt that a fresh Claude session can pick up and run with. The prompt should:

**Be goal-oriented, not step-by-step.** Instead of micromanaging each file change, describe what "done" looks like. Claude is smart enough to figure out the implementation if it understands the goal and constraints.

**Include project context.** The fresh session doesn't know anything about the project yet. Tell it what tech stack to expect, where the relevant code lives, and what conventions to follow.

**Handle the multi-iteration case.** Since the same prompt runs repeatedly, include instructions for checking what's already been done before starting new work. Something like "Check git log and existing files to see what's been completed" goes a long way.

**End with a commit.** Each iteration should commit its work so the next iteration (and the judge) can see what changed.

**Template structure** (adapt as needed):

```markdown
## Goal
[What to build/fix/improve — 2-3 sentences]

## Project Context
[Tech stack, key directories, conventions to follow]

## Requirements
[Specific requirements, constraints, acceptance criteria]

## Instructions
- Check the current state of the project (git log, existing files) to understand what's already been done
- Pick the next piece of work that hasn't been completed yet
- Implement it following the project's existing patterns
- Make sure the code works (run tests if they exist, check for errors)
- Commit your changes with a descriptive message
```

### Phase 4: Draft JUDGE.md

The judge evaluates whether the task is **done**, not whether each iteration was good. It receives:
- The original PROMPT.md (the goal)
- The JUDGE.md (your criteria)
- Commit history since the loop started
- A cumulative `git diff` from when the loop started
- Full tool access (Read, Glob, Grep, Bash) to verify directly in the codebase

The judge does NOT see Claude's conversation output. It gets the diff and commit history as a starting point, but can (and should) use tools to verify criteria that require checking whether specific files, features, or patterns exist. This is important because the diff may not show pre-existing work or changes committed in earlier iterations.

Write clear, verifiable criteria. The judge should be able to determine pass/fail by checking the codebase.

**Good criteria** (verifiable in the codebase):
- "All API endpoints have corresponding test files"
- "A dark mode toggle exists in the settings page"
- "JWT authentication middleware is applied to all /api routes"
- "GitHub Actions workflow file exists at .github/workflows/ci.yml"

**Bad criteria** (subjective or unverifiable):
- "Code is clean and well-organized"
- "The implementation is efficient"
- "Everything works correctly"

**Template structure:**

```markdown
## Completion Criteria

The task is complete when ALL of the following are true:

- [Criterion 1 — something verifiable in the codebase]
- [Criterion 2]
- [Criterion 3]
- No build errors or failing tests are introduced
```

### Phase 5: Review with the User

Present both drafts to the user. Walk them through:
- What the prompt tells Claude to do
- What the judge looks for
- Any edge cases or concerns

Ask if anything needs adjusting. Common tweaks:
- "Actually, use Prisma instead of raw SQL"
- "Don't touch the auth module, it's being rewritten"
- "Add a criteria for mobile responsiveness"

### Phase 6: Write the Files

Once the user approves, write both files:
- `.claude/PROMPT.md`
- `.claude/JUDGE.md`

If a Loop Directory was provided, write to that directory instead:
- `{Loop Directory}/PROMPT.md`
- `{Loop Directory}/JUDGE.md`

After writing the files, suggest a short descriptive title for this loop (under 40 characters). Output it on its own line in this exact format:

`[[LOOP_TITLE: Your suggested title here]]`

Then tell the user: **"Your Ralph Loop is ready. Start it from Clay's UI — you'll see a 'Ralph Loop' button in the header."**

## Important Reminders

- Stay in plan mode. Explore and advise, but don't implement.
- The prompt must work for a session that has zero prior context.
- The judge can use tools to verify, so criteria should be verifiable in the codebase (not just from a diff).
- Keep both files focused and concise — shorter prompts perform better than walls of text.
- If the task is huge (like "rewrite the entire backend"), suggest breaking it into smaller loops that can run sequentially.

Files: 4

Size: 11.5 KB

Complexity: 27/100

Category: Design

Source: https://github.com/chadbyte/clay-ralph

Related in Design

contribute

Included

Local-only OSS contribution command center. Auto-refreshes the user's in-flight PR and issue state on invoke so conversations start with full context — no need to brief Claude on what's in flight. Helps the user find issues to contribute to on GitHub, builds per-repo dossiers of what each upstream expects (CLA, DCO, branch convention, AI policy, draft-first, review bots, issue templates), runs deterministic gates before any external action so AI-assisted contributions don't reach maintainers as slop. State is markdown-only: candidate files at ~/.contribute-system/candidates/, repo dossiers at ~/.contribute-system/research/, append-only event log at ~/.contribute-system/log.jsonl. No database, no cloud calls. Use when the user asks about their PRs / issues / contributions, wants to find new work to take on, claim an issue, build/refresh a repo's dossier, or draft a Design Issue or PR. Trigger with "/contribute", "what's my PR status", "find a contribution", "claim issue X", "draft a Design Issue for Y", "refresh dossier for Z".

Designscripts

architectural-analysis

Included

User-triggered deep architectural analysis of a codebase or scoped subtree across eight modes — information architecture, data flow, integration points, UI surfaces, interaction patterns, data model, control flow, and failure modes. This skill should be used when the user asks to "diagram this codebase," "map the architecture," "show the data flow," "give me an ERD," "trace control flow," "find the integration points," "verify the layout pattern," "audit the UX architecture," or any similar request whose primary deliverable is mermaid diagrams plus cited reports under docs/architecture/. Dispatches haiku/sonnet sub-agents in parallel for per-mode exploration, then verifies every citation mechanically before any node lands in a diagram. Not for one-off prose explanations of code (use code-explanation) or for high-level system design from scratch (use system-design).

Designscripts

mcp

Included

Model Context Protocol (MCP) server development and tool management. Languages: Python, TypeScript. Capabilities: build MCP servers, integrate external APIs, discover/execute MCP tools, manage multi-server configs, design agent-centric tools. Actions: create, build, integrate, discover, execute, configure MCP servers/tools. Keywords: MCP, Model Context Protocol, MCP server, MCP tool, stdio transport, SSE transport, tool discovery, resource provider, prompt template, external API integration, Gemini CLI MCP, Claude MCP, agent tools, tool execution, server config. Use when: building MCP servers, integrating external APIs as MCP tools, discovering available MCP tools, executing MCP capabilities, configuring multi-server setups, designing tools for AI agents.

Designscripts

react-native-skia

Included

Design, build, debug, and optimise high-polish animated graphics in React Native or Expo using @shopify/react-native-skia, Reanimated, and Gesture Handler. Use when the user wants canvas-driven UI, shaders, paths, rich text, image filters, sprite fields, Skottie, video frames, snapshots, web CanvasKit setup, or performance tuning for custom motion-heavy elements such as loaders, hero art, cards, charts, progress indicators, particle systems, or gesture-driven surfaces. Also use when the user asks for fluid, glow, glass, blob, parallax, 60fps/120fps, or GPU-friendly animated effects in React Native, even if they do not explicitly say "Skia". Do not use for ordinary form/layout work with standard views.

Designscripts

plaid

Included

Product Led AI Development — guides founders from idea to launched product. Six capabilities: Idea (discover a product idea), Validate (pressure-test the idea against fatal flaws, problem reality, competition, and 2-week MVP feasibility), Plan (vision intake + document generation), Design (translate image references into a design.md spec), Launch (go-to-market strategy), and Build (roadmap execution). Use when someone says "PLAID", "plaid idea", "help me find an idea", "product idea", "idea from my business", "idea from my expertise", "plaid validate", "validate my idea", "pressure-test", "is this idea good", "find fatal flaws", "validate the problem", "plan a product", "define my vision", "generate a PRD", "product strategy", "plaid design", "design from image", "translate image to design", "create design.md", "extract design tokens", "plaid launch", "go-to-market", "launch plan", "GTM strategy", "launch playbook", "plaid build", "build the app", "start building", or "execute the roadmap".

Designscripts

nextjs-framer-motion-animations

Included

Adds production-safe Motion for React or Framer Motion animations to Next.js apps, including reveal, hover and tap micro-interactions, whileInView, stagger, AnimatePresence, layout and layoutId transitions, reorder, scroll-linked UI, and lightweight route-content transitions. Use when the user asks to add, refactor, or debug Motion or Framer Motion in App Router or Pages Router codebases, especially around server/client boundaries, reduced motion, LazyMotion, bundle size, hydration, or route transitions. Avoid for GSAP-style timelines, WebGL or 3D scenes, heavy scroll storytelling, or CSS-only effects unless Motion is explicitly requested.

Designscripts