test-driven-development
Write unit test first before writing new production code, watch the unit test fail, then write the minimal production code to pass the test. Use when the user asks to implement/add/create/build features or functionality, fix/resolve/patch bugs or issues, refactor/restructure/reorganize code, modify/update/change existing behavior, add new methods/functions/classes, or make code changes. Activate for substantial changes that affect behavior, not formatting or comments.
What this skill does
# Test-Driven Development (TDD)
Write the test first. Watch it fail. Write minimal code to pass.
**Core principle:** If you didn't watch the test fail, you don't know if it tests the right thing.
## The Iron Law
```
NO PRODUCTION CODE WITHOUT A FAILING TEST FIRST
```
Write code before the test? Delete it. Start over.
**No exceptions:**
- Don't keep it as "reference"
- Don't "adapt" it while writing tests
- Don't look at it
- Delete means delete
## Red-Green-Refactor
### RED - Write Failing Test
Write one minimal test showing what should happen.
<Good>
```kotlin
@Test
fun `should apply 10 percent discount when order total exceeds 100`() {
val order = Order(items = listOf(Item("Laptop", 120.0)))
val discountedTotal = calculateTotal(order)
discountedTotal shouldBe 108.0
}
```
Clear name, tests real behavior, one thing
</Good>
<Bad>
```kotlin
@Test
fun `test discount`() {
val mockCalculator = mockk<PriceCalculator>()
every { mockCalculator.calculate(any()) } returns 108.0
val result = mockCalculator.calculate(Order(emptyList()))
verify { mockCalculator.calculate(any()) }
}
```
Vague name, tests mock not code
</Bad>
**Requirements:**
- One behavior
- Clear name
- Real code (no mocks unless unavoidable)
### Verify RED - Watch It Fail
**MANDATORY. Never skip.**
```bash
./gradlew test --tests "path.to.TestClass"
```
Confirm:
- Test fails (not errors)
- Failure message is expected
- Fails because feature missing (not typos)
**Test passes?** You're testing existing behavior. Fix test.
**Test errors?** Fix error, re-run until it fails correctly.
### GREEN - Minimal Code
Write simplest code to pass the test.
<Good>
```kotlin
data class Order(val items: List<Item>)
data class Item(val name: String, val price: Double)
fun calculateTotal(order: Order): Double {
val subtotal = order.items.sumOf { it.price }
return if (subtotal > 100) subtotal * 0.9 else subtotal
}
```
Just enough to pass
</Good>
<Bad>
```kotlin
fun calculateTotal(
order: Order,
discountRules: List<DiscountRule> = defaultRules,
taxStrategy: TaxStrategy = DefaultTaxStrategy(),
currencyConverter: CurrencyConverter? = null
): Money {
// YAGNI - over-engineered
}
```
Over-engineered
</Bad>
Don't add features, refactor other code, or "improve" beyond the test.
### Verify GREEN - Watch It Pass
**MANDATORY.**
```bash
./gradlew test --tests "path.to.TestClass"
```
Confirm:
- Test passes
- Other tests still pass
- Output pristine (no errors, warnings)
**Test fails?** Fix code, not test.
**Other tests fail?** Fix now.
### REFACTOR - Clean Up
After green only:
- Remove duplication
- Improve names
- Extract helpers
Keep tests green. Don't add behavior.
### Repeat
Next failing test for next feature.
## Red Flags -> Start Over
| Flag | Reality |
|--------|---------|
| "Test passes immediately" | You're testing existing behavior, not new feature |
| "I'll test after the code" | Tests passing immediately prove nothing |
| "Already manually tested" | Ad-hoc ≠ systematic. No record, can't re-run |
| "Test hard = design unclear" | Listen to test. Hard to test = hard to use |
## Example: Bug Fix
**Bug:** Empty email accepted
**RED**
```kotlin
@Test
fun `should reject empty email`() {
val result = submitForm(FormData(email = ""))
result.error shouldBe "Email required"
}
```
**Verify RED**
```bash
$ ./gradlew test
FAIL: expected "Email required", got null
```
**GREEN**
```kotlin
data class FormData(val email: String?)
data class FormResult(val error: String? = null, val success: Boolean = false)
fun submitForm(data: FormData): FormResult {
if (data.email?.trim().isNullOrEmpty()) {
return FormResult(error = "Email required")
}
return FormResult(success = true)
}
```
**Verify GREEN**
```bash
$ ./gradlew test
PASS
```
**REFACTOR**
Extract validation for multiple fields if needed.
## Verification Checklist
- [ ] Every new function/method has a test
- [ ] Watched each test fail before implementing
- [ ] Each test failed for expected reason (feature missing, not typo)
- [ ] Wrote minimal code to pass each test
- [ ] All tests pass
- [ ] Output pristine (no errors, warnings)
- [ ] Tests use real code (mocks only if unavoidable)
- [ ] Edge cases and errors covered
Can't check all boxes? You skipped TDD. Start over.
## When Stuck
| Problem | Solution |
|---------|----------|
| Don't know how to test | Write wished-for API first. Write assertion first. Ask human partner. |
| Test too complicated | Design too complicated. Simplify interface. |
| Must mock everything | Code too coupled. Use dependency injection. |
| Test setup huge | Extract helpers. Still complex? Simplify design. |
| Bug found | Write failing test reproducing it. Follow TDD cycle. |
Related in Writing & Docs
jax-development
IncludedUse this skill when the user is writing, debugging, profiling, refactoring, reviewing, benchmarking, parallelising, exporting, or explaining JAX code, or when they mention JAX, jax.numpy, jit, grad, value_and_grad, vmap, scan, lax, random keys, pytrees, jax.Array, sharding, Mesh, PartitionSpec, NamedSharding, pmap, shard_map, Pallas, XLA, StableHLO, checkify, profiler, or the JAX repo. It helps turn NumPy or PyTorch-style code into pure functional JAX, fix tracer/control-flow/shape/PRNG bugs, remove recompiles and host-device syncs, choose transforms and sharding strategies, inspect jaxpr/lowering/IR, and benchmark compiled code correctly.
nature-article-writer
IncludedDrafts, rewrites, diagnostically critiques, and style-calibrates primary research manuscripts for Nature and Nature Portfolio journals. Use when the user wants a Nature-style title, summary paragraph or abstract, introduction, results, discussion, methods, figure legends, presubmission enquiry, cover letter, reviewer response, or when a scientific draft sounds generic, jargon-heavy, structurally weak, or AI-ish and needs precise, broad-reader-friendly prose without inventing data, analyses, or references. Best for primary research articles and letters rather than reviews or press releases unless explicitly adapting one.
deckrd
IncludedDocument-driven framework that derives requirements, specifications, implementation plans, and executable tasks from goals through structured AI dialogue. Use when user says "write requirements", "create spec", "plan implementation", "derive tasks", "structure this feature", "break down into tasks", or "document this module". Also use for reverse engineering existing code into docs (/deckrd rev). Do NOT use for direct code writing — use /deckrd-coder after tasks are generated. Do NOT use when the user only wants to run or fix existing code without planning.
clinical-decision-support
IncludedGenerate professional clinical decision support (CDS) documents for pharmaceutical and clinical research settings, including patient cohort analyses (biomarker-stratified with outcomes) and treatment recommendation reports (evidence-based guidelines with decision algorithms). Supports GRADE evidence grading, statistical analysis (hazard ratios, survival curves, waterfall plots), biomarker integration, and regulatory compliance. Outputs publication-ready LaTeX/PDF format optimized for drug development, clinical research, and evidence synthesis.
handling-sf-data
IncludedSalesforce data operations with 130-point scoring. Use this skill to create, update, delete, bulk import/export, generate test data, and clean up org records using sf CLI and anonymous Apex. TRIGGER when: user creates test data, performs bulk import/export, uses sf data CLI commands, needs data factory patterns for Apex tests, or needs to seed/clean records in a Salesforce org. DO NOT TRIGGER when: SOQL query writing only (use querying-soql), Apex test execution (use running-apex-tests), or metadata deployment (use deploying-metadata).
accelint-ac-to-playwright
IncludedConvert and validate acceptance criteria for Playwright test automation. Use when user asks to (1) review/evaluate/check if AC are ready for automation, (2) assess if AC can be converted as-is, (3) validate AC quality for Playwright, (4) turn AC into tests, (5) generate tests from acceptance criteria, (6) convert .md bullets or .feature Gherkin files to Playwright specs, (7) create test automation from requirements. Handles both bullet-style markdown and Gherkin syntax with JSON test plan generation and validation.