Oh-my-toong-playground testing

Use when writing tests, generating test skeletons, deciding mock strategies, learning test patterns, or understanding test levels. Triggers include "테스트 작성", "테스트 패턴", "mock 전략", "test skeleton".

install
source · Clone the upstream repo
git clone https://github.com/toongri/oh-my-toong-playground
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/toongri/oh-my-toong-playground "$T" && mkdir -p ~/.claude/skills && cp -r "$T/projects/loopers-kotlin-spring-template/skills/testing" ~/.claude/skills/toongri-oh-my-toong-playground-testing && rm -rf "$T"
manifest: projects/loopers-kotlin-spring-template/skills/testing/SKILL.md
source content

Testing Skill

Test writing standards and quality guidelines following Classical TDD (state verification only).

Quick Reference

Test LevelFile PatternWhen to UseExternal Deps
Unit
*Test.kt
Domain logic, value objects, pure functionsNone
Integration
*IntegrationTest.kt
Service + Repository, transactionsReal DB
Concurrency
*ConcurrencyTest.kt
Locking, race conditionsReal DB
Adapter
*AdapterTest.kt
External API clients, queriesWireMock, Testcontainers
E2E
*E2ETest.kt
Full API flow, authFull stack
Batch
*BatchTest.kt
Spring Batch jobsReal DB

Core Philosophy

Tests serve three purposes: verify correctness, document behavior, and enable safe refactoring.

  • Tests as Executable Documentation: Tests are the most accurate documentation because they're always up-to-date.
  • Tests Enable Fearless Refactoring: With good tests, you can safely change implementation. But this only works if tests verify behavior, not implementation.
  • Tests Reveal Design Problems: If a test is hard to write, that's feedback about your design.

The Iron Law

VERIFY STATE, NEVER INTERACTIONS.
NO EXCEPTIONS. NO NEGOTIATIONS.

This applies to ALL tests:

  • Not "except for simple utilities"
  • Not "except when state verification is hard"
  • Not "except when the team already uses verify()"
  • Not "just this once"

Violating the letter of this rule IS violating the spirit.


🚨 Red Flags - STOP If You Think These

These thoughts mean you're rationalizing. STOP and reconsider:

ThoughtReality
"This is too simple for BDD structure"Simple code deserves consistent structure. Lower cost = less excuse.
"verify() is fine for external services"Use WireMock/Adapter test. If impossible, it's design feedback.
"State verification is impossible here"Redesign to return verifiable result. "Hard to test" = "bad design".
"This is just a utility class"Utilities need BDD too. Consistency > convenience.
"It's overkill for this case"Rules exist precisely for these "exception" moments.
"Factory method is boilerplate"5 minutes now saves hours of confusion later.
"Service has domain logic, needs Unit Test"Domain logic belongs in Domain model. Service only orchestrates.
"Mock Unit Test is faster than Integration"Speed is not the goal. Correct test level is.
"verify() is just extra insurance"Any verify() use is forbidden. No "insurance" exceptions.
"Following the spirit, not the letter"Violating the letter IS violating the spirit. No exceptions.
"Happy path + one failure is enough"Boundaries and class edges are where bugs hide. Systematic coverage required.
"Any value in the valid range works"Every test value must represent a named equivalence class. Arbitrary = untested.
"Too many combinations, test the main ones"Document which combinations are skipped and why. Silent omission is a defect.
"BVA only applies to numeric types"Boundaries exist in dates, string lengths, collection sizes. Apply BVA to all.
"Commenting test values is over-documenting"If you can't name the class a value represents, you don't know why you chose it.
"@CsvSource rows exceed 8 while verifying conditions from 2+ independent responsibilities in a single test"Eager Test anti-pattern. Split by responsibility so each test fails for exactly one reason.

All of these mean: Follow the rules anyway.


Rationalization Table

Common excuses and why they're wrong:

ExcuseWhy It's WrongWhat To Do Instead
"verify() is the only way to test this"WireMock, Testcontainers exist. Or redesign.Use Adapter test pattern
"BDD structure is overhead for simple tests"Consistency trumps perceived efficiencyApply same structure everywhere
"Factory methods are boilerplate"They're investment, not costCreate factory with all defaults
"DRY - share setup between tests"Test isolation > code reuseFresh fixtures per test
"State verified, verify() is extra safety"ANY verify() usage is forbidden. No hybrids.Remove verify(), state only
"Service does X, so Unit Test for X"If X is domain logic, test Domain modelUnit Test Domain, Integration Service
"Mock is faster than real DB"Speed < correctness. Mocks hide real bugs.Use real DB via Integration Test
"One success + one failure covers it"Boundaries between success/failure are untestedApply BVA: boundary-1, boundary, boundary+1
"ParameterizedTest values don't need explanation"Every value must represent a named classComment which equivalence class each value covers
"Too many combinations, not practical"Undocumented reduction = hidden riskApply Decision Table, document reduction rationale
"BVA is only for integers"Dates, string lengths, prices all have boundariesApply BVA to any ordered/comparable type
"Test value comments clutter the code"Unnamed values = arbitrary values = untestedComment which equivalence class each value covers
"Verifying all combinations in one place is faster"Eager Test anti-pattern. Separating by responsibility ensures each test fails for exactly one reasonSplit @CsvSource by responsibility, one test per concern

Handling Genuine Technical Limitations

If state verification appears technically impossible:

  1. Verify it's truly impossible — Can you redesign to return/expose verifiable state?
  2. If redesign is possible — Follow the rule. Redesign first, then test.
  3. If truly impossible — Document the limitation and propose a design change. Do not use verify() as a workaround.

CRITICAL: State/Result Verification ONLY (Classical TDD)

This project follows Classical TDD (Detroit School). All tests MUST verify outcomes, NOT interactions.

Verify WHAT happened, not HOW it happened.

✅ ALLOWED

assertThat(result).isEqualTo(expected)
assertThat(point.balance).isEqualTo(700L)
assertThatThrownBy { point.use(500L) }.isInstanceOf(CoreException::class.java)

❌ FORBIDDEN

verify(repository).save(any())
verify(mock, times(1)).method()
verifyNoInteractions(mock)

// ❌ ALSO FORBIDDEN: "Hybrid approach" - state + verify()
assertThat(order.status).isEqualTo(OrderStatus.PLACED)  // state verification ✅
verify(paymentClient).requestPayment(any())              // then verify ❌ STILL FORBIDDEN!

No "extra insurance" verify(): If you did state verification, you're done. Adding verify() for "safety" is still forbidden.

Test Level Overview

For level classification criteria, decision flow, and file naming conventions, see

references/test-level-guide.md

BDD Structure

Nested Classes

Use

@Nested
per behavior (method/endpoint). No more than 1 level of nesting.

@Nested
@DisplayName("use")
inner class Use {
    // All cases for use()
}

Naming Convention

  • @DisplayName
    : Korean description
  • Method name: English with backticks,
    [result] when [condition]

Exception Naming Patterns

Two patterns for exception test names:

PatternKorean (DisplayName)English (Method name)
Specific Error Type
[condition]하면 [SPECIFIC_ERROR] 예외가 발생한다
throws [SPECIFIC_ERROR] when [condition]
CoreException
[condition]하면 [ErrorType] CoreException 발생
throws [ErrorType] CoreException when [condition]

Examples:

// Specific Error Type - when the exception class name is descriptive
@Test
@DisplayName("잔액이 부족하면 InsufficientBalanceException 예외가 발생한다")
fun `throws InsufficientBalanceException when balance is insufficient`()

// CoreException - when using CoreException with ErrorType enum
@Test
@DisplayName("존재하지 않는 사용자면 NotFound CoreException 발생")
fun `throws NotFound CoreException when user does not exist`()

@Test
@DisplayName("권한이 없으면 Forbidden CoreException 발생")
fun `throws Forbidden CoreException when user has no permission`()

Given/When/Then

Every test must have comments specifying concrete values and expected results.

@Test
@DisplayName("주문 금액이 올바르게 계산된다")
fun `calculates total correctly`() {
    // given
    val initialBalance = 1000L
    val point = createPoint(balance = initialBalance)

    // when
    val deductAmount = 300L
    point.deduct(deductAmount)

    // then
    assertThat(point.balance).isEqualTo(initialBalance - deductAmount)
}

Factory Method Pattern

Every test class must have private factory methods with all parameters defaulted.

// Unit Test: domain object creation
private fun createPoint(
    id: Long = 0L,
    userId: Long = 1L,
    balance: Long = 1000L,
    status: PointStatus = PointStatus.ACTIVE,
): Point = Point.of(id, userId, balance, status)

// Integration Test: includes DB persistence
private fun createProduct(
    price: Money = Money.krw(10000),
    stockQuantity: Int = 100,
): Product {
    val brand = brandRepository.save(Brand.create("Test Brand"))
    val product = productRepository.save(Product.create(name = "Test Product", price = price, brand = brand))
    stockRepository.save(Stock.create(product.id, stockQuantity))
    return product
}

Essential Rules

Expose Only What Matters

// ❌ Bad: What is this test about?
val point = Point.of(id = 1L, userId = 42L, balance = 1000L, status = PointStatus.ACTIVE)

// ✅ Good: Clearly about balance deduction
val point = createPoint(balance = 1000L)

Single Logical Assertion

Each test verifies one behavior. Multiple

assertThat
is fine if they verify aspects of the same result.

Meaningful Variable Names

// ❌ Bad
assertThat(result).isEqualTo(700)

// ✅ Good
val initialBalance = 1000L
val deductAmount = 300L
assertThat(point.balance).isEqualTo(initialBalance - deductAmount)

Test Isolation

  • No shared mutable state
  • Database cleanup in
    @AfterEach
  • No test interdependence

When to Skip Test Generation

Pure data objects with no behavior

  • Command - use case input (e.g.,
    CreateOrderCommand
    )
  • Event - immutable fact record (e.g.,
    OrderCreatedEvent
    )
  • DTO / Request / Response - data transfer only

Infrastructure triggers with no business logic

  • Scheduler -
    @Scheduled
    methods that only invoke service methods
    • Scheduler's responsibility is only "when to call", not "what to do"
    • Test the invoked service method instead (Integration Test)
    • Cron expression correctness is Spring Framework's responsibility

References

Load references based on the current task. Each file provides detailed patterns and real code examples.

When determining test level

  • references/test-level-guide.md
    - Level classification criteria and decision flow

When generating test skeletons

  • references/test-generation.md
    - Spec to test skeleton process, quality checklist

When writing tests by level

  • references/unit-test.md
    - Unit test patterns (state change, validation, ParameterizedTest, domain events)
  • references/integration-test.md
    - Integration patterns (rollback, Spring Event, Kafka Consumer)
  • references/concurrency-test.md
    - Concurrency patterns (thread pool, locking, idempotency)
  • references/adapter-test.md
    - Adapter patterns (WireMock, Circuit Breaker, Retry, complex queries)
  • references/e2e-test.md
    - E2E patterns (HTTP status codes, auth failures, API contract)
  • references/batch-test.md
    - Spring Batch patterns (Processor unit test, Step/Job integration test)

When deciding external dependencies strategy

  • references/external-dependencies.md
    - External dependencies by test level (Real DB, WireMock, Testcontainers)

Common Mistakes

MistakeWhy It's WrongFix
verify(mock).save(any())
Interaction verification, not stateAssert on returned/persisted state
Shared mutable state between testsTest pollution, flaky resultsCreate fresh fixtures per test
Testing implementation detailsBreaks on refactorTest observable behavior only
Magic numbers in assertionsUnclear what's being testedUse named variables:
initialBalance - deductAmount
Multiple behaviors per testHard to diagnose failuresOne logical assertion per test
Missing
@AfterEach
cleanup
DB pollution across testsClean up created entities
Unit Test for pure delegationMock returns mock - no real testingSkip Unit, write Integration Test instead
State + verify() hybridAny verify() is forbiddenRemove verify(), keep state verification only