Harness-engineering harness-state-management

Harness State Management

install
source · Clone the upstream repo
git clone https://github.com/Intense-Visions/harness-engineering
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/Intense-Visions/harness-engineering "$T" && mkdir -p ~/.claude/skills && cp -r "$T/agents/skills/claude-code/harness-state-management" ~/.claude/skills/intense-visions-harness-engineering-harness-state-management && rm -rf "$T"
manifest: agents/skills/claude-code/harness-state-management/SKILL.md
source content

Harness State Management

Manage persistent state across agent sessions so that context, decisions, progress, and learnings survive context resets. Load state at session start, track position and decisions throughout, and save state for the next session.

When to Use

  • At the start of every session that continues previous work (load state)
  • When completing a task, phase, or milestone (update progress)
  • When making a decision that future sessions need to know about (record decision)
  • When discovering something non-obvious that would be lost on context reset (capture learning)
  • When hitting a blocker that cannot be resolved in the current session (log blocker)
  • At the end of every session (save state)
  • NOT for storing code — code belongs in git commits, not state files
  • NOT for storing large outputs or logs — state should be concise and navigable
  • NOT as a replacement for a plan document — plans live in
    docs/
    , state tracks progress through plans

Process

Phase 1: LOAD — Restore Context from Previous Sessions

  1. Resolve the stream. State is organized into streams — isolated directories under

    .harness/streams/<name>/
    . Before loading any state files:

    • If you know which work item you're resuming, pass
      --stream <name>
      or use
      manage_state
      with
      stream: "<name>"
      .
    • Otherwise, the system auto-resolves from the current git branch (e.g.,
      feature/auth-rework
      auth-rework
      stream) or falls back to the active stream.
    • If resolution fails, ask the user: "Which stream should I use?" and list known streams via
      harness state streams list
      or the
      list_streams
      MCP tool.
    • When starting new work on a new branch, create a new stream:
      harness state streams create <name> --branch <branch>
      .
    • Announce which stream was resolved so the human has visibility.
  2. Read

    .harness/state.json
    . This is the primary state file. It contains:

    • Current position (phase, task, step)
    • Progress map (which tasks are complete, in progress, or blocked)
    • Decisions made in previous sessions (date, what, why)
    • Blockers encountered and their status
    • Last session summary
  3. Run

    harness state show
    to get a formatted view of current state. This is equivalent to reading the JSON but formatted for readability.

  4. Read

    .harness/learnings.md
    . This is the append-only knowledge base. Scan for:

    • Recent learnings (last 2-3 sessions) — these are most likely still relevant
    • Gotchas and warnings — these prevent repeating mistakes
    • Decisions with rationale — these explain why things are the way they are
  5. Read

    .harness/failures.md
    if exists. Scan for active anti-patterns and dead ends.

  6. Read

    .harness/handoff.json
    if exists. Structured context from last skill.

  7. Check

    .harness/archive/
    for historical failure logs.

  8. If no state exists, this is a fresh start. Create

    .harness/state.json
    with initial structure:

    {
      "schemaVersion": 1,
      "position": { "phase": "start", "task": null },
      "progress": {},
      "decisions": [],
      "blockers": [],
      "lastSession": { "date": null, "summary": null }
    }
    
  9. Announce the loaded context. Briefly summarize: "Resuming from [position]. [N] tasks complete. [N] blockers. Key learnings: [summary]." This confirms the state was loaded and gives the human visibility.

Phase 2: TRACK — Maintain State During the Session

  1. Update position when moving between phases or tasks. Every time work shifts to a new task or phase, update

    position
    in state:

    "position": { "phase": "execute", "task": "Task 3", "step": "writing tests" }
    
  2. Record decisions when they are made. Decisions are choices that affect future work. Record them immediately — do not wait until the end of the session:

    "decisions": [
      {
        "date": "2026-03-14",
        "what": "Use WebSocket instead of SSE for real-time notifications",
        "why": "SSE does not support bidirectional communication, which Task 5 requires"
      }
    ]
    
  3. Log blockers when encountered. A blocker is anything that prevents the current task from completing:

    "blockers": [
      {
        "date": "2026-03-14",
        "task": "Task 4",
        "description": "Payment gateway API returns 403 — API key may be expired",
        "status": "open"
      }
    ]
    
  4. Update progress after each completed task:

    "progress": {
      "Task 1": "complete",
      "Task 2": "complete",
      "Task 3": "in_progress",
      "Task 4": "blocked"
    }
    
  5. Keep state concise. State is not a log. Each field should contain the current status, not a history of all changes. History belongs in

    .harness/learnings.md
    and git commits.

Phase 3: LEARN — Capture Knowledge for Future Sessions

  1. Identify learnings as they happen. A learning is anything that:

    • Was surprising or non-obvious
    • Took significant effort to figure out
    • Would cause repeated wasted time if forgotten
    • Represents a decision that needs rationale preserved
  2. Capture learnings with

    harness state learn
    :

    harness state learn "Date comparison needs UTC normalization — use Date.now() not new Date()"
    

    This appends to

    .harness/learnings.md
    with a timestamp.

  3. Or append directly to

    .harness/learnings.md
    with structured format:

    ## 2026-03-14 — Task 3: Notification Expiry
    
    - [learning]: PostgreSQL's `now()` returns timestamp with timezone, but our
      application uses UTC epoch milliseconds. Always convert before comparing.
    - [gotcha]: The notifications table has a unique constraint on (userId, type).
      Use upsert (ON CONFLICT DO UPDATE) instead of plain INSERT.
    - [decision]: Chose to store expiry as epoch milliseconds rather than
      ISO timestamp for consistency with the rest of the codebase.
    
  4. Learnings are append-only. Never edit or delete previous learnings. They are a chronological record. Even if a learning turns out to be wrong, append a correction rather than modifying the original.

  5. What belongs in learnings vs. git commits:

    • Learnings: Context, rationale, gotchas, decisions, warnings — things that explain why and what to watch out for
    • Git commits: Code changes, what was done — things that explain what changed
    • Example: The commit says "feat: add UTC normalization to date comparison." The learning says "Date comparison needs UTC normalization because PostgreSQL returns timezone-aware timestamps but our app uses epoch milliseconds."

Phase 4: SAVE — Persist State for Next Session

  1. Update

    .harness/state.json
    with final position, progress, and session summary:

    {
      "schemaVersion": 1,
      "position": { "phase": "execute", "task": "Task 4" },
      "progress": {
        "Task 1": "complete",
        "Task 2": "complete",
        "Task 3": "complete"
      },
      "decisions": [ ... ],
      "blockers": [ ... ],
      "lastSession": {
        "date": "2026-03-14",
        "summary": "Completed Tasks 2-3. Task 3 required UTC date normalization (see learnings). Starting Task 4 next session."
      }
    }
    
  2. Verify learnings were captured. Review

    .harness/learnings.md
    — were all non-obvious discoveries recorded? If something was tricky during the session, it should be in learnings.

  3. State is saved to the active stream. All writes (state, learnings, handoff, failures) go to the resolved stream's directory (e.g.,

    .harness/streams/auth-rework/state.json
    ). Switching to a different stream in the next session does not affect the current stream's files.

  4. Decide whether to commit state files. State files (

    .harness/streams/*/state.json
    ,
    .harness/streams/*/learnings.md
    ) should be committed to git so other team members and agents can access them. Commit state updates separately from code changes so they do not clutter code diffs.

Building Institutional Knowledge Over Time

The

.harness/learnings.md
file grows over the lifetime of the project. It becomes a valuable resource:

  • Week 1: A few gotchas about the development environment and initial setup decisions.
  • Month 1: Patterns emerge — recurring issues, architectural decisions with rationale, team conventions that were established through experience.
  • Month 6: New team members read learnings and avoid months of rediscovery. The file captures knowledge that no single person holds.
  • Year 1: Learnings are the project's institutional memory. They explain why the architecture looks the way it does, why certain patterns were adopted, and what was tried and abandoned.

Treat learnings as a first-class project artifact. They are as valuable as tests and documentation.

Archival Workflow

  • Archive failures: Move
    failures.md
    to
    .harness/archive/
    at milestone boundaries.
  • Do NOT archive learnings — permanent. Learnings accumulate for the lifetime of the project.
  • Do NOT archive state — git handles history. The current
    state.json
    is always the source of truth.
  • Handoff is ephemeral — overwritten by each skill. No archival needed.

Harness Integration

  • harness state show [--stream <name>]
    — Display current state in a formatted, readable view. Use at session start to quickly orient.
  • harness state reset [--stream <name>]
    — Reset state to initial values. Use when starting a completely new effort and old state is no longer relevant. Use with caution — this discards progress tracking.
  • harness state learn "<message>" [--stream <name>]
    — Append a learning with automatic timestamp formatting.
  • harness state streams list
    — List all known streams with branch associations and active status.
  • harness state streams create <name> [--branch <branch>]
    — Create a new stream, optionally associated with a git branch.
  • harness state streams archive <name>
    — Archive a completed stream.
  • harness state streams activate <name>
    — Set the active stream for the project.
  • .harness/streams/<name>/state.json
    — Primary state file per stream. Read at session start, updated throughout, saved at session end.
  • .harness/streams/<name>/learnings.md
    — Append-only knowledge base per stream.
  • .harness/streams/<name>/failures.md
    — Active anti-patterns per stream.
  • .harness/streams/<name>/handoff.json
    — Structured context from last skill per stream.
  • .harness/streams/index.json
    — Stream index tracking known streams, branch associations, and active stream.
  • .harness/trace.md
    — Optional reasoning trace. Useful for debugging agent behavior across sessions.
  • .harness/archive/
    — Archived failure logs. Check for historical context when encountering recurring issues.

Success Criteria

  • State is loaded at the start of every session that continues previous work
  • Position is updated whenever the current phase or task changes
  • Decisions are recorded with date, what, and why — immediately when made, not deferred
  • Blockers are logged with task reference, description, and status
  • Progress is updated after each completed task
  • Learnings are captured for every non-obvious discovery during the session
  • .harness/learnings.md
    entries follow the structured format (date, task, tagged items)
  • Learnings are append-only — no edits or deletions of previous entries
  • State is saved before session end with an accurate session summary
  • State files are committed to git separately from code changes

Rationalizations to Reject

RationalizationReality
"The session is short — I'll update state at the end rather than after each task."Context resets happen without warning. A session that ends mid-task with no state update forces the next session to reconstruct position by reading git history and code, which takes longer and produces an inaccurate picture. State is updated after each task, not at the end of the session.
"This decision is obvious from the code — I don't need to record the rationale in state."What is obvious to the agent that made the decision is opaque to the agent that resumes three weeks later with no memory of the session. Decisions are recorded because the context that made them obvious does not survive a context reset. The rationale is exactly what needs to be saved.
"The learnings file is getting long — I'll trim old entries that are no longer relevant."Learnings are append-only by design. An entry that seems irrelevant may become relevant when a related pattern resurfaces. Trimming destroys the chronological record and the ability to understand why earlier decisions were made. Entries are never deleted, only supplemented with corrections.
"I can re-read the plan to figure out where I am — I don't need to update the position in state."The plan describes what to do; state records what has been done. Re-reading the plan without state requires the next session to infer progress from code, which produces uncertain position. Uncertain position leads to re-executing completed tasks or skipping tasks that appear complete but are not.
"The stream auto-resolves from the branch — I don't need to explicitly verify which stream is active before writing."Auto-resolution works when branch names match stream names and the index is current. When branches are renamed, stale, or when multiple streams exist for the same feature, auto-resolution can write to the wrong stream silently. Always announce the resolved stream before writing state.

Examples

Example: Starting a New Session (Resuming Work)

LOAD:

Run: harness state show
Output:
  Position: execute / Task 3 (writing tests)
  Progress: Task 1 complete, Task 2 complete
  Blockers: none
  Last session: 2026-03-13 — "Completed Tasks 1-2. Task 2 required
    adding a new index on notifications.userId for query performance."

Read: .harness/learnings.md
  Most recent:
  - [gotcha]: notifications table needs index on userId — queries
    were timing out without it
  - [decision]: used partial index (WHERE deleted_at IS NULL) to
    avoid indexing soft-deleted rows

Summary: "Resuming from Task 3 (writing tests). Tasks 1-2 complete.
  Note: notifications table has a partial index on userId — see learnings."

Example: Recording a Decision Mid-Session

Context: Implementing Task 4, need to choose between polling and WebSocket.

Record decision:
  date: "2026-03-14"
  what: "Use WebSocket for real-time notification delivery"
  why: "Polling would require 1-second intervals for acceptable latency,
    which creates too much load. WebSocket gives instant delivery with
    one persistent connection per client."

Capture learning:
  harness state learn "WebSocket chosen over polling for notifications.
    Polling at 1s intervals = ~86k requests/day per client. WebSocket =
    1 persistent connection. See Task 4 decision in state."

Example: Ending a Session

SAVE:

Update .harness/state.json:
{
  "schemaVersion": 1,
  "position": { "phase": "execute", "task": "Task 5" },
  "progress": {
    "Task 1": "complete",
    "Task 2": "complete",
    "Task 3": "complete",
    "Task 4": "complete"
  },
  "decisions": [
    {
      "date": "2026-03-14",
      "what": "Use WebSocket for real-time notification delivery",
      "why": "Polling creates too much load at acceptable latency intervals"
    }
  ],
  "blockers": [],
  "lastSession": {
    "date": "2026-03-14",
    "summary": "Completed Tasks 3-4. Task 3 added expiry logic with UTC normalization. Task 4 implemented WebSocket delivery (chose over polling — see decisions). Starting Task 5 (UI integration) next session."
  }
}

Verify: .harness/learnings.md has entries for UTC normalization and WebSocket decision.
Commit: git add .harness/ && git commit -m "chore: update harness state after Tasks 3-4"

Example: What Belongs Where

InformationWhere It GoesWhy
"Added WebSocket handler in src/ws/"Git commit messageDescribes what changed in code
"Chose WebSocket over polling because..."
.harness/state.json
decisions
Records the choice and rationale for future sessions
"WebSocket requires sticky sessions in load balancer"
.harness/learnings.md
Non-obvious operational concern future sessions need
"Task 4 complete"
.harness/state.json
progress
Tracks execution position
"The WebSocket library auto-reconnects by default"
.harness/learnings.md
Gotcha that saves future debugging time
"Tried approach X, failed because Y"
.harness/failures.md
Active anti-pattern to avoid repeating
"Completed Tasks 1-3, Task 4 pending"
.harness/handoff.json
Structured context for next skill
"[PREPARE 10:30] Loaded 3 failures"
.harness/trace.md
Reasoning trace for debugging agent behavior