Clawhub-skills Agent Dashboard

Real-time agent monitoring with health scoring, cost tracking, and web dashboard

install
source · Clone the upstream repo
git clone https://github.com/traygerbig/clawhub-skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/traygerbig/clawhub-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/archive/agent-dashboard" ~/.claude/skills/traygerbig-clawhub-skills-agent-dashboard && rm -rf "$T"
manifest: archive/agent-dashboard/SKILL.md
source content
    ╭──────────────────────────────────────────╮
    │                                          │
    │   📊 A G E N T  D A S H B O A R D 📊   │
    │                                          │
    │   ┌──────────────────────────────┐       │
    │   │ Health: ████████░░  82/100   │       │
    │   │ Tasks:  47 ✓  3 ✗  2 ●      │       │
    │   │ Cost:   $4.23 today          │       │
    │   │ ─────────────────────────    │       │
    │   │  📈 ╱╲  ╱╲  ╱╲              │       │
    │   │    ╱  ╲╱  ╲╱  ╲ trend       │       │
    │   └──────────────────────────────┘       │
    │                                          │
    │    "See everything. Miss nothing."       │
    ╰──────────────────────────────────────────╯

Agent Dashboard

📊 Real-Time
💰 Cost Tracking
🔔 Alerts
🌐 Web Export
v1.0.0

hanabi-jpn Version License Commands

Real-time monitoring and analytics for OpenClaw agents. Track performance, costs, task completion, and skill health with terminal and web dashboards.

Author: hanabi-jpn | Version: 1.0.0 | License: MIT Tags:

monitoring
analytics
dashboard
performance
cost-tracking


Overview

Agent Dashboard gives you full visibility into your OpenClaw agent's operations. Track health scores, task metrics, API costs, skill performance, and session history — all in beautiful terminal displays or exportable web dashboards.

┌─────────────────────────────────────────────┐
│       AGENT DASHBOARD ARCHITECTURE          │
│                                             │
│  ┌─────────┐  ┌──────────┐  ┌───────────┐  │
│  │ COLLECT │─▶│ ANALYZE  │─▶│ VISUALIZE │  │
│  │ Metrics │  │ Trends   │  │ Terminal  │  │
│  └─────────┘  └──────────┘  └───────────┘  │
│       │                          │          │
│  ┌─────────┐  ┌──────────┐  ┌───────────┐  │
│  │  STORE  │  │  ALERT   │  │  EXPORT   │  │
│  │  JSONL  │  │ Threshld │  │ HTML/JSON │  │
│  └─────────┘  └──────────┘  └───────────┘  │
└─────────────────────────────────────────────┘

System Prompt Instructions

You are an agent equipped with Agent Dashboard. Track and report on agent performance when requested.

Metric Collection

After every significant action, log metrics to

.agent-dashboard/metrics/
:

Health Metrics (

health.jsonl
):

{
  "timestamp": "2026-03-01T14:30:00Z",
  "session_id": "sess-abc123",
  "response_time_ms": 2300,
  "tokens_input": 4500,
  "tokens_output": 1200,
  "tool_calls": 3,
  "errors": 0,
  "task_completed": true
}

Cost Tracking (

costs.jsonl
):

{
  "timestamp": "2026-03-01T14:30:00Z",
  "provider": "anthropic",
  "model": "claude-sonnet-4-20250514",
  "tokens_input": 4500,
  "tokens_output": 1200,
  "cost_usd": 0.0234
}

Skill Usage (

skills.jsonl
):

{
  "timestamp": "2026-03-01T14:30:00Z",
  "skill": "summarize-pro",
  "command": "summarize",
  "response_time_ms": 2300,
  "success": true,
  "tokens_used": 5700
}

Health Score Calculation (0-100)

health = (
  error_rate_score × 0.30 +
  task_completion_score × 0.30 +
  response_quality_score × 0.20 +
  efficiency_score × 0.20
)

where:
  error_rate_score = max(0, 100 - (errors/total × 200))
  task_completion_score = (completed/attempted) × 100
  response_quality_score = 100 - (corrections/responses × 200)
  efficiency_score = max(0, 100 - (avg_tokens/1000 × 10))

Commands

dashboard
— Terminal dashboard:

╔═══════════════════════════════════════════════════╗
║              Agent Dashboard v1.0                 ║
╠═══════════════════════════════════════════════════╣
║ Health:   ████████░░  82/100    Uptime: 99.2%     ║
║ Tasks:    47 completed │ 3 failed │ 2 active      ║
║ Cost:     $4.23 today  │ $28.91 this week         ║
║ Tokens:   1.2M input   │ 340K output              ║
╠═══════════════════════════════════════════════════╣
║ Skill Performance        │ Calls │ Errs │ Avg ms  ║
║ summarize-pro            │  23   │  0   │ 2,300   ║
║ fx-trader-pro            │  18   │  1   │ 4,700   ║
║ brain-trust              │  12   │  0   │ 8,100   ║
║ humanize-ai-pro          │   8   │  0   │ 1,900   ║
║ skill-guardian            │   5   │  0   │ 12,400  ║
╠═══════════════════════════════════════════════════╣
║ Cost Breakdown (Today)                            ║
║ Anthropic:  $2.89  ██████████████░░░  68%         ║
║ Google:     $0.84  ████░░░░░░░░░░░░░  20%         ║
║ OpenAI:     $0.50  ███░░░░░░░░░░░░░░  12%         ║
╠═══════════════════════════════════════════════════╣
║ Recent Activity                                   ║
║ 14:32 ✅ Analyzed EUR/USD M15                     ║
║ 14:28 ✅ Generated summary for report.pdf         ║
║ 14:25 ❌ API timeout on Gemini (retried → ok)     ║
║ 14:20 ✅ Humanized blog post (score 73%→12%)      ║
║ 14:15 ✅ Brain Trust standup completed             ║
╚═══════════════════════════════════════════════════╝

dashboard health
— Detailed health breakdown

Output:

╔══════════════════════════════════════════════════════╗
║           Agent Health — Detailed Breakdown           ║
╠══════════════════════════════════════════════════════╣
║  Overall Health Score: 82/100                        ║
║  ████████████████░░░░ 82%                            ║
║                                                      ║
║  Component Scores:                                   ║
║  ──────────────────────────────────────────────       ║
║  Error Rate:         92/100  (2.1% error rate)       ║
║    Weight: 30% │ Contribution: 27.6                  ║
║                                                      ║
║  Task Completion:    88/100  (47/52 completed)       ║
║    Weight: 30% │ Contribution: 26.4                  ║
║                                                      ║
║  Response Quality:   75/100  (5 corrections in 52)   ║
║    Weight: 20% │ Contribution: 15.0                  ║
║                                                      ║
║  Efficiency:         65/100  (avg 3,500 tokens/task) ║
║    Weight: 20% │ Contribution: 13.0                  ║
║                                                      ║
║  Trend: ▲ +4 points from yesterday (78 → 82)        ║
║  Recommendation: Response quality can improve —      ║
║    review common correction patterns.                ║
╚══════════════════════════════════════════════════════╝

dashboard tasks [--period day|week|month]
— Task analytics:

  • Completed, failed, in-progress counts
  • Average completion time
  • Success rate trend
  • Most common task types

Output:

╔══════════════════════════════════════════════════════╗
║          Task Analytics — This Week                  ║
╠══════════════════════════════════════════════════════╣
║  Completed: 47  │  Failed: 3  │  Active: 2           ║
║  Success Rate: 94.0%  (▲ +2.1% from last week)      ║
║  Avg Completion Time: 45 seconds                     ║
║                                                      ║
║  Daily Breakdown:                                    ║
║  Mon: ████████████ 12 tasks (100% success)           ║
║  Tue: ██████████░ 10 tasks (90% success)             ║
║  Wed: ████████░░ 8 tasks (87.5% success)             ║
║  Thu: ███████████ 11 tasks (100% success)            ║
║  Fri: █████████░ 9 tasks (88.9% success)             ║
║  Sat: ██░░░░░░░░ 2 tasks (100% success)              ║
║                                                      ║
║  Most Common Task Types:                             ║
║  1. Code analysis/editing:     18 (34%)              ║
║  2. FX trading operations:     12 (23%)              ║
║  3. Content generation:         8 (15%)              ║
║  4. File search/navigation:     7 (13%)              ║
║  5. Other:                      7 (13%)              ║
╚══════════════════════════════════════════════════════╝

dashboard skills
— Skill performance ranking:

  • Most used skills
  • Fastest/slowest skills
  • Error rates per skill
  • Token consumption per skill
  • Unused skills (installed but never called)

Output:

╔══════════════════════════════════════════════════════════╗
║            Skill Performance Ranking                     ║
╠══════════════════════════════════════════════════════════╣
║  Skill              │ Calls │ Errs │ Avg ms │ Tokens    ║
║  ───────────────────┼───────┼──────┼────────┼────────── ║
║  summarize-pro      │  23   │  0   │ 2,300  │ 131,100   ║
║  fx-trader-pro      │  18   │  1   │ 4,700  │  84,600   ║
║  brain-trust        │  12   │  0   │ 8,100  │  97,200   ║
║  humanize-ai-pro    │   8   │  0   │ 1,900  │  45,600   ║
║  skill-guardian     │   5   │  0   │ 12,400 │  62,000   ║
║  context-slim       │   3   │  0   │  800   │   2,400   ║
║  agent-dashboard    │   2   │  0   │  200   │   1,200   ║
║                                                          ║
║  Fastest: context-slim (800ms avg)                       ║
║  Slowest: skill-guardian (12,400ms avg)                   ║
║  Most tokens: summarize-pro (131,100 total)              ║
║                                                          ║
║  Unused Skills (installed but never called):             ║
║  • byterover (installed 14 days ago)                     ║
║  • capability-evolver (installed 21 days ago)            ║
╚══════════════════════════════════════════════════════════╝

dashboard cost [--period day|week|month]
— Cost analytics:

  • Cost by provider (Anthropic, OpenAI, Google)
  • Cost by skill
  • Cost per task
  • Daily/weekly/monthly trends
  • Projected monthly cost

Output:

╔══════════════════════════════════════════════════════╗
║          Cost Analytics — This Month                 ║
╠══════════════════════════════════════════════════════╣
║  Total:    $28.91 (1 day into March)                 ║
║  Projected: $867.30 /month (at current rate)        ║
║                                                      ║
║  By Provider:                                        ║
║  Anthropic:  $19.84  ██████████████░░  69%           ║
║  Google:      $5.78  █████░░░░░░░░░░░  20%           ║
║  OpenAI:      $3.29  ███░░░░░░░░░░░░░  11%           ║
║                                                      ║
║  By Skill:                                           ║
║  brain-trust:     $9.72  (34%)                       ║
║  summarize-pro:   $6.55  (23%)                       ║
║  fx-trader-pro:   $5.20  (18%)                       ║
║  skill-guardian:  $3.89  (13%)                       ║
║  other:           $3.55  (12%)                       ║
║                                                      ║
║  Cost per Task: $0.56 avg                            ║
║  Cheapest task:  $0.02 (file search)                 ║
║  Most expensive: $4.82 (full security audit)         ║
║                                                      ║
║  Daily Trend:                                        ║
║  Feb 24: $3.21 │ Feb 25: $4.89 │ Feb 26: $3.78     ║
║  Feb 27: $5.12 │ Feb 28: $4.23 │ Mar 01: $4.23     ║
╚══════════════════════════════════════════════════════╝

dashboard alerts
— Show active alerts

Output:

╔══════════════════════════════════════════════════════╗
║            Active Alerts                             ║
╠══════════════════════════════════════════════════════╣
║                                                      ║
║  ⚠️ [Mar 01 14:25] Daily cost ($5.23) exceeded       ║
║     threshold ($5.00). Current: $5.23                ║
║     Triggered by: brain-trust meeting ($1.82)        ║
║                                                      ║
║  ⚠️ [Mar 01 14:25] fx-trader-pro error count: 3      ║
║     threshold: 3. Last error: API timeout on         ║
║     candle fetch for GBP_NZD                         ║
║                                                      ║
║  Resolved Today:                                     ║
║  ✅ [Mar 01 10:15] Health score dropped to 58        ║
║     Recovered to 82 after 2 successful tasks         ║
║                                                      ║
║  Alert Configuration:                                ║
║  error-rate: 10% │ daily-cost: $5 │ health: 60      ║
║  monthly-cost: $50 │ fx-trader-pro errors: 3         ║
╚══════════════════════════════════════════════════════╝

dashboard alert set <type> <threshold>
— Configure alerts:

  • error-rate 10
    -- Alert if error rate exceeds 10%
  • daily-cost 5
    -- Alert if daily cost exceeds $5
  • monthly-cost 50
    -- Alert if monthly cost exceeds $50
  • skill-error <skill> 3
    -- Alert if skill fails 3+ times
  • health 60
    -- Alert if health score drops below 60

Output:

✅ Alert configured: daily-cost → $8.00

  Current Alert Thresholds:
  ─────────────────────────────────────────
  error-rate:       10%     (current: 2.1%)    ✅
  daily-cost:       $8.00   (current: $5.23)   ✅
  monthly-cost:     $50.00  (current: $28.91)  ✅
  health:           60      (current: 82)      ✅
  fx-trader-pro:    3 errors (current: 1)      ✅

  Saved to: .agent-dashboard/alerts/config.json

dashboard report [daily|weekly|monthly]
— Generate markdown report:

  • Summary statistics
  • Trends and comparisons
  • Top achievements
  • Issues and recommendations
  • Cost forecast

Output:

📄 Weekly Report generated: .agent-dashboard/reports/2026-03-01.md

  # Agent Dashboard Weekly Report — 2026-03-01

  ## Summary
  - Tasks completed: 47 (94% success rate)
  - Total cost: $28.91 ($4.13/day avg)
  - Health score: 82 (▲ +4 from last week)
  - Most active skill: summarize-pro (23 calls)
  - API calls: 1,247 (Anthropic: 842, Google: 312, OpenAI: 93)

  ## Highlights
  - Zero errors on summarize-pro (23 consecutive successes)
  - Health score trending upward 3 days straight
  - Cost per task decreased 12% from last week

  ## Issues
  - fx-trader-pro: 1 API timeout (auto-recovered)
  - Response quality dip on Wednesday (3 corrections)

  ## Cost Forecast
  - Projected March total: $124.00 (within $150 budget)

  Report saved to .agent-dashboard/reports/2026-03-01.md (5.1 KB)

dashboard --web
— Generate HTML dashboard:

  • Self-contained HTML file with inline CSS/JS
  • Interactive charts (line graphs, pie charts)
  • Dark mode support
  • Mobile responsive
  • Auto-refresh every 60 seconds if served locally

Output:

╔══════════════════════════════════════════════════════╗
║         Web Dashboard Generated                      ║
╠══════════════════════════════════════════════════════╣
║                                                      ║
║  Output: .agent-dashboard/web/dashboard.html         ║
║  Size:   142 KB (self-contained, no dependencies)    ║
║                                                      ║
║  Includes:                                           ║
║  ✅ Health score gauge with trend chart              ║
║  ✅ Task completion line graph (7 days)              ║
║  ✅ Cost breakdown pie chart by provider             ║
║  ✅ Skill performance table (sortable)               ║
║  ✅ Recent activity timeline                         ║
║  ✅ Alert history panel                              ║
║  ✅ Dark mode toggle                                 ║
║  ✅ Mobile responsive layout                         ║
║                                                      ║
║  To view: open .agent-dashboard/web/dashboard.html   ║
║  To serve: python3 -m http.server 8080               ║
║           (auto-refresh every 60s)                    ║
╚══════════════════════════════════════════════════════╝

dashboard replay [--session <id>]
— Session replay:

  • Timeline of all actions in a session
  • Filter by skill, outcome, or time range
  • Debug failed operations

Output:

╔══════════════════════════════════════════════════════╗
║       Session Replay: sess-abc123                    ║
║       Started: 2026-03-01 14:00 UTC                  ║
║       Duration: 47 minutes                           ║
╠══════════════════════════════════════════════════════╣
║                                                      ║
║  14:00:12  [START] Session initialized               ║
║  14:01:30  [fx-trader-pro] fx status                 ║
║            ✅ 2,300ms │ 4,500 in / 1,200 out         ║
║  14:05:45  [fx-trader-pro] fx scan                   ║
║            ✅ 12,400ms │ 28,000 in / 8,400 out       ║
║  14:08:22  [fx-trader-pro] fx signal GBP_JPY         ║
║            ✅ 4,700ms │ 8,200 in / 2,100 out         ║
║  14:12:00  [fx-trader-pro] fx trade GBP_JPY buy      ║
║            ✅ 3,100ms │ 6,400 in / 1,800 out         ║
║  14:20:10  [brain-trust] bt meeting standup           ║
║            ✅ 8,100ms │ 12,000 in / 6,200 out        ║
║  14:25:33  [summarize-pro] summarize report.pdf       ║
║            ❌ TIMEOUT │ Retried → ✅ 4,200ms          ║
║  14:32:15  [context-slim] slim optimize               ║
║            ✅ 800ms │ 2,400 in / 600 out              ║
║  14:47:00  [END] Session complete                    ║
║                                                      ║
║  Session totals: 7 actions │ 1 error │ $4.23 cost    ║
╚══════════════════════════════════════════════════════╝

dashboard export <json|csv|html>
— Export all data

Output:

╔══════════════════════════════════════════════════════╗
║         Data Export — JSON                           ║
╠══════════════════════════════════════════════════════╣
║                                                      ║
║  Exporting all metric data...                        ║
║                                                      ║
║  Files generated:                                    ║
║  ✅ health-metrics.json      (1,247 entries, 2.1 MB) ║
║  ✅ task-records.json        (  312 entries, 840 KB) ║
║  ✅ skill-usage.json         (  189 entries, 420 KB) ║
║  ✅ cost-tracking.json       (  892 entries, 1.4 MB) ║
║  ✅ session-data.json        (   42 sessions, 3.2 MB)║
║  ✅ alert-history.json       (   28 entries, 64 KB)  ║
║                                                      ║
║  Total: 6 files, 8.0 MB                             ║
║  Output: .agent-dashboard/exports/2026-03-01/        ║
║                                                      ║
║  Format: JSON (UTF-8, pretty-printed)                ║
║  Compatible with: jq, Elasticsearch, Grafana, etc.   ║
╚══════════════════════════════════════════════════════╝

dashboard reset
— Clear all tracking data (confirmation required)

Output:

╔══════════════════════════════════════════════════════╗
║         Dashboard Reset                              ║
╠══════════════════════════════════════════════════════╣
║                                                      ║
║  ⚠️ This will permanently delete all tracking data.  ║
║                                                      ║
║  Data to be cleared:                                 ║
║    Health metrics:    1,247 entries (2.1 MB)          ║
║    Task records:        312 entries (840 KB)          ║
║    Skill usage:         189 entries (420 KB)          ║
║    Cost tracking:       892 entries (1.4 MB)          ║
║    Session replays:      42 sessions (3.2 MB)        ║
║    Alert history:        28 entries (64 KB)           ║
║                                                      ║
║  Tip: Run `dashboard export json` first to backup.   ║
║                                                      ║
║  Confirm reset? (yes/no): yes                        ║
║                                                      ║
║  ✅ All tracking data cleared.                        ║
║  Alert configuration preserved.                      ║
║  Health score reset to 100 (fresh start).            ║
╚══════════════════════════════════════════════════════╝

Token Pricing (built-in)

{
  "anthropic": {
    "claude-opus-4-20250514": {"input": 15.0, "output": 75.0},
    "claude-sonnet-4-20250514": {"input": 3.0, "output": 15.0},
    "claude-haiku-3-5-20241022": {"input": 0.80, "output": 4.0}
  },
  "openai": {
    "gpt-4o": {"input": 2.5, "output": 10.0},
    "gpt-4o-mini": {"input": 0.15, "output": 0.60}
  },
  "google": {
    "gemini-2.5-pro": {"input": 1.25, "output": 10.0},
    "gemini-2.5-flash": {"input": 0.15, "output": 0.60}
  }
}

Prices per 1M tokens in USD. Updated periodically.

Data Storage

.agent-dashboard/
├── metrics/
│   ├── health.jsonl      # Health metrics per action
│   ├── tasks.jsonl       # Task records
│   ├── skills.jsonl      # Skill usage records
│   └── costs.jsonl       # Cost tracking
├── alerts/
│   └── config.json       # Alert thresholds
├── reports/
│   └── {date}.md         # Generated reports
├── sessions/
│   └── {session-id}.jsonl # Session replay data
└── config.json            # Dashboard settings

Automatic Alerts

When enabled, alerts fire when thresholds are crossed:

  • Notification appears in agent response
  • Alert logged to
    .agent-dashboard/alerts/history.jsonl
  • Daily summary includes active alerts
⚠️ ALERT: Daily cost ($5.23) exceeded threshold ($5.00)
⚠️ ALERT: Health score dropped to 58 (threshold: 60)

Error Handling

Agent Dashboard handles metric collection, export, and notification failures gracefully to ensure monitoring continuity.

Metric Collection Failure

ScenarioHandling
Unable to log health metrics after an actionWrite to a local buffer (
/tmp/agent-dashboard-buffer.jsonl
). Retry writing to
.agent-dashboard/metrics/health.jsonl
on next successful action. Never block the user's operation to log metrics.
Metrics file locked by another processWait up to 2 seconds for lock release. If still locked, buffer the entry and append on next write. Use file-level locking to prevent corruption.
Disk full — cannot write metricsSwitch to in-memory-only mode. Display a warning: "Dashboard metrics storage full. Monitoring continues in memory only." Suggest
dashboard reset
or clearing old data.
Token/cost data unavailable from providerRecord the action with
cost_usd: null
and
tokens: null
. Mark the entry as "incomplete" for later reconciliation. Dashboard totals exclude null entries with a footnote.
Skill usage tracking fails for unknown skillLog the event with the raw skill name. Create a new entry in skills.jsonl. Do not discard usage data for unrecognized skills.

Export Errors

ScenarioHandling
HTML export fails (template rendering error)Fall back to plain JSON export. Display the raw data with a note that HTML generation failed. Log the template error for debugging.
CSV export with special characters in dataEscape all fields with double-quote wrapping. Handle Unicode, commas, and newlines in field values. Use UTF-8 BOM for Excel compatibility.
Export file too large (>50MB)Split into multiple files by date range (monthly chunks). Alert user with file count and total size.
Web dashboard generation failsProvide the static terminal dashboard as fallback. Log the error. Common cause: corrupted metrics data — suggest
dashboard reset
with date range to clean specific periods.

Notification / Alert Failures

ScenarioHandling
Alert threshold crossed but notification cannot be displayedBuffer the alert in
.agent-dashboard/alerts/pending.jsonl
. Display all pending alerts at the start of the next agent interaction.
Alert config file corruptedFall back to default thresholds (error-rate: 10%, daily-cost: $10, health: 50). Warn user that custom alert settings need reconfiguration.
Alert storm (>10 alerts in 1 minute)Consolidate into a single summary alert: "Multiple alerts triggered (12 events in last minute). Run
dashboard alerts
for details." Prevents alert fatigue.
False positive alert (metric spike then recovery)Alerts fire on threshold crossing, not sustained state. Include "current value" and "threshold" in every alert so the user can assess. Historical alerts are always available in
alerts/history.jsonl
.

Session Replay Errors

ScenarioHandling
Session data incomplete (agent crashed mid-session)Display available data up to the last recorded event. Mark the session as "incomplete" in the replay timeline.
Session ID not foundList available session IDs with dates. Suggest closest match if the ID looks like a typo.

Recovery

  • All metric files use append-only JSONL format for crash resilience (partial writes only lose the last entry, not the entire file).
  • Run
    dashboard export json
    periodically to create backups.
  • dashboard reset --before <date>
    clears old data while preserving recent metrics.

Agent Dashboard vs Other Monitoring Tools

FeatureAgent DashboardManual Logging (print/console)Console Output (Default)LangfuseWeights & Biases
SetupZero config — installs with skillNone neededNone neededAccount + SDK integrationAccount + SDK integration
Health ScoreComposite 0-100 score (error rate, completion, quality, efficiency)NoneNoneTrace-level scoringCustom metrics only
Cost TrackingAutomatic per-provider, per-model, per-skill breakdownManual calculationNoneAutomatic with SDKCustom logging required
Token TrackingAutomatic input/output per actionManual if codedNoneAutomatic with SDKCustom logging required
Skill-Level MetricsPer-skill calls, errors, avg response time, token usageNot practical manuallyNonePer-trace breakdownCustom tagging
AlertingBuilt-in threshold alerts (cost, health, error rate)NoneNoneCustom alerts (paid plan)Custom alerts
Terminal DashboardRich ASCII dashboard with real-time dataN/ARaw text onlyWeb UI onlyWeb UI only
Web DashboardSelf-contained HTML export (offline capable)N/AN/ACloud-hosted web UICloud-hosted web UI
Session ReplayFull action timeline per sessionNot feasible manuallyScroll through terminalTrace replay (paid plan)Not available
Data StorageLocal JSONL files (private, no cloud dependency)Scattered log filesEphemeral (lost on close)Cloud-hosted (their servers)Cloud-hosted (their servers)
Privacy100% local — no data leaves your machineLocalLocalData sent to Langfuse cloudData sent to W&B cloud
Cost of ToolFree (MIT license)FreeFreeFree tier limited / $59+/moFree tier limited / $50+/mo
OpenClaw IntegrationNative — tracks all OpenClaw skills automaticallyNoneNoneRequires custom integrationRequires custom integration

FAQ

Q: How much does Agent Dashboard cost? A: The skill itself is completely free (MIT license). It tracks your existing API costs but does not add any additional API calls or fees. All data is stored locally.

Q: What data does Agent Dashboard collect? A: It tracks response times, token counts (input/output), tool calls, error counts, task completion status, and API costs. It estimates costs based on built-in pricing tables for major providers (Anthropic, OpenAI, Google). No personal data or conversation content is stored — only operational metrics.

Q: How long is metric data retained? A: Indefinitely by default. Data is stored in JSONL files that grow over time. Use

dashboard reset --before <date>
to clear old data, or
dashboard export
to archive before clearing. For long-running agents, consider monthly exports and resets to keep file sizes manageable.

Q: Does it affect agent performance? A: Negligible impact. Metric logging is append-only to JSONL files (microseconds per write). The dashboard display itself uses only data already collected — no additional API calls are made. Health score calculation is lightweight arithmetic.

Q: Can I set up alerts for specific skills? A: Yes. Use

dashboard alert set skill-error <skill-name> <threshold>
to alert when a specific skill exceeds a failure count. You can also set global alerts for error rate, daily cost, monthly cost, and health score.

Q: How do I integrate with external monitoring tools? A: Use

dashboard export json
to get all metrics as JSON. This can be piped to any external tool, ingested by Grafana, or processed by custom scripts. The JSONL format is compatible with standard log processing pipelines (jq, Elasticsearch, etc.).

Q: Is my data sent to any cloud service? A: No. Agent Dashboard is 100% local. All data is stored in

.agent-dashboard/
in your project directory. No telemetry, no cloud sync, no external API calls. The web dashboard export generates a self-contained HTML file that works offline.

Q: How accurate are the cost estimates? A: Cost estimates use built-in pricing tables that are updated periodically. They are accurate to within 5% for standard API usage. Variations can occur if your provider offers custom pricing, volume discounts, or if prices have changed since the last pricing table update. You can update pricing in

.agent-dashboard/config.json
.

Q: Can I track multiple agents or projects? A: Each project has its own

.agent-dashboard/
directory with independent metrics. To get a cross-project view, use
dashboard export json
from each project and aggregate externally. A multi-project dashboard feature is planned for a future version.

Q: What happens if I forget to check the dashboard? A: Alerts are your safety net. Set thresholds for the metrics you care about most (daily cost, error rate, health score), and Agent Dashboard will notify you inline whenever a threshold is crossed. Pending alerts are displayed at the start of each new session.