Skills aegis-protocol

Self-healing stability monitor for AI agents - 5 core checks + 15 extended checks, auto-recovery, health scoring

install
source · Clone the upstream repo
git clone https://github.com/openclaw/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/ankechenlab-node/aegis-protocol" ~/.claude/skills/openclaw-skills-aegis-protocol && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/ankechenlab-node/aegis-protocol" ~/.openclaw/skills/openclaw-skills-aegis-protocol && rm -rf "$T"
manifest: skills/ankechenlab-node/aegis-protocol/SKILL.md
source content

Aegis Protocol 🛡️

Self-Healing Stability Monitor for AI Agents

Version: 0.12.6
Author: Dream
License: MIT


Features

  • 20-Dimension Monitoring: System, services, AI agent, security, maintenance
  • Auto Recovery: Terminate stuck sessions, restart services, compact context
  • Health Scoring: Quantified health score (0-100)
  • Healing Memory: Record and learn from recovery strategies
  • Result Caching: 5-minute TTL for reduced system calls

Commands

aegis-protocol init

Initialize configuration file

aegis-protocol status

Show system health summary

aegis-protocol check

Run full health check

aegis-protocol heal

Execute automatic recovery

aegis-protocol config

View current configuration


Usage Examples

# Initialize
python3 aegis-protocol.py init

# Check health
python3 aegis-protocol.py check

# Auto recover
python3 aegis-protocol.py heal

# View config
python3 aegis-protocol.py config

Configuration

File:

~/.openclaw/workspace/.watchdog-config.json

{
  "thresholds": {
    "sessionTimeoutMinutes": 60,
    "pm2RestartAlert": 50,
    "diskUsagePercent": 90,
    "memoryUsagePercent": 95,
    "contextUsagePercent": 80
  }
}

Monitoring Dimensions

CategoryChecks
SystemCPU, Memory, Disk, Zombies, FD, Connections
ServicesPM2, Nginx, Docker, Cron
AI AgentSessions, Context, Tasks, Loops
SecuritySSL, Updates, Git
MaintenanceBackup, Cleanup, Network

Health Score

ScoreStatus
90-100Excellent
70-89Good
50-69Warning
0-49Critical

Testing

# Unit tests
python3 -m pytest tests/ -v

# Coverage
python3 -m pytest tests/ --cov=aegis_protocol -v

Coverage: 82%
Tests: 20+ passing


Quality Metrics

MetricValue
Type hints>90%
Test coverage82%
Documentation100%
Bugs0

Version History

v0.7.0 (2026-04-05)

  • Result caching with 5-minute TTL
  • Type hints >90% coverage
  • Exception classification (4 types)
  • 20-dimension monitoring
  • Health scoring system

Aegis Protocol - The Never-Sleeping Guardian 🌀