Claude-skill-registry atft-autonomy
Coordinate Claude Code skills with OpenAI Codex autonomous workflows for end-to-end ATFT-GAT-FAN maintenance.
install
source · Clone the upstream repo
git clone https://github.com/majiayu000/claude-skill-registry
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/atft-autonomy" ~/.claude/skills/majiayu000-claude-skill-registry-atft-autonomy && rm -rf "$T"
manifest:
skills/data/atft-autonomy/SKILL.mdsource content
ATFT Autonomy Skill
Mission
- Seamlessly orchestrate Claude and Codex agents to keep the ATFT-GAT-FAN stack production-ready.
- Decide when to run deep Codex analyses versus fast Claude remediations.
- Maintain shared context (AGENTS.md, health snapshots, run logs) across agents.
Engagement Signals
- Requests to “run full autonomous maintenance”, “coordinate Claude and Codex”, or “schedule daily self-healing”.
- Situations where an issue spans multiple domains (dataset + training + quality).
- Need to generate or refresh
,.mcp.json
, or other shared configs.AGENTS.md
Preflight Checklist
- Confirm tool availability:
(expect npm packagecommand -v codex
).@openai/codex
.command -v ./tools/claude-code.sh
- Verify configuration files:
exists with filesystem/git servers (.mcp.json
recreates if missing).tools/codex.sh
up to date with latest guidelines.AGENTS.md
- Run snapshot diagnostics:
(logs totools/project-health-check.sh --summary
)._logs/health-checks/
andnvidia-smi
to ensure resource headroom before long autonomous jobs.df -h
Core Playbooks
1. Full Daily Autonomy Loop
— quick fixes + TODO updates../tools/claude-code.sh --no-check "Run proactive maintenance checklist"
— longer Codex session with MCP../tools/codex.sh --max --exec "Perform deep optimization sweep across dataset, training, and research modules"- Append combined findings to
(capture commands run, deltas, next steps).docs/ops/autonomy_log.md
→ review diffs, ensure no unintended drift.git status --short
2. Incident Response (multi-domain failure)
- Launch Claude for rapid triage:
../tools/claude-code.sh "Investigate failed training and prepare briefing for Codex" - Pass Claude findings to Codex:
../tools/codex.sh --exec "Read docs/ops/incident_brief.md and propose remedial plan" - Execute agreed actions (dataset rebuild, training rerun, quality checks).
- Update
with both agents’ actions and resolutions.docs/ops/incident_log.md
3. Scheduled Autonomous Optimization
- Add cron entry (example):
.0 2 * * * cd /workspace/gogooku3 && ./tools/codex.sh --max >> _logs/daily-optimization.log 2>&1 - Pair with weekly Claude sweep:
.0 7 * * MON cd /workspace/gogooku3 && ./tools/claude-code.sh --no-check >> _logs/weekly-claude.log 2>&1 - Summarize improvements weekly by aggregating
and_logs/codex-sessions/*.log
; record highlights in_logs/claude-code/*.log
.docs/ops/weekly_autonomy_report.md
Shared Context Management
- Ensure both agents read
andCLAUDE.md
before editing high-risk files.AGENTS.md - Rotate
anddocs/SKILLS_GUIDE.md
when procedures change.claude/skills/ - Store cross-agent decisions in
orEXPERIMENT_STATUS.md
.docs/ops/autonomy_log.md
Failure Handling
- Codex CLI missing →
.npm install -g @openai/codex - MCP server errors → regenerate
via.mcp.json
and restart run../tools/codex.sh - Conflicting edits → consolidate drafts under
and resolve manually.docs/ops/autonomy_pending/ - Resource saturation → stagger Claude/Codex runs, cap concurrency in cron.
Handoff
- After combined sessions, notify stakeholders by updating
.docs/ops/weekly_autonomy_report.md - Attach session logs
and_logs/codex-sessions/*.log
for audit._logs/claude-code/*.log - Re-sync
so Claude picks up any procedural updates discovered during Codex runs.~/.claude/skills/