Nexus-agents codex-delegator
install
source · Clone the upstream repo
git clone https://github.com/williamzujkowski/nexus-agents
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/williamzujkowski/nexus-agents "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/codex-delegator" ~/.claude/skills/williamzujkowski-nexus-agents-codex-delegator && rm -rf "$T"
manifest:
skills/codex-delegator/SKILL.mdsource content
Codex Delegator Skill
<!-- CANONICAL SOURCES: - docs/architecture/ROUTING_SYSTEM.md - packages/nexus-agents/src/mcp/tools/delegate-to-model.ts - packages/nexus-agents/src/cli-adapters/adapters/codex-adapter.ts -->Full documentation:
Real-World Performance (Tested 2026-01-18)
| Metric | Value | Notes |
|---|---|---|
| Model version | gpt-5.2-codex | Latest research preview |
| Latency (simple) | 3-4 seconds | Single function generation |
| Latency (complex) | 5-6 seconds | Interface + class boilerplate |
| Token overhead | ~900-1050 per session | Fixed session initialization cost |
| Default sandbox | read-only | Safe for code generation |
| Code quality | Production-ready | Proper generics, types, error handling |
When to Use Codex
Codex excels at:
| Task Type | Examples |
|---|---|
| Code Generation | Implement functions, create classes, add features |
| Test Generation | Write unit tests, integration tests, test fixtures |
| Refactoring | Extract methods, rename variables, restructure |
| Code Completion | Complete partial implementations, fill in stubs |
| Bulk Code Changes | Apply patterns across multiple files |
Do NOT use for: Architecture decisions, security review, documentation-heavy tasks.
Delegation Methods
Method 1: MCP Tool (Recommended)
Use the
delegate_to_model MCP tool for intelligent routing:
delegate_to_model(task: "Implement a binary search function in TypeScript")
The CompositeRouter automatically routes to Codex when:
- Task contains code generation keywords ("implement", "write", "create")
- Task is primarily code-focused (low reasoning complexity)
- Budget constraints favor Codex cost efficiency
Method 2: Direct CLI Execution
For explicit Codex invocation, use
codex exec for non-interactive mode:
# Basic code generation (non-interactive) codex exec "Implement a binary search function in TypeScript" # Via nexus-agents CLI nexus-agents orchestrate "Implement sorting algorithm" --cli=codex # Explicit model selection (default is gpt-5.2-codex) codex exec -m gpt-5.2-codex "Implement a debounce function" # Full autonomous mode (dangerous - writes files) codex exec --full-auto "Add error handling to all functions in utils.ts"
Important: Use
codex exec NOT codex -p. The -p flag is for interactive sessions.
Method 3: Routing Audit
Check routing decisions before execution:
nexus-agents routing-audit "Implement authentication middleware" --format=json
Task Profiling
The router analyzes tasks using these signals:
| Signal | Boost Codex When |
|---|---|
| Keywords: implement, write, create |
| No "design", "architect", "analyze" |
| Keywords: quick, simple, straightforward |
| References to APIs, frameworks, libs |
Examples
Generate Unit Tests
delegate_to_model(task: "Write Jest tests for the UserService class covering: - createUser with valid input - createUser with invalid email - getUserById with existing user - getUserById with non-existent user")
Implement Feature
delegate_to_model(task: "Implement a rate limiter middleware for Express.js that: - Uses sliding window algorithm - Supports configurable limits per route - Returns 429 status when limit exceeded")
Refactor Code
delegate_to_model(task: "Refactor the payment processing module to: - Extract validation logic into separate functions - Replace callbacks with async/await - Add TypeScript types for all parameters")
Bulk Changes
delegate_to_model(task: "Add JSDoc comments to all exported functions in src/utils/")
Routing Decision Flow
Task Input | v TaskAnalyzer (profile task) | v BudgetRouter (filter by constraints) | v TopsisRouter (rank: quality 50%, cost 30%, latency 20%) | v LinUCBBandit (contextual learning) | v Codex selected if: high codeGeneration + low reasoningComplexity
Codex Capabilities
| Capability | Score | Notes |
|---|---|---|
| Code Generation | 0.95 | Primary strength |
| Context Window | 400K | Large file support |
| Cost Efficiency | High | Lower than Claude for code tasks |
| Latency | 3-6s | Fast for focused code tasks |
Best Practices (From Real Testing)
- Be explicit in prompts - Include "Just output the function/code, no explanation" for cleaner output
- Batch related requests - Session overhead is ~900 tokens, batch similar generations
- Validate generated code - Run
on TypeScript before integrationtsc --noEmit - Set appropriate timeouts - Use 90s timeout for complex generations
- Handle ambiguity - Codex asks clarifying questions for ambiguous prompts (graceful)
Fallback Behavior
If Codex is unavailable:
- Circuit breaker detects failure
- Router falls back to Claude or Gemini
- Task executes with alternative CLI
- Outcome recorded for learning
Configuration
# nexus-agents.yaml routing: enableTopsisRanking: true enableLinUCBSelection: true topsis: qualityWeight: 0.5 costWeight: 0.3 latencyWeight: 0.2
Debugging
# Check Codex availability nexus-agents doctor # View routing decision for task nexus-agents routing-audit "task description" --verbose # Show bandit learning stats nexus-agents routing-audit "task" --bandit-stats