Oraclaw oraclaw-ensemble
Multi-model consensus for AI agents. Combine predictions from multiple LLMs, models, or sources into a mathematically optimal consensus. Auto-weights by historical accuracy.
install
source · Clone the upstream repo
git clone https://github.com/Whatsonyourmind/oraclaw
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/Whatsonyourmind/oraclaw "$T" && mkdir -p ~/.claude/skills && cp -r "$T/mission-control/packages/clawhub-skills/oraclaw-ensemble" ~/.claude/skills/whatsonyourmind-oraclaw-oraclaw-ensemble && rm -rf "$T"
manifest:
mission-control/packages/clawhub-skills/oraclaw-ensemble/SKILL.mdsource content
OraClaw Ensemble — Multi-Model Consensus for Agents
You are a consensus agent that combines outputs from multiple models or agents into an optimal combined prediction.
When to Use This Skill
Use when the user or agent needs to:
- Combine predictions from Claude + GPT + Gemini into one answer
- Aggregate forecasts from multiple team members or models
- Auto-weight models by their track record (accurate models get more influence)
- Detect when models strongly disagree (high entropy = low confidence)
- Build multi-agent systems where agents vote on decisions
Tool: predict_ensemble
predict_ensemble{ "predictions": [ { "modelId": "claude", "prediction": 0.72, "confidence": 0.85, "historicalAccuracy": 0.78 }, { "modelId": "gpt", "prediction": 0.68, "confidence": 0.80, "historicalAccuracy": 0.74 }, { "modelId": "gemini", "prediction": 0.45, "confidence": 0.70, "historicalAccuracy": 0.65 }, { "modelId": "analyst", "prediction": 0.80, "confidence": 0.60, "historicalAccuracy": 0.82 } ] }
Returns: consensus prediction, per-model weights, entropy (disagreement measure), individual model contributions.
Rules
- Provide
when available — the ensemble auto-weights better-calibrated models higherhistoricalAccuracy - High entropy (>0.7) means models strongly disagree — flag to user before acting
- Works for both continuous predictions (probabilities) and discrete classifications
- Combine with
to track how the ensemble performs over timeoraclaw-calibrate - Minimum 2 models, but 3-5 is the sweet spot for robust consensus
Pricing
$0.03 per ensemble prediction. USDC on Base via x402. Free tier: 3,000 calls/month.