AGENTS-COLLECTION add-image-vision
Add image vision to NanoClaw agents. Resizes and processes WhatsApp image attachments, then sends them to Claude as multimodal content blocks.
install
source · Clone the upstream repo
git clone https://github.com/mk-knight23/AGENTS-COLLECTION
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/mk-knight23/AGENTS-COLLECTION "$T" && mkdir -p ~/.claude/skills && cp -r "$T/SKILLS/NANOCLAW/ADD-IMAGE-VISION" ~/.claude/skills/mk-knight23-agents-collection-add-image-vision && rm -rf "$T"
manifest:
SKILLS/NANOCLAW/ADD-IMAGE-VISION/SKILL.mdsource content
Image Vision Skill
Adds the ability for NanoClaw agents to see and understand images sent via WhatsApp. Images are downloaded, resized with sharp, saved to the group workspace, and passed to the agent as base64-encoded multimodal content blocks.
Phase 1: Pre-flight
- Check
for.nanoclaw/state.yaml
— skip if already appliedadd-image-vision - Confirm
is installable (native bindings require build tools)sharp
Phase 2: Apply Code Changes
-
Initialize the skills system if not already done:
npx tsx -e "import { initNanoclawDir } from './skills-engine/init.ts'; initNanoclawDir();" -
Apply the skill:
npx tsx skills-engine/apply-skill.ts add-image-vision -
Install new dependency:
npm install sharp -
Validate:
npm run typecheck npm test
Phase 3: Configure
-
Rebuild the container (agent-runner changes need a rebuild):
./container/build.sh -
Sync agent-runner source to group caches:
for dir in data/sessions/*/agent-runner-src/; do cp container/agent-runner/src/*.ts "$dir" done -
Restart the service:
launchctl kickstart -k gui/$(id -u)/com.nanoclaw
Phase 4: Verify
- Send an image in a registered WhatsApp group
- Check the agent responds with understanding of the image content
- Check logs for "Processed image attachment":
tail -50 groups/*/logs/container-*.log
Troubleshooting
- "Image - download failed": Check WhatsApp connection stability. The download may timeout on slow connections.
- "Image - processing failed": Sharp may not be installed correctly. Run
to verify.npm ls sharp - Agent doesn't mention image content: Check container logs for "Loaded image" messages. If missing, ensure agent-runner source was synced to group caches.