Skills openclaw-media-gen
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
install
source · Clone the upstream repo
git clone https://github.com/openclaw/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/aisadocs/openclaw-aisa-llm-image-video-qwen-wan26-gemini-3-pro-image" ~/.claude/skills/clawdbot-skills-openclaw-media-gen-20fd5a && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/aisadocs/openclaw-aisa-llm-image-video-qwen-wan26-gemini-3-pro-image" ~/.openclaw/skills/clawdbot-skills-openclaw-media-gen-20fd5a && rm -rf "$T"
manifest: skills/aisadocs/openclaw-aisa-llm-image-video-qwen-wan26-gemini-3-pro-image/SKILL.md
source content
OpenClaw Media Gen 🎬
Generate images and videos with one AIsa API key:
- Image: gemini-3-pro-image-preview (Gemini GenerateContent)
- Video: wan2.6-t2v (Qwen Wan 2.6 / Tongyi Wanxiang, async task)
API Reference: AIsa API Reference (all pages available at https://aisa.mintlify.app/llms.txt)
🎯 Pricing Advantage
Video Generation (WAN) - Cost Comparison
| Resolution | AIsa (Contract) | AIsa (Official) | Bailian (Official) | OpenRouter |
|---|---|---|---|---|
| 720P | $0.06/sec | ~$0.08 | ~$0.10 | ❌ |
| 1080P | $0.09/sec | ~$0.12 | ~$0.15 | ❌ |
| Pro/Animate | $0.108–0.156 | ~$0.18 | ~$0.25 | ❌ |
Key Benefits:
- 25-40% cheaper than Bailian official pricing
- OpenRouter doesn't support video generation; AIsa is the only unified API with video generation
- Contract pricing available for production workloads
- Single API key for both image and video generation
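To make the comparison concrete, the following sketch estimates the cost of a clip and the saving relative to Bailian. The rates are copied from the table above; treating every column as a per-second price is an assumption (only the AIsa contract column is labelled "/sec"), and the function names are illustrative.

```python
# Per-second rates copied from the comparison table above (USD).
# Assumption: every column is billed per second of generated video.
RATES = {
    "720P": {"aisa_contract": 0.06, "aisa_official": 0.08, "bailian_official": 0.10},
    "1080P": {"aisa_contract": 0.09, "aisa_official": 0.12, "bailian_official": 0.15},
}


def clip_cost(resolution: str, seconds: float, plan: str) -> float:
    """Estimated cost in USD of one clip at the given resolution and plan."""
    return round(RATES[resolution][plan] * seconds, 4)


def savings_pct(resolution: str, plan: str, baseline: str = "bailian_official") -> float:
    """Percentage saved by `plan` relative to `baseline` at one resolution."""
    rates = RATES[resolution]
    return round(100 * (1 - rates[plan] / rates[baseline]), 1)


if __name__ == "__main__":
    for res in RATES:
        print(res, clip_cost(res, 5, "aisa_contract"), savings_pct(res, "aisa_contract"))
```

With the table's figures, a 5-second 720P clip at the contract rate comes to about $0.30, which is 40% below the listed Bailian rate.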
🔥 What You Can Do
Image Generation (Gemini)
"Generate a cyberpunk cityscape at night, neon lights, rainy, cinematic"
Video Generation (Wan 2.6)
"Use a reference image to generate a 5-second shot: slow camera push-in, wind blowing hair, cinematic, shallow depth of field"
Quick Start
export AISA_API_KEY="your-key"
🖼️ Image Generation (Gemini)
Endpoint
- Base URL: https://api.aisa.one/v1
- POST /models/{model}:generateContent
Documentation: google-gemini-chat (GenerateContent) at https://aisa.mintlify.app/api-reference/chat/chat-api/google-gemini-chat.md
curl Example (returns inline_data for images)
curl -X POST "https://api.aisa.one/v1/models/gemini-3-pro-image-preview:generateContent" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [
      {"role": "user", "parts": [{"text": "A cute red panda, ultra-detailed, cinematic lighting"}]}
    ]
  }'
Note: Response may contain candidates[].parts[].inline_data (typically with base64 data and mime type); client script automatically parses and saves the file.
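For readers who want to handle the response without the bundled client, here is a minimal sketch of the same call and of the inline-data decoding step, using only the Python standard library. It is not the client script's actual implementation. The exact response layout is an assumption: the sketch accepts the `candidates[].parts[]` path named in the note as well as the `candidates[].content.parts[]` path and camelCase keys used in Google's own Gemini responses. `generate_content` and `extract_images` are hypothetical helper names.

```python
import base64
import json
import os
import urllib.request

API_BASE = "https://api.aisa.one/v1"
MODEL = "gemini-3-pro-image-preview"


def generate_content(prompt: str) -> dict:
    """POST the prompt to the generateContent endpoint and return the parsed JSON."""
    body = {"contents": [{"role": "user", "parts": [{"text": prompt}]}]}
    req = urllib.request.Request(
        f"{API_BASE}/models/{MODEL}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {os.environ['AISA_API_KEY']}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def extract_images(response: dict) -> list[tuple[str, bytes]]:
    """Return (mime_type, raw_bytes) for every inline image part found.

    Assumption: parts sit under candidates[].parts[] or
    candidates[].content.parts[]; snake_case and camelCase keys are both accepted.
    """
    images = []
    for cand in response.get("candidates", []):
        parts = cand.get("parts") or cand.get("content", {}).get("parts", [])
        for part in parts:
            blob = part.get("inline_data") or part.get("inlineData")
            if blob and "data" in blob:
                mime = blob.get("mime_type") or blob.get("mimeType") or "application/octet-stream"
                images.append((mime, base64.b64decode(blob["data"])))
    return images
```

A caller would pass the result of `generate_content(...)` to `extract_images` and write each returned byte string to a file whose extension matches the MIME type.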
🎞️ Video Generation (Qwen Wan 2.6 / Tongyi Wanxiang)
Create Task
- Base URL: https://api.aisa.one/apis/v1
- POST /services/aigc/video-generation/video-synthesis
- Header: X-DashScope-Async: enable (required for async)
Documentation: video-generation at https://aisa.mintlify.app/api-reference/aliyun/video/video-generation.md
curl -X POST "https://api.aisa.one/apis/v1/services/aigc/video-generation/video-synthesis" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -H "X-DashScope-Async: enable" \
  -d '{
    "model": "wan2.6-t2v",
    "input": {
      "prompt": "cinematic close-up, slow push-in, shallow depth of field",
      "img_url": "https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg"
    },
    "parameters": {
      "resolution": "720P",
      "duration": 5,
      "shot_type": "single",
      "watermark": false
    }
  }'
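The same create-task request can be assembled in Python. The sketch below mirrors the curl payload and headers; `build_create_request` and `create_task` are hypothetical helpers, and reading the task id from `output.task_id` is an assumption based on DashScope-style responses rather than something this page confirms.

```python
import json
import os
import urllib.request

CREATE_URL = "https://api.aisa.one/apis/v1/services/aigc/video-generation/video-synthesis"


def build_create_request(prompt: str, img_url: str, api_key: str,
                         resolution: str = "720P", duration: int = 5) -> urllib.request.Request:
    """Assemble the async create-task request without sending it."""
    payload = {
        "model": "wan2.6-t2v",
        "input": {"prompt": prompt, "img_url": img_url},
        "parameters": {
            "resolution": resolution,
            "duration": duration,
            "shot_type": "single",
            "watermark": False,
        },
    }
    return urllib.request.Request(
        CREATE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            "X-DashScope-Async": "enable",  # required for async task creation
        },
        method="POST",
    )


def create_task(prompt: str, img_url: str) -> str:
    """Send the request and return the task id.

    Assumption: the id is at output.task_id, as in DashScope-style responses.
    """
    req = build_create_request(prompt, img_url, os.environ["AISA_API_KEY"])
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["output"]["task_id"]
```

Keeping request construction separate from sending makes the payload easy to inspect or unit-test without a network call.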
Poll Task Status
GET /services/aigc/tasks?task_id=...
Documentation: task at https://aisa.mintlify.app/api-reference/aliyun/video/task.md
curl "https://api.aisa.one/apis/v1/services/aigc/tasks?task_id=YOUR_TASK_ID" \
  -H "Authorization: Bearer $AISA_API_KEY"
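Because the task is asynchronous, a caller typically repeats this GET until the task finishes. The loop below is a minimal sketch of that pattern, not the bundled client's code. It assumes the status is reported at `output.task_status` (with values such as `SUCCEEDED` and `FAILED`) and the result at `output.video_url`, as in DashScope-style responses; check the task documentation for the actual field names. `fetch_task` and `wait_for_video` are hypothetical names.

```python
import json
import os
import time
import urllib.parse
import urllib.request
from typing import Callable

TASKS_URL = "https://api.aisa.one/apis/v1/services/aigc/tasks"


def fetch_task(task_id: str) -> dict:
    """GET the task record for one task id."""
    url = f"{TASKS_URL}?{urllib.parse.urlencode({'task_id': task_id})}"
    req = urllib.request.Request(
        url, headers={"Authorization": f"Bearer {os.environ['AISA_API_KEY']}"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_video(task_id: str, fetch: Callable[[str], dict] = fetch_task,
                   poll: float = 10, timeout: float = 600) -> str:
    """Poll until the task succeeds and return its video URL.

    Assumption: status is at output.task_status and the result at
    output.video_url, as in DashScope-style responses.
    """
    deadline = time.monotonic() + timeout
    while True:
        output = fetch(task_id).get("output", {})
        status = output.get("task_status")
        if status == "SUCCEEDED":
            return output["video_url"]
        if status in {"FAILED", "CANCELED", "UNKNOWN"}:
            raise RuntimeError(f"task {task_id} ended with status {status}")
        if time.monotonic() + poll > deadline:
            raise TimeoutError(f"task {task_id} not finished after {timeout}s")
        time.sleep(poll)
```

The `fetch` parameter is injectable so the loop can be exercised with canned responses; the default poll interval and timeout match the values used in the client example on this page.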
Python Client
# Generate image (save to local file)
python3 {baseDir}/scripts/media_gen_client.py image \
  --prompt "A cute red panda, cinematic lighting" \
  --out "out.png"

# Create video task (requires img_url)
python3 {baseDir}/scripts/media_gen_client.py video-create \
  --prompt "cinematic close-up, slow push-in" \
  --img-url "https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg" \
  --duration 5

# Poll task status
python3 {baseDir}/scripts/media_gen_client.py video-status --task-id YOUR_TASK_ID

# Wait until success (optional: print video_url on success)
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --poll 10 --timeout 600

# Wait until success and auto-download mp4
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --download --out out.mp4
💡 Use Cases
- AI Agents: Automate visual content generation for social media and marketing materials
- Content Creators: Generate custom images and videos programmatically
- Developers: Build apps with multimodal generation capabilities
- Businesses: Cost-effective alternative to Bailian
🚀 Why AIsa for Media Generation?
- Unified API: Single key for both images (Gemini) and videos (WAN)
- Best Pricing: 25-40% cheaper than Bailian official pricing
- Production Ready: Contract pricing and enterprise support available
- No Competition: OpenRouter doesn't support video generation
- Simple Integration: Python client with async task management built-in