Skills doubao-web
Use Playwright to host a browser and call Doubao Web's image generation function. Call this skill when the user requests to draw or generate an image using Doubao. (使用 Playwright 托管浏览器的方式,调用豆包 Web 端生图功能。当用户要求使用豆包画图、生成图片时调用此技能。)
install
source · Clone the upstream repo
git clone https://github.com/openclaw/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/atmosphere16happy/doubao-web-image" ~/.claude/skills/clawdbot-skills-doubao-web && rm -rf "$T"
manifest:
skills/atmosphere16happy/doubao-web-image/SKILL.mdsource content
Doubao Web Image Generator
This project/skill uses Playwright to automate browser control, directly utilizing the real environment of Doubao Web for image generation, perfectly bypassing the
a_bogus signature risk control issue. (这个项目/技能通过 Playwright 自动化控制浏览器的方式,直接利用豆包 Web 端的真实环境进行图片生成,从而完美避开 a_bogus 签名风控问题。)
Features (功能)
- Auto-save login status to
(自动保存登录状态在~/.doubao-web-session
)~/.doubao-web-session - Send image generation Prompts in a real browser environment (在真实浏览器环境中发送生图 Prompt)
- Intercept and parse SSE stream responses to get the watermark-free original image URL (拦截并解析 SSE 流式响应,获取无水印原图 URL)
How to Run (如何运行)
# Default headless mode (silent background run) and original image quality, saving to generated.png # 默认使用无头模式 (后台静默运行) 和 获取原图画质,并默认保存为 generated.png npx ts-node scripts/main.ts "A cyberpunk style cat (一只赛博朋克风格的猫咪)" # Specify image save path (--output or --image) # 指定图片保存路径 (--output 或 --image) npx ts-node scripts/main.ts "A cyberpunk style cat" --output="./my_cyber_cat.png" # Specify image quality (--quality=preview or --quality=original) # preview fetches faster, original attempts to get high-res quality (default) # 指定图片画质 (--quality=preview 或 --quality=original) # preview 抓取速度更快,original 尝试获取大图画质 (默认) npx ts-node scripts/main.ts "A cyberpunk style cat" --quality=preview --output="./preview_cat.png" # For the first run or when login is required, you must use the --ui parameter to show the browser window # 首次运行或需要登录时,必须使用 --ui 参数显示浏览器窗口 npx ts-node scripts/main.ts "Test" --ui
Command Line Arguments (命令行参数说明)
| Parameter (参数) | Description (说明) | Default (默认值) |
|---|---|---|
| (Required) Prompt for generating the image / (必填) 生成图片的提示词 | |
/ | Local path to save the downloaded image / 图片下载保存的本地路径 | |
| Image quality requirement: or (High-res) / 图片画质要求,可选 (预览图) 或 (高清原图) | |
| Image aspect ratio selection. Supported: (Square avatar), (Social media selfie), (Classic photo), (Article illustration), (Mobile wallpaper portrait), (Desktop wallpaper landscape) / 图片比例选择,支持: (正方形头像), (社交媒体自拍), (经典比例拍照), (文章配图插画), (手机壁纸人像), (桌面壁纸风景) | |
| Show browser interface (must be used for first login) / 显示浏览器界面(首次登录时必须使用) | Background silent run (后台静默运行) |
, | Show help menu / 显示帮助菜单 |
Technical Principle (技术原理)
- Browser Hosting (浏览器托管): Use Playwright to launch a real Chromium browser, loading the user data directory. (利用 Playwright 启动一个真实的 Chromium 浏览器,加载用户数据目录。)
- UI Automation (UI 自动化): Locate the input box, auto-fill
and simulate pressing Enter. (定位输入框,自动填入Help me generate an image: {prompt}
并模拟回车。)帮我生成图片:{prompt} - Network Interception (网络拦截): Listen to the POST request response of
to get the complete SSE data stream. (监听/samantha/chat/completion
的 POST 请求响应,获取完整的 SSE 数据流。)/samantha/chat/completion - Data Parsing (数据解析): Use regex to match the
URL in the response stream. (使用正则匹配响应流中的image_ori
的 URL。)image_ori
Directory Structure (目录结构)
- Core Playwright client logic (核心 Playwright 客户端逻辑)scripts/doubao-webapi/client.ts
- Command line entry file (命令行入口文件)scripts/main.ts
- Project dependencies (项目依赖)package.json