Skills powerskills-browser
Edge browser automation via Chrome DevTools Protocol (CDP). List tabs, navigate, take screenshots, extract page content/HTML, execute JavaScript, click elements, type text, fill forms, scroll. Use when needing to control Edge browser, scrape web content, automate web forms, or take browser screenshots on Windows. Requires Edge with --remote-debugging-port=9222.
install
source · Clone the upstream repo
git clone https://github.com/openclaw/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/aloth/powerskills/skills/browser" ~/.claude/skills/openclaw-skills-powerskills-browser && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/aloth/powerskills/skills/browser" ~/.openclaw/skills/openclaw-skills-powerskills-browser && rm -rf "$T"
manifest:
skills/aloth/powerskills/skills/browser/SKILL.mdsource content
PowerSkills — Browser
Edge browser automation via CDP (Chrome DevTools Protocol).
Requirements
- Microsoft Edge running with remote debugging:
Start-Process "msedge" -ArgumentList "--remote-debugging-port=9222" - Default port configurable in
(config.json
)edge_debug_port
Actions
.\powerskills.ps1 browser <action> [--params]
| Action | Params | Description |
|---|---|---|
| List open browser tabs | |
| | Navigate to URL |
| | Capture page as PNG |
| | Get page text content |
| | Get full page HTML |
| | Execute JavaScript expression |
| | Click element by CSS selector |
| | Type into element |
| | Open new tab |
| | Close tab by ID |
| | Scroll page |
| | Fill multiple form fields |
| | Wait N seconds (default: 3) |
Examples
# List open tabs .\powerskills.ps1 browser tabs # Navigate and screenshot .\powerskills.ps1 browser navigate --url "https://example.com" .\powerskills.ps1 browser screenshot --out-file page.png # Extract page text .\powerskills.ps1 browser content # Run JavaScript .\powerskills.ps1 browser evaluate --expression "document.title" # Fill a login form .\powerskills.ps1 browser fill --fields-json '[{"selector":"#user","value":"alex"},{"selector":"#pass","value":"secret","submit":"#login"}]'
Multi-Tab Support
Pass
--target-id (from tabs output) to operate on a specific tab. Without it, actions target the first page.
Fill Fields Format
JSON array of objects with
selector, value, and optional submit:
[ {"selector": "#search-input", "value": "PowerShell automation"}, {"selector": "#filter-type", "value": "recent", "submit": "#apply-btn"} ]
Supports text inputs, selects, and checkboxes. Last field can include
submit to click a button.