Skills powerskills-browser

Edge browser automation via Chrome DevTools Protocol (CDP). List tabs, navigate, take screenshots, extract page content/HTML, execute JavaScript, click elements, type text, fill forms, scroll. Use when needing to control Edge browser, scrape web content, automate web forms, or take browser screenshots on Windows. Requires Edge with --remote-debugging-port=9222.

install
source · Clone the upstream repo
git clone https://github.com/openclaw/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/aloth/powerskills/skills/browser" ~/.claude/skills/openclaw-skills-powerskills-browser && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/aloth/powerskills/skills/browser" ~/.openclaw/skills/openclaw-skills-powerskills-browser && rm -rf "$T"
manifest: skills/aloth/powerskills/skills/browser/SKILL.md
source content

PowerSkills — Browser

Edge browser automation via CDP (Chrome DevTools Protocol).

Requirements

  • Microsoft Edge running with remote debugging:
    Start-Process "msedge" -ArgumentList "--remote-debugging-port=9222"
    
  • Default port configurable in
    config.json
    (
    edge_debug_port
    )

Actions

.\powerskills.ps1 browser <action> [--params]
ActionParamsDescription
tabs
List open browser tabs
navigate
--url URL
Navigate to URL
screenshot
--out-file path.png [--target-id id]
Capture page as PNG
content
[--target-id id]
Get page text content
html
[--target-id id]
Get full page HTML
evaluate
--expression "js"
Execute JavaScript expression
click
--selector "#btn"
Click element by CSS selector
type
--selector "#input" --text "hello"
Type into element
new-tab
--url URL
Open new tab
close-tab
--target-id id
Close tab by ID
scroll
--scroll-target top|bottom|selector
Scroll page
fill
--fields-json '[{"selector":"#a","value":"b"}]'
Fill multiple form fields
wait
--seconds N
Wait N seconds (default: 3)

Examples

# List open tabs
.\powerskills.ps1 browser tabs

# Navigate and screenshot
.\powerskills.ps1 browser navigate --url "https://example.com"
.\powerskills.ps1 browser screenshot --out-file page.png

# Extract page text
.\powerskills.ps1 browser content

# Run JavaScript
.\powerskills.ps1 browser evaluate --expression "document.title"

# Fill a login form
.\powerskills.ps1 browser fill --fields-json '[{"selector":"#user","value":"alex"},{"selector":"#pass","value":"secret","submit":"#login"}]'

Multi-Tab Support

Pass

--target-id
(from
tabs
output) to operate on a specific tab. Without it, actions target the first page.

Fill Fields Format

JSON array of objects with

selector
,
value
, and optional
submit
:

[
  {"selector": "#search-input", "value": "PowerShell automation"},
  {"selector": "#filter-type", "value": "recent", "submit": "#apply-btn"}
]

Supports text inputs, selects, and checkboxes. Last field can include

submit
to click a button.