Marketplace agent-browser
Browser automation CLI for AI agents. Use for website interaction, form automation, screenshots, scraping, and web app verification. Prefer snapshot refs (@e1, @e2) for deterministic actions.
install
source · Clone the upstream repo
git clone https://github.com/aiskillstore/marketplace
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/aiskillstore/marketplace "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/supercent-io/agent-browser" ~/.claude/skills/aiskillstore-marketplace-agent-browser-541936 && rm -rf "$T"
manifest:
skills/supercent-io/agent-browser/SKILL.mdsource content
agent-browser - Browser Automation for AI Agents
When to use this skill
- Open websites and automate UI actions
- Fill forms, click controls, and verify outcomes
- Capture screenshots/PDFs or extract content
- Run deterministic web checks with accessibility refs
- Execute parallel browser tasks via isolated sessions
Core workflow
Always use the deterministic ref loop:
agent-browser open <url>agent-browser snapshot -i- interact with refs (
,@e1
, ...)@e2
again after page/DOM changesagent-browser snapshot -i
agent-browser open https://example.com/form agent-browser wait --load networkidle agent-browser snapshot -i agent-browser fill @e1 "user@example.com" agent-browser click @e2 agent-browser snapshot -i
Command patterns
Use
&& chaining when intermediate output is not needed.
# Good chaining: open -> wait -> snapshot agent-browser open https://example.com && agent-browser wait --load networkidle && agent-browser snapshot -i # Separate calls when output is needed first agent-browser snapshot -i # parse refs agent-browser click @e2
High-value commands:
- Navigation:
,openclose - Snapshot:
,snapshot -i
,snapshot -i -Csnapshot -s "#selector" - Interaction:
,click
,fill
,type
,select
,checkpress - Verification:
,diff snapshotdiff screenshot --baseline <file> - Capture:
,screenshot
,screenshot --annotatepdf - Wait:
,wait --load networkidlewait <selector|@ref|ms>
Verification patterns
Use explicit evidence after actions.
# Baseline -> action -> verify structure agent-browser snapshot -i agent-browser click @e3 agent-browser diff snapshot # Visual regression agent-browser screenshot baseline.png agent-browser click @e5 agent-browser diff screenshot --baseline baseline.png
Safety and reliability
- Refs are invalid after navigation or significant DOM updates; re-snapshot before next action.
- Prefer
or selector/ref waits over fixed sleeps.wait --load networkidle - For multi-step JS, use
(or base64) to avoid shell escaping breakage.eval --stdin - For concurrent tasks, isolate with
.--session <name> - Use output controls in long pages to reduce context flooding.
- Optional hardening in sensitive flows: domain allowlist and action policies.
Optional hardening examples:
# Wrap page content with boundaries to reduce prompt-injection risk export AGENT_BROWSER_CONTENT_BOUNDARIES=1 # Limit output volume for long pages export AGENT_BROWSER_MAX_OUTPUT=50000 # Restrict navigation and network to trusted domains export AGENT_BROWSER_ALLOWED_DOMAINS="example.com,*.example.com" # Restrict allowed action types export AGENT_BROWSER_ACTION_POLICY=./policy.json
Example
policy.json:
{"default":"deny","allow":["navigate","snapshot","click","fill","scroll","wait","get"],"deny":["eval","download","upload","network","state"]}
CLI-flag equivalent:
agent-browser --content-boundaries --max-output 50000 --allowed-domains "example.com,*.example.com" --action-policy ./policy.json open https://example.com
Troubleshooting
: install and runcommand not found
.agent-browser install- Wrong element clicked: run
again and use fresh refs.snapshot -i - Dynamic SPA content missing: wait with
or targeted--load networkidle
selector.wait - Session collisions: assign unique
names and close each session.--session - Large output pressure: narrow snapshots (
,-i
,-c
,-d
) and extract only needed text.-s
References
Deep-dive docs in this skill:
Related resources:
Ready templates:
./templates/form-automation.sh./templates/capture-workflow.sh
Metadata
- Version: 1.1.0
- Last updated: 2026-02-26
- Scope: deterministic browser automation for agent workflows