Skills browser
Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications. Supports remote Browserbase sessions with automatic CAPTCHA solving, anti-bot stealth mode, and residential proxies — ideal for scraping protected websites, bypassing bot detection, and interacting with JavaScript-heavy pages.
git clone https://github.com/browserbase/skills
T=$(mktemp -d) && git clone --depth=1 https://github.com/browserbase/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/browser" ~/.claude/skills/browserbase-skills-browser && rm -rf "$T"
skills/browser/SKILL.mdBrowser Automation
Automate browser interactions using the browse CLI with Claude.
Setup check
Before running any browser commands, verify the CLI is available:
which browse || npm install -g @browserbasehq/browse-cli
Environment Selection (Local vs Remote)
The CLI supports explicit per-session environment overrides. If you do nothing, the next session defaults to Browserbase when
BROWSERBASE_API_KEY is set and to local otherwise.
Local mode
starts a clean isolated local browserbrowse env local
reuses an already-running debuggable Chrome and falls back to isolated if nothing is availablebrowse env local --auto-connect
attaches to a specific CDP targetbrowse env local <port|url>- Best for: development, localhost, trusted sites, and reproducible runs
Remote mode (Browserbase)
switches the current session to Browserbasebrowse env remote- Without a local override, Browserbase is also the default when
is setBROWSERBASE_API_KEY - Provides: anti-bot stealth, automatic CAPTCHA solving, residential proxies, session persistence
- Use remote mode when: the target site has bot detection, CAPTCHAs, IP rate limiting, Cloudflare protection, or requires geo-specific access
- Get credentials at https://browserbase.com/settings
When to choose which
- Repeatable local testing / clean state:
browse env local - Reuse your local login/cookies:
browse env local --auto-connect - Simple browsing (docs, wikis, public APIs): local mode is fine
- Protected sites (login walls, CAPTCHAs, anti-scraping): use remote mode
- If local mode fails with bot detection or access denied: switch to remote mode
Commands
All commands work identically in both modes. The daemon auto-starts on first command.
Navigation
browse open <url> # Go to URL (aliases: goto) browse open <url> --context-id <id> # Load Browserbase context (remote only) browse open <url> --context-id <id> --persist # Load context + save changes back browse reload # Reload current page browse back # Go back in history browse forward # Go forward in history
Page state (prefer snapshot over screenshot)
browse snapshot # Get accessibility tree with element refs (fast, structured) browse screenshot [path] # Take visual screenshot (slow, uses vision tokens) browse get url # Get current URL browse get title # Get page title browse get text <selector> # Get text content (use "body" for all text) browse get html <selector> # Get HTML content of element browse get value <selector> # Get form field value
Use
browse snapshot as your default for understanding page state — it returns the accessibility tree with element refs you can use to interact. Only use browse screenshot when you need visual context (layout, images, debugging).
Interaction
browse click <ref> # Click element by ref from snapshot (e.g., @0-5) browse type <text> # Type text into focused element browse fill <selector> <value> # Fill input and press Enter browse select <selector> <values...> # Select dropdown option(s) browse press <key> # Press key (Enter, Tab, Escape, Cmd+A, etc.) browse drag <fromX> <fromY> <toX> <toY> # Drag from one point to another browse scroll <x> <y> <deltaX> <deltaY> # Scroll at coordinates browse highlight <selector> # Highlight element on page browse is visible <selector> # Check if element is visible browse is checked <selector> # Check if element is checked browse wait <type> [arg] # Wait for: load, selector, timeout
Session management
browse stop # Stop the browser daemon (also clears env override) browse status # Check daemon status (includes env) browse env # Show current environment (local or remote) browse env local # Use clean isolated local browser browse env local --auto-connect # Reuse existing Chrome, fallback to isolated browse env local <port|url> # Attach to a specific CDP target browse env remote # Switch to Browserbase (requires API keys) browse pages # List all open tabs browse tab_switch <index> # Switch to tab by index browse tab_close [index] # Close tab
Typical workflow
If the environment matters, set it first with
browse env local, browse env local --auto-connect, or browse env remote.
— navigate to the pagebrowse open <url>
— read the accessibility tree to understand page structure and get element refsbrowse snapshot
/browse click <ref>
/browse type <text>
— interact using refs from snapshotbrowse fill <selector> <value>
— confirm the action workedbrowse snapshot- Repeat 3-4 as needed
— close the browser when donebrowse stop
Quick Example
browse open https://example.com browse snapshot # see page structure + element refs browse click @0-5 # click element with ref 0-5 browse get title browse stop
Mode Comparison
| Feature | Local | Browserbase |
|---|---|---|
| Speed | Faster | Slightly slower |
| Setup | Chrome required | API key required |
| Reuse existing local cookies | With | N/A |
| Stealth mode | No | Yes (custom Chromium, anti-bot fingerprinting) |
| CAPTCHA solving | No | Yes (automatic reCAPTCHA/hCaptcha) |
| Residential proxies | No | Yes (201 countries, geo-targeting) |
| Session persistence | No | Yes (cookies/auth persist via contexts) |
| Best for | Development/simple pages | Protected sites, bot detection, production scraping |
Best Practices
- Choose the local strategy deliberately: use
for clean state,browse env local
for existing local credentials, andbrowse env local --auto-connect
for protected sitesbrowse env remote - Always
first before interactingbrowse open - Use
to check page state — it's fast and gives you element refsbrowse snapshot - Only screenshot when visual context is needed (layout checks, images, debugging)
- Use refs from snapshot to click/interact — e.g.,
browse click @0-5
when done to clean up the browser session and clear the env overridebrowse stop
Troubleshooting
- "No active page": Run
, then checkbrowse stop
. If it still says running, kill the zombie daemon withbrowse status
, then retrypkill -f "browse.*daemon"browse open - Chrome not found: Install Chrome, use
if you already have a debuggable Chrome running, or switch tobrowse env local --auto-connectbrowse env remote - Action fails: Run
to see available elements and their refsbrowse snapshot - Browserbase fails: Verify API key is set
Switching to Remote Mode
Switch to remote when you detect: CAPTCHAs (reCAPTCHA, hCaptcha, Turnstile), bot detection pages ("Checking your browser..."), HTTP 403/429, empty pages on sites that should have content, or the user asks for it.
Don't switch for simple sites (docs, wikis, public APIs, localhost).
browse env local # clean isolated local browser browse env local --auto-connect # reuse existing Chrome state browse env remote # switch to Browserbase
Overrides are scoped per session and stay in effect until you switch again or run
browse stop. After browse stop, the next start falls back to env-var-based auto detection. Use browse status to inspect the resolved local strategy while the daemon is running.
For detailed examples, see EXAMPLES.md. For API reference, see REFERENCE.md.