Skills selenium-browser
Start a Selenium‑controlled Chrome browser, open a URL, take a screenshot, and report progress. Supports headless mode and optional proxy.
install
source · Clone the upstream repo
git clone https://github.com/openclaw/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/andreybespalov89/selenium-browser" ~/.claude/skills/clawdbot-skills-selenium-browser && rm -rf "$T"
manifest:
skills/andreybespalov89/selenium-browser/SKILL.mdsource content
Usage
The skill triggers on any message that contains Chrome, browser, Selenium, screenshot, or open.
selenium-browser <URL> [--headless] [--proxy=<url>]
Command flow
- Launch Chrome (or Chromium) under Selenium.
- Navigate to
.<URL> - Take a screenshot of the loaded page.
- Save the image in
and report the path back to the chat./home/main/clawd/diffusion_pdfs/ - If anything fails, send an error message.
Scripts
scripts/launch_browser.py
#!/usr/bin/env python3 import os import sys import time import base64 from selenium import webdriver from selenium.webdriver.chrome.service import Service from selenium.webdriver.chrome.options import Options # CLI parsing import argparse parser = argparse.ArgumentParser(description="Launch Selenium Chrome and take a screenshot.") parser.add_argument("url", help="URL to open") parser.add_argument("--headless", action="store_true", help="Run Chrome headless") parser.add_argument("--proxy", help="Proxy URL (e.g., http://proxy:3128)") args = parser.parse_args() # Prepare Chrome options chrome_options = Options() if args.headless: chrome_options.add_argument("--headless") chrome_options.add_argument("--disable-gpu") chrome_options.add_argument("--no-sandbox") chrome_options.add_argument("--disable-dev-shm-usage") if args.proxy: chrome_options.add_argument(f"--proxy-server={args.proxy}") # Locate binaries chrome_bin = os.getenv("CHROME_BIN", "/usr/bin/google-chrome") chromedriver_path = os.getenv("CHROMEDRIVER_PATH", "/usr/local/bin/chromedriver") service = Service(executable_path=chromedriver_path) # Start browser try: driver = webdriver.Chrome(service=service, options=chrome_options) except Exception as e: print(f"❌ Failed to start Chrome: {e}", file=sys.stderr) sys.exit(1) # Navigate and wait for page load try: driver.get(args.url) time.sleep(5) # simple wait; can replace with WebDriverWait for better reliability except Exception as e: print(f"❌ Navigation error: {e}", file=sys.stderr) driver.quit() sys.exit(1) # Take screenshot screenshot_path = os.path.join(os.getenv("HOME", "/tmp"), "screenshot.png") try: driver.save_screenshot(screenshot_path) except Exception as e: print(f"❌ Screenshot error: {e}", file=sys.stderr) driver.quit() sys.exit(1) # Clean up driver.quit() # Output a JSON object that OpenClaw can parse for the reply print({"status": "ok", "screenshot": screenshot_path})
scripts/_env.sh
# Optional: set paths to Chrome/Chromedriver if not in standard locations # export CHROME_BIN="/opt/google/chrome/google-chrome" # export CHROMEDRIVER_PATH="/usr/local/bin/chromedriver"
References
How the skill reports
The skill runs the Python script and captures its stdout as a JSON payload. OpenClaw parses the JSON and sends a message back:
✅ Screenshot saved: /home/main/clawd/diffusion_pdfs/screenshot.png
If the script prints an error, the skill forwards the error text.
Installation notes
- Make sure
is inchromedriver
or set/usr/local/bin/chromedriver
.CHROMEDRIVER_PATH - Make sure
(orgoogle-chrome
) is inchromium
or set/usr/bin/google-chrome
.CHROME_BIN - Install Python dependencies:
(inside the virtual env you use for the skill).pip install selenium
pip install selenium
Logging & timeouts
The script uses a 5‑second static wait after navigation; replace with Selenium's
WebDriverWait for dynamic waits.
If you encounter timeouts, adjust the
time.sleep(5) value or use WebDriverWait(driver, 20).until(...).
Feel free to tweak the script to fit your environment (proxy, authentication, etc.).