Claude-skill-registry browser-agent

install

source · Clone the upstream repo

git clone https://github.com/majiayu000/claude-skill-registry

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/browser-agent" ~/.claude/skills/majiayu000-claude-skill-registry-browser-agent && rm -rf "$T"

manifest: skills/data/browser-agent/SKILL.md

browser_agent - Autonomous Web Automation

Core Concept

mcp__plugin_kg_kodegen__browser_agent

is an autonomous agent that performs complex multi-step web tasks. It:

Plans and executes browser actions autonomously
Handles dynamic content and SPAs
Supports session management for parallel tasks
Runs in background with progress monitoring

Uses session-based execution. Actions: PROMPT, READ, KILL.

Key Parameters

action (required)

```
PROMPT
```
: Start agent with a task
```
READ
```
: Check agent progress
```
KILL
```
: Terminate agent session

task (required for PROMPT)

Natural language description of what to accomplish.

agent (optional, default: 0)

Agent slot number for parallel execution.

start_url (optional)

Initial URL to navigate to before starting task.

max_steps (optional, default: 10)

Maximum iterations the agent will perform.

await_completion_ms (optional, default: 600000)

Timeout in ms. 0 = fire-and-forget.

temperature (optional, default: 0.7)

LLM creativity (0.0-2.0).

max_actions_per_step (optional, default: 3)

Actions per iteration.

additional_info (optional)

Extra context for the agent.

Usage Examples

Start Agent Task

browser_agent({
  "action": "PROMPT",
  "task": "Find the latest Rust release notes and extract key features",
  "start_url": "https://www.rust-lang.org",
  "max_steps": 8
})

Form Filling Workflow

browser_agent({
  "action": "PROMPT",
  "task": "Fill out the contact form with name 'John Doe' and email 'john@example.com', then submit",
  "start_url": "https://example.com/contact",
  "temperature": 0.5,
  "max_actions_per_step": 2
})

Check Progress

browser_agent({
  "action": "READ",
  "agent": 0
})

Fire and Forget

browser_agent({
  "action": "PROMPT",
  "task": "Download the PDF from the resources page",
  "start_url": "https://example.com",
  "await_completion_ms": 0
})
// Returns immediately, agent works in background

Kill Agent

browser_agent({
  "action": "KILL",
  "agent": 0
})

Response Format

{
  "agent": 0,
  "task": "Find latest Rust release notes",
  "steps_taken": 5,
  "completed": true,
  "summary": "Found release notes for Rust 1.75...",
  "history": [
    {"step": 1, "actions": ["navigate"], "summary": "Navigated to rust-lang.org"},
    {"step": 2, "actions": ["click"], "summary": "Clicked releases link"}
  ]
}

When to Use What

Complexity	Tool	Use Case
Single action	browser_click/type/etc	Click a button, type text
2-3 actions	Manual tool calls	Login form, simple search
4+ actions	browser_agent	Multi-page workflows
Research	browser_research	Topic research with summaries

Configuration Guide

Scenario	Settings
Web research	temperature: 0.8, max_steps: 12
Form filling	temperature: 0.5, max_actions_per_step: 2
Complex workflow	max_steps: 15-20
Quick task	max_steps: 5, await_completion_ms: 60000

Remember

Use for complex multi-step tasks only
Simple tasks are faster with direct tool calls
Agent runs in background (check with READ)
Provide clear task descriptions
Include start_url when known
Kill agent when done to free resources