Claude-code-templates scrape
Scrape any webpage as clean markdown via Bright Data Web Unlocker API. Bypasses bot detection and CAPTCHA. Requires BRIGHTDATA_API_KEY and BRIGHTDATA_UNLOCKER_ZONE environment variables.
install
source · Clone the upstream repo
git clone https://github.com/davila7/claude-code-templates
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/davila7/claude-code-templates "$T" && mkdir -p ~/.claude/skills && cp -r "$T/cli-tool/components/skills/web-data/scrape" ~/.claude/skills/davila7-claude-code-templates-scrape && rm -rf "$T"
manifest:
cli-tool/components/skills/web-data/scrape/SKILL.mdsource content
Bright Data - Web Scraper
Scrape any webpage and get clean markdown content using Bright Data's Web Unlocker API. Automatically bypasses bot detection and CAPTCHA.
Setup
1. Get your API Key: Get a key from Bright Data Dashboard.
2. Create a Web Unlocker zone: Create a zone at brightdata.com/cp by clicking "Add" (top-right), selecting "Unlocker zone".
3. Set environment variables:
export BRIGHTDATA_API_KEY="your-api-key" export BRIGHTDATA_UNLOCKER_ZONE="your-zone-name"
Usage
bash scripts/scrape.sh "url"
Parameters:
(required): The webpage URL to scrapeurl
Examples:
# Scrape a news article bash scripts/scrape.sh "https://example.com/article" # Scrape a product page bash scripts/scrape.sh "https://shop.example.com/product/123"
Output Format
Returns clean markdown content extracted from the webpage:
# Page Title Main content of the page converted to markdown format... ## Section Heading More content...
Features
- Bot Detection Bypass: Automatically handles anti-bot measures
- CAPTCHA Solving: Bypasses CAPTCHA challenges
- Clean Markdown: Returns well-formatted markdown content
- JavaScript Rendering: Handles JavaScript-heavy pages
Dependencies
- For API requestscurl