Awesome-omni-skill clawpage
Extract and structure any web page into clean JSON. Returns text, tables, prices, contacts, hours, ratings, dates, links, and images. Handles JavaScript-rendered SPAs and bot-blocked sites. Free tier: 10 extractions/day with API key. Also supports x402 micropayments and prepaid USDC.
install
source · Clone the upstream repo
git clone https://github.com/diegosouzapw/awesome-omni-skill
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/diegosouzapw/awesome-omni-skill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/development/clawpage" ~/.claude/skills/diegosouzapw-awesome-omni-skill-clawpage && rm -rf "$T"
manifest:
skills/development/clawpage/SKILL.mdsource content
ClawPage
Web extraction API for agents. Converts any URL into structured JSON.
Base URL:
https://api.clawpage.xyz
When to Use This Skill
Use ClawPage when you need to:
- Extract structured data from a web page (prices, contacts, hours, ratings)
- Get clean text from a page that blocks bots or requires JavaScript
- Parse HTML tables into structured JSON
- Get metadata (title, description, language) from any URL
Quick Start
1. Register (Free)
curl -X POST https://api.clawpage.xyz/register \ -H "Content-Type: application/json" \ -d '{"email": "you@example.com"}'
Response:
{"api_key": "cpk_...", "free_extractions": 10}
2. Extract Any URL
curl -X POST https://api.clawpage.xyz/extract \ -H "Content-Type: application/json" \ -H "X-API-Key: cpk_..." \ -d '{"url": "https://example.com", "sync": true}'
3. Get Structured JSON
{ "status": "done", "data": { "url": "https://example.com", "title": "Example Domain", "description": "...", "language": "en", "content": { "main_text": "...", "sections": [{"heading": "...", "text": "..."}] }, "structured": { "tables": [{"headers": [...], "rows": [[...]]}], "prices": [{"item": "...", "amount": 9.99, "currency": "USD"}], "contacts": {"phones": [...], "emails": [...], "addresses": [...]}, "hours": [{"day": "Monday", "open": "09:00", "close": "17:00"}], "ratings": {"score": 4.5, "max": 5, "count": 120, "source": "Google"}, "dates": [{"label": "Event date", "iso": "2026-03-15"}], "links": [{"text": "...", "url": "..."}], "images": [{"alt": "...", "url": "..."}], "metadata": {} }, "fetched_at": "2026-02-06T08:00:00Z", "cache_ttl_seconds": 86400, "render_method": "raw_fetch" } }
Authentication
Three options (choose one):
API Key (Recommended)
with email to get API keyPOST /register- Pass
header with every requestX-API-Key - 10 free extractions per day (rolling 24h recharge)
- Need more? Deposit USDC via
POST /account/deposit
x402 Micropayments (No Account Needed)
- Send request without auth → get 402 with payment details
- Send 0.01 USDC on Base to the recipient address
- Retry with
headerX-Payment-Proof: <tx_hash>
Cache Hits (Always Free)
- If another agent already extracted the URL in the last 24h, you get it free
- No auth required for cache hits
Modes
Sync Mode (Recommended)
Pass
"sync": true to wait for the result inline.
Async Mode
Omit
sync or set "sync": false. You get a job ID to poll:
{"status": "processing", "job_id": "abc123", "poll_url": "/result/abc123", "estimated_seconds": 15}
Poll:
GET /result/abc123
What Gets Extracted
| Field | Description |
|---|---|
| Clean page text |
| Text split by headings |
| HTML tables (headers + rows) |
| Products/services with amounts and currency |
| Phone numbers, emails, physical addresses |
| Business hours by day |
| Review scores, counts, source |
| Dates with labels in ISO format |
| Up to 50 hyperlinks |
| Up to 20 images with alt text |
| Page title |
| Meta description |
| Page language |
Endpoints
| Method | Path | Auth | Description |
|---|---|---|---|
| | None | Create account, get API key |
| | API Key | Account info (balance, free remaining, wallets) |
| | API Key | Associate wallet address |
| | API Key | Credit USDC from on-chain tx |
| | API Key or x402 | Extract and structure a URL |
| | None | Poll for async results |
| | None | OpenAPI specification |
| | None | Agent discovery manifest |
| | None | Health check |
Pricing
- Free tier: 10 extractions per day (rolling 24h recharge)
- Prepaid: Deposit USDC, deducts $0.01 per extraction
- x402: 0.01 USDC per URL on Base (no account needed)
- Cache hits: Always free (24h TTL)