Learn-skills.dev convert-url-html-to-md
Convert HTML web pages to Markdown for documentation extraction. Use when fetching documentation from websites, extracting structured content, getting clean main content (clean=true), or getting full page with nav/sidebar to discover URLs (clean=false). Ideal for learning libraries and building context from web sources.
install
source · Clone the upstream repo
git clone https://github.com/NeverSight/learn-skills.dev
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/NeverSight/learn-skills.dev "$T" && mkdir -p ~/.claude/skills && cp -r "$T/data/skills-md/1naichii/ai-code-tools/convert-url-html-to-md" ~/.claude/skills/neversight-learn-skills-dev-convert-url-html-to-md && rm -rf "$T"
manifest:
data/skills-md/1naichii/ai-code-tools/convert-url-html-to-md/SKILL.mdsource content
Convert URL HTML to Markdown
Extract web content as markdown using a two-phase approach for comprehensive documentation gathering.
Two-Phase Workflow
For optimal documentation extraction:
- Discovery (clean=false): Get full page including navigation and sidebars to discover all documentation URLs
- Extraction (clean=true): Extract main content from discovered URLs
Usage
# From the skill directory cd ~/.claude/skills/convert-url-html-to-md # Clean mode - main content only (recommended for docs) node scripts/convert_url.js <url> --clean=true # Full page mode - includes nav/sidebar (for discovering URLs) node scripts/convert_url.js <url> --clean=false # Default is clean=true node scripts/convert_url.js <url>
Examples
# Get all navigation links from a docs site node scripts/convert_url.js https://ui.shadcn.com/docs --clean=false # Extract specific documentation content node scripts/convert_url.js https://ui.shadcn.com/docs/components/radix/aspect-ratio --clean=true
Installation
Dependencies are included. Run once:
cd ~/.claude/skills/convert-url-html-to-md npm install
Output
The script outputs markdown directly to stdout. Redirect to file if needed:
node scripts/convert_url.js <url> --clean=true > output.md
Credits
This skill is based on urltomarkdown by Lee Hanken, licensed under MIT. Modified and adapted as a Claude skill by 1naichii.