Claude-scholar defuddle
Extract clean markdown content from web pages using Defuddle CLI, removing clutter and navigation to save tokens. Use instead of WebFetch when the user provides a URL to read or analyze, for online documentation, articles, blog posts, or any standard web page.
install
source · Clone the upstream repo
git clone https://github.com/Galaxy-Dawn/claude-scholar
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/Galaxy-Dawn/claude-scholar "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/defuddle" ~/.claude/skills/galaxy-dawn-claude-scholar-defuddle && rm -rf "$T"
manifest:
skills/defuddle/SKILL.md
source content
Defuddle
Use the Defuddle CLI to extract clean, readable content from web pages. Prefer it over WebFetch for standard web pages: it removes navigation, ads, and clutter, which reduces token usage.
If not installed:
npm install -g defuddle
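The "if not installed" check can be made explicit so the install runs only when needed. The following is a minimal sketch, assuming `defuddle` is expected on `PATH` after a global npm install; `ensure_defuddle` is a hypothetical helper name and not part of Defuddle itself.

```shell
# Sketch: install Defuddle only when the CLI is missing from PATH.
# ensure_defuddle is a hypothetical helper name, not part of Defuddle.
ensure_defuddle() {
  if command -v defuddle >/dev/null 2>&1; then
    # The CLI is already available; nothing to install.
    echo "defuddle already installed"
  else
    # Same install command as in the text above.
    npm install -g defuddle
  fi
}
```

Calling `ensure_defuddle` before the first `defuddle parse` keeps repeated runs from reinstalling the package.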
Usage
Always use --md for markdown output:
defuddle parse <url> --md
Save to file:
defuddle parse <url> --md -o content.md
Extract specific metadata:
defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain
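The commands above can be combined. The sketch below saves a page as markdown in a file named after its domain. `save_page` is a hypothetical helper, and it assumes that `-p domain` prints only the domain value on standard output, which may not hold for every Defuddle version.

```shell
# Sketch: save a page as markdown in a file named after its domain.
# save_page is a hypothetical helper; it assumes `-p domain` prints
# just the domain value on stdout.
save_page() {
  url=$1
  # Look up the page's domain via the metadata flag.
  domain=$(defuddle parse "$url" -p domain) || return 1
  # Refuse to continue if no domain came back.
  [ -n "$domain" ] || return 1
  # Write the cleaned markdown to <domain>.md in the current directory.
  defuddle parse "$url" --md -o "${domain}.md" || return 1
  # Report the file name so callers can pick it up.
  echo "${domain}.md"
}
```

A call such as `save_page <url>` fetches the page twice, once for the metadata and once for the content. That is acceptable for occasional reads but wasteful in a loop.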
Output formats
| Flag | Format |
|---|---|
| `--md` | Markdown (default choice) |
| `--json` | JSON with both HTML and markdown |
| (none) | HTML |
| `-p <name>` | Specific metadata property |