Claude-scholar defuddle

Extract clean markdown content from web pages using Defuddle CLI, removing clutter and navigation to save tokens. Use instead of WebFetch when the user provides a URL to read or analyze, for online documentation, articles, blog posts, or any standard web page.

install
source · Clone the upstream repo
git clone https://github.com/Galaxy-Dawn/claude-scholar
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/Galaxy-Dawn/claude-scholar "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/defuddle" ~/.claude/skills/galaxy-dawn-claude-scholar-defuddle && rm -rf "$T"
manifest: skills/defuddle/SKILL.md
source content

Defuddle

Use Defuddle CLI to extract clean readable content from web pages. Prefer over WebFetch for standard web pages — it removes navigation, ads, and clutter, reducing token usage.

If not installed:

npm install -g defuddle

Usage

Always use

--md
for markdown output:

defuddle parse <url> --md

Save to file:

defuddle parse <url> --md -o content.md

Extract specific metadata:

defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain

Output formats

FlagFormat
--md
Markdown (default choice)
--json
JSON with both HTML and markdown
(none)HTML
-p <name>
Specific metadata property