Claude-scholar defuddle

Extract clean markdown content from web pages using Defuddle CLI, removing clutter and navigation to save tokens. Use instead of WebFetch when the user provides a URL to read or analyze, for online documentation, articles, blog posts, or any standard web page.

install

source · Clone the upstream repo

git clone https://github.com/Galaxy-Dawn/claude-scholar

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/Galaxy-Dawn/claude-scholar "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/defuddle" ~/.claude/skills/galaxy-dawn-claude-scholar-defuddle && rm -rf "$T"

manifest: skills/defuddle/SKILL.md

source content

Defuddle

Use Defuddle CLI to extract clean readable content from web pages. Prefer over WebFetch for standard web pages — it removes navigation, ads, and clutter, reducing token usage.

If not installed:

npm install -g defuddle

Usage

Always use

--md

for markdown output:

defuddle parse <url> --md

Save to file:

defuddle parse <url> --md -o content.md

Extract specific metadata:

defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain

Output formats

Flag	Format
`--md`	Markdown (default choice)
`--json`	JSON with both HTML and markdown
(none)	HTML
`-p <name>`	Specific metadata property