Awesome-omni-skill md-fetch
Fetch and read markdown files from the internet with support for partial content extraction and intelligent section ranking. Use when Claude needs to retrieve markdown documentation from URLs and extract specific portions by line numbers, section headers, search patterns, or query-based relevance ranking (BM25). Triggers include requests to fetch/download/read markdown from URLs, extract specific sections from online docs, find relevant sections by query, or get partial content from remote markdown files.
git clone https://github.com/diegosouzapw/awesome-omni-skill
T=$(mktemp -d) && git clone --depth=1 https://github.com/diegosouzapw/awesome-omni-skill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/documentation/md-fetch" ~/.claude/skills/diegosouzapw-awesome-omni-skill-md-fetch && rm -rf "$T"
skills/documentation/md-fetch/SKILL.mdMarkdown Fetch Skill
Efficiently fetch and extract partial content from markdown files on the internet with intelligent section ranking.
Quick Start
Fetch Full File
scripts/fetch_md.sh <url> [output_file]
Examples:
# Print to stdout scripts/fetch_md.sh https://bun.com/docs/runtime/sql.md # Save to file scripts/fetch_md.sh https://bun.com/docs/runtime/sql.md output.md
Structured Retrieval (Recommended)
Use BM25 ranking to find most relevant sections by query:
scripts/fetch_md_structured.py <url> <query> [OPTIONS]
Examples:
# Find sections about "connection pooling" scripts/fetch_md_structured.py https://bun.com/docs/runtime/sql.md "connection pooling" # Get top 5 sections about "authentication" with scores scripts/fetch_md_structured.py https://example.com/docs.md "authentication" --top 5 --show-scores # Save results to file scripts/fetch_md_structured.py https://example.com/docs.md "query hooks" --output result.md
When to use:
- Finding relevant information without knowing exact section names
- Exploring documentation by topic or concept
- Getting the most relevant sections ranked by query intent
Fetch Partial Content
scripts/fetch_md_partial.sh <url> [OPTIONS]
Partial Reading Methods
By Line Numbers
Extract specific line ranges:
# Get lines 10-50 scripts/fetch_md_partial.sh https://bun.com/docs/runtime/sql.md --lines 10:50 # Get first 100 lines scripts/fetch_md_partial.sh https://example.com/docs.md --lines 1:100
By Section Headers
Extract specific markdown sections by header:
# Get "Installation" section scripts/fetch_md_partial.sh https://ai-sdk.dev/docs/getting-started/expo.md --section "## Installation" # Get "API Reference" section scripts/fetch_md_partial.sh https://example.com/docs.md --section "# API Reference"
Note: Section extraction includes the header and all content until the next same-level header.
By Search Pattern
Find and extract lines matching a pattern:
# Search for "query" keyword scripts/fetch_md_partial.sh https://raw.githubusercontent.com/tanstack/query/main/docs/framework/react/overview.md --pattern "query" # Search with context (3 lines before/after) scripts/fetch_md_partial.sh https://example.com/docs.md --pattern "API" --context 3
Saving Output
Add
--output to save results to a file:
scripts/fetch_md_partial.sh https://example.com/docs.md --lines 1:100 --output result.md scripts/fetch_md_partial.sh https://example.com/docs.md --section "## Setup" --output setup.md
Common Use Cases
Find relevant sections by topic (Structured Retrieval):
# Most relevant approach - uses BM25 ranking scripts/fetch_md_structured.py <url> "your search query" --top 3
Preview documentation before full read:
# Get first 50 lines to understand structure scripts/fetch_md_partial.sh <url> --lines 1:50
Extract specific documentation sections:
# Get installation instructions only scripts/fetch_md_partial.sh <url> --section "## Installation"
Search for specific topics:
# Find all mentions of "authentication" with context scripts/fetch_md_partial.sh <url> --pattern "authentication" --context 5
Structured Retrieval Details
The
fetch_md_structured.py script uses BM25 ranking to intelligently rank sections by relevance:
How it works:
- Extracts all markdown sections by headers
- Tokenizes query and section content
- Calculates BM25 scores with term frequency saturation
- Boosts sections with query terms in headers (2x weight)
- Returns top N most relevant sections
Parameters:
: Term frequency saturation (higher = more weight to term frequency)k1=1.2
: Length normalization (higher = more penalty for long documents)b=0.75
Benefits:
- No need to know exact section names
- Finds relevant content even with different terminology
- Ranks by semantic relevance, not just keyword matching
- In-memory processing (no persistent index needed)
Supported URL Types
- Direct markdown URLs:
https://example.com/docs.md - GitHub raw URLs:
https://raw.githubusercontent.com/user/repo/main/README.md - Documentation sites:
https://bun.com/docs/runtime/sql.md - Any publicly accessible markdown file over HTTP/HTTPS
Error Handling
Scripts use
set -e and will exit on errors. Common issues:
- 404 Not Found: URL is incorrect or file doesn't exist
- Network errors: Check internet connection
- Invalid line range: Ensure START:END format is correct
- Section not found: Check header format matches exactly (including
symbols)#