Claude-skill-registry kramme:markdown-converter
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
install
source · Clone the upstream repo
git clone https://github.com/majiayu000/claude-skill-registry
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/kramme-markdown-converter" ~/.claude/skills/majiayu000-claude-skill-registry-kramme-markdown-converter && rm -rf "$T"
manifest:
skills/data/kramme-markdown-converter/SKILL.mdsource content
Markdown Converter
Convert files to Markdown using
uvx markitdown — no installation required.
Basic Usage
# Convert to stdout uvx markitdown input.pdf # Save to file uvx markitdown input.pdf -o output.md uvx markitdown input.docx > output.md # From stdin cat input.pdf | uvx markitdown # From stdin with file type hint cat input.pdf | uvx markitdown -x .pdf > output.md
Supported Formats
- Documents: PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls)
- Web/Data: HTML, CSV, JSON, XML
- Media: Images (EXIF + OCR), Audio (EXIF + transcription)
- Other: ZIP (iterates contents), YouTube URLs, EPub
Options
-o OUTPUT # Output file -x EXTENSION # Hint file extension (for stdin) -m MIME_TYPE # Hint MIME type -c CHARSET # Hint charset (e.g., UTF-8) -d # Use Azure Document Intelligence -e ENDPOINT # Document Intelligence endpoint --use-plugins # Enable 3rd-party plugins --list-plugins # Show installed plugins
Examples
# Convert Word document uvx markitdown report.docx -o report.md # Convert Excel spreadsheet uvx markitdown data.xlsx > data.md # Convert PowerPoint presentation uvx markitdown slides.pptx -o slides.md # Use Azure Document Intelligence for better PDF extraction uvx markitdown scan.pdf -d -e "https://your-resource.cognitiveservices.azure.com/"
Notes
- Output preserves document structure: headings, tables, lists, links
- First run caches dependencies; subsequent runs are faster
- For complex PDFs with poor extraction, use
with Azure Document Intelligence-d - When piping via stdin, prefer
(and optionally-x
/-m
) for better detection-c