Claude-skill-registry-data markdown-tools
Converts documents to markdown (PDFs, Word docs, PowerPoint, Confluence exports) with Windows/WSL path handling. Activates when converting .doc/.docx/PDF/PPTX files to markdown, processing Confluence exports, handling Windows/WSL path conversions, extracting images from PDFs, or working with markitdown utility.
install
source · Clone the upstream repo
git clone https://github.com/majiayu000/claude-skill-registry-data
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry-data "$T" && mkdir -p ~/.claude/skills && cp -r "$T/data/markdown-tools" ~/.claude/skills/majiayu000-claude-skill-registry-data-markdown-tools && rm -rf "$T"
manifest:
data/markdown-tools/SKILL.mdsource content
Markdown Tools
Convert documents to markdown with image extraction and Windows/WSL path handling.
Quick Start
Install markitdown with PDF Support
# IMPORTANT: Use [pdf] extra for PDF support uv tool install "markitdown[pdf]" # Or via pip pip install "markitdown[pdf]"
Basic Conversion
markitdown "document.pdf" -o output.md # Or redirect: markitdown "document.pdf" > output.md
PDF Conversion with Images
markitdown extracts text only. For PDFs with images, use this workflow:
Step 1: Convert Text
markitdown "document.pdf" -o output.md
Step 2: Extract Images
# Create assets directory alongside the markdown mkdir -p assets # Extract images using PyMuPDF uv run --with pymupdf python scripts/extract_pdf_images.py "document.pdf" ./assets
Step 3: Add Image References
Insert image references in the markdown where needed:

Step 4: Format Cleanup
markitdown output often needs manual fixes:
- Add proper heading levels (
,#
,##
)### - Reconstruct tables in markdown format
- Fix broken line breaks
- Restore indentation structure
Path Conversion (Windows/WSL)
# Windows → WSL conversion C:\Users\name\file.pdf → /mnt/c/Users/name/file.pdf # Use helper script python scripts/convert_path.py "C:\Users\name\Documents\file.pdf"
Common Issues
"dependencies needed to read .pdf files"
# Install with PDF support uv tool install "markitdown[pdf]" --force
FontBBox warnings during PDF conversion
- These are harmless font parsing warnings, output is still correct
Images missing from output
- Use
to extract images separatelyscripts/extract_pdf_images.py
Resources
- Extract images from PDF using PyMuPDFscripts/extract_pdf_images.py
- Windows to WSL path converterscripts/convert_path.py
- Detailed examples for batch operationsreferences/conversion-examples.md