Muse markitdown
A Python utility for converting various file formats (PDF, Word, Excel, PowerPoint, etc.) to Markdown. Use this skill to install and use MarkItDown for document conversion.
install
source · Clone the upstream repo
git clone https://github.com/myths-labs/muse
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/myths-labs/muse "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/toolkit/markitdown" ~/.claude/skills/myths-labs-muse-markitdown && rm -rf "$T"
manifest:
skills/toolkit/markitdown/SKILL.mdsource content
MarkItDown
MarkItDown is a lightweight Python utility by Microsoft for converting various files to Markdown for use with LLMs.
Capabilities
Converts the following formats to Markdown:
- PowerPoint (.pptx)
- Word (.docx)
- Excel (.xlsx)
- Images (EXIF & OCR)
- Audio (Transcription)
- HTML
- Text formats (CSV, JSON, XML)
- YouTube URLs
- ZIP files
Installation
To use MarkItDown, you first need to install it. It requires Python 3.10+.
pip install "markitdown[all]"
Usage
Command Line
Convert a file and print to stdout:
markitdown path-to-file.pdf
Convert a file and save to output:
markitdown path-to-file.pdf -o document.md
Pipe content:
cat path-to-file.pdf | markitdown
Python API
from markitdown import MarkItDown md = MarkItDown() result = md.convert("test.xlsx") print(result.text_content)
Microsoft Document Intelligence
For better PDF extraction, use Azure Document Intelligence:
markitdown path-to-file.pdf -o document.md -d -e "<endpoint>"