Marketplace pdf-processing
Extract text and tables from PDF files, fill forms, merge documents. Use when working with PDF files or when the user mentions PDFs, forms, or document extraction.
install
source · Clone the upstream repo
git clone https://github.com/aiskillstore/marketplace
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/aiskillstore/marketplace "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/0xkynz/pdf-processing" ~/.claude/skills/aiskillstore-marketplace-pdf-processing && rm -rf "$T"
manifest:
skills/0xkynz/pdf-processing/SKILL.mdsource content
PDF Processing Skill
This skill provides capabilities for working with PDF documents.
Quick Start
Use pdfplumber to extract text from PDFs:
import pdfplumber with pdfplumber.open("document.pdf") as pdf: text = pdf.pages[0].extract_text()
Capabilities
Text Extraction
- Extract text from single or multiple pages
- Preserve layout and formatting
- Handle multi-column documents
Table Extraction
- Identify and extract tables
- Convert to structured data (CSV, JSON)
- Handle complex table layouts
Form Operations
- Fill PDF forms programmatically
- Extract form field values
- Create fillable forms
Document Operations
- Merge multiple PDFs
- Split PDFs by page
- Rotate pages
- Add watermarks
Best Practices
- Always check if the PDF is encrypted before processing
- Handle OCR cases for scanned documents
- Validate extracted data for accuracy
- Use appropriate libraries (pdfplumber for extraction, PyPDF2 for manipulation)