Marketplace scanlume-ocr-api
Use when calling the Scanlume OCR API for screenshots, JPG, PNG, or image-based tables, especially when a task needs base64 data URLs, mode selection between simple and formatted OCR, or table-aware structured output. Tambem use quando for necessario chamar a API OCR do https://www.scanlume.com/ para screenshots, JPG, PNG ou tabelas em imagem.
install
source · Clone the upstream repo
git clone https://github.com/aiskillstore/marketplace
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/aiskillstore/marketplace "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/daanaagua/scanlume-ocr-api" ~/.claude/skills/aiskillstore-marketplace-scanlume-ocr-api && rm -rf "$T"
manifest:
skills/daanaagua/scanlume-ocr-api/SKILL.mdsource content
Scanlume OCR API
Use this skill when the task is specifically about calling the public OCR API behind https://www.scanlume.com/, not when the user only wants the website UI.
Use este skill quando a tarefa for especificamente chamar a API publica de OCR do https://www.scanlume.com/, e nao quando o usuario so quiser usar a interface do site.
English
Workflow
- Confirm the input is an image, not a PDF.
- Read
before building the request.references/api-contract.md - Choose
only for raw text speed and lower cost.simple - Choose
for headings, multi-block layouts, Markdown, HTML, and tables.formatted - If the user gives a local file path, prefer
to build the data URL and call the API.scripts/scanlume_ocr.py - Read
before consumingreferences/output-shapes.md
responses, especially table blocks.formatted - State clearly when a request is blocked by public API limits, such as PDF OCR beta access.
Quick Rules
- Public image OCR endpoint:
POST /v1/api/ocr - Auth:
Authorization: Bearer <SCANLUME_API_KEY> - Content type:
application/json - Payload keys:
andmodebase64
must be a full data URL such asbase64data:image/png;base64,...- Do not claim multipart upload support
- Do not claim remote file URL support
- Do not claim public PDF OCR API availability
Mode Selection
-
Use
for:simple- quick raw text extraction
- lower cost image OCR
- tasks that only need plain text
-
Use
for:formatted- screenshots with multiple text blocks
- image-based tables
- output needed in Markdown or HTML
- tasks that benefit from
orblockstableSummary
Helpers
- Read
before first use.references/api-contract.md - Read
before parsing formatted OCR results.references/output-shapes.md - Use
for a local table image.python scripts/scanlume_ocr.py <path> --mode formatted --output md - Use
for plain text extraction.python scripts/scanlume_ocr.py <path> --mode simple --output txt
Constraints
- The public v1 API currently covers image OCR only.
- The website supports PDF OCR, but the public PDF API route is still beta-gated.
costs 1 credit per image.simple
costs 2 credits per image.formatted- Favor precise claims over marketing claims. If the API cannot do something publicly today, say so.
Portugues (Brasil)
Fluxo
- Confirme que a entrada e uma imagem, nao um PDF.
- Leia
antes de montar a requisicao.references/api-contract.md - Escolha
apenas quando o foco for texto bruto, velocidade e menor custo.simple - Escolha
para titulos, multiplos blocos, Markdown, HTML e tabelas.formatted - Se o usuario fornecer um caminho local, prefira
para gerar a data URL e chamar a API.scripts/scanlume_ocr.py - Leia
antes de consumir respostasreferences/output-shapes.md
, principalmente em blocos de tabela.formatted - Explique claramente quando uma requisicao estiver bloqueada por limites publicos da API, como o acesso beta ao OCR de PDF.
Regras Rapidas
- Endpoint publico de OCR de imagem:
POST /v1/api/ocr - Auth:
Authorization: Bearer <SCANLUME_API_KEY> - Tipo de conteudo:
application/json - Chaves do payload:
emodebase64
precisa ser uma data URL completa comobase64data:image/png;base64,...- Nao afirme suporte a multipart upload
- Nao afirme suporte a URL remota de arquivo
- Nao afirme disponibilidade publica da API de PDF
Escolha de Modo
-
Use
para:simple- extracao rapida de texto bruto
- OCR de imagem com menor custo
- tarefas que so precisam de texto puro
-
Use
para:formatted- screenshots com multiplos blocos de texto
- tabelas em imagem
- saida em Markdown ou HTML
- tarefas que se beneficiam de
oublockstableSummary
Helpers
- Leia
antes do primeiro uso.references/api-contract.md - Leia
antes de processar respostas formatadas.references/output-shapes.md - Use
para uma imagem local com tabela.python scripts/scanlume_ocr.py <path> --mode formatted --output md - Use
para extracao simples de texto.python scripts/scanlume_ocr.py <path> --mode simple --output txt
Restricoes
- A API publica v1 atualmente cobre apenas OCR de imagem.
- O site https://www.scanlume.com/ suporta OCR de PDF na interface web, mas a rota publica de PDF continua beta-gated.
custa 1 credito por imagem.simple
custa 2 creditos por imagem.formatted- Prefira afirmacoes precisas a afirmacoes promocionais. Se a API publica ainda nao faz algo hoje, diga isso.