Medical-research-skills literatureimages-interpretation
Interpret figures in academic papers and their captions when the input is a PDF-to-Markdown document with page markers and image links, producing a structured Markdown report for extracting variables, trends, and conclusions.
install
source · Clone the upstream repo
git clone https://github.com/aipoch/medical-research-skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/aipoch/medical-research-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/scientific-skills/Other/literatureimages-interpretation" ~/.claude/skills/aipoch-medical-research-skills-literatureimages-interpretation && rm -rf "$T"
manifest:
scientific-skills/Other/literatureimages-interpretation/SKILL.mdsource content
When to Use
- You have a paper converted from PDF to Markdown (including
markers and image links) and need a figure-by-figure interpretation report.## Page XX - You need to extract key variables, trends, comparisons, and conclusions primarily from charts/plots rather than from the main text.
- You want to align images in a
folder with figure numbers (e.g., Figure 1A, Figure 2) using captions and in-text citations.*-images/ - You need a standardized, UTF-8 Markdown output suitable for downstream summarization, data extraction, or knowledge base ingestion.
- You must filter out non-figure images (e.g., scanned body text blocks) and interpret only chart-like content.
Key Features
- Parses Markdown image links and locates the corresponding
directory (e.g.,*-images/
→RateSOX2.md
).RateSOX2-images/ - Opens every image in the images folder without skipping and classifies each as chart / table / schematic / flowchart / body-text block.
- Builds an internal (non-exported) alignment list to map images to figure numbers using captions and body-text citations.
- Interprets only chart-type images (and other figure-like visuals when required), excluding body-text blocks.
- Produces a single structured “Image Interpretation” Markdown report per input file, saved to
.outputs/ - Enforces evidence-based interpretation: rely only on captions, body text, and visible image content; do not speculate.
Dependencies
(version: not specified) — used only when the source is PDF and must be converted to Markdown first.pdf-extract- Markdown template:
(version: not specified)assets/figure_interpretation_template.md - Quality/requirements reference:
(version: not specified)references/guide.md
Example Usage
Input layout
skill/ literatureimages-interpretation/ inputs/ RateSOX2.md RateSOX2-images/ image_001.png image_002.png ... assets/ figure_interpretation_template.md references/ guide.md outputs/
Run (conceptual workflow)
-
If starting from PDF, convert to Markdown first:
pdf-extract RateSOX2.pdf > skill/literatureimages-interpretation/inputs/RateSOX2.md -
Ensure the images folder exists and matches the literature name:
inputs/RateSOX2.mdRateSOX2-images/
-
Execute the interpretation process:
- Read
(captions + in-text citations + image links).inputs/RateSOX2.md - Open every image in
sequentially.RateSOX2-images/ - Classify images; keep only chart/figure-like items for interpretation.
- Align images to figure numbers (e.g., Figure 1A) when possible; otherwise mark as Unassigned.
- Fill
.assets/figure_interpretation_template.md - Write exactly one UTF-8 Markdown output to:
(example name; keep it concise)outputs/RateSOX2_figure_interpretation.md
- Read
Output (must be a single Markdown file)
- Location:
outputs/ - Content: only the “Image Interpretation” section (do not include the internal image list table)
- Encoding: UTF-8
Implementation Details
-
Input assumptions
- Default input is a PDF-to-Markdown file (
) containing:.md- page markers like
## Page XX - image links
- captions and surrounding body text
- page markers like
- If only a PDF is provided, convert it to Markdown using
before interpretation.pdf-extract
- Default input is a PDF-to-Markdown file (
-
Image discovery
- Images are typically stored in a folder named
matching the literature filename.*-images/ - Use Markdown image links and/or folder naming to locate the correct images.
- Images are typically stored in a folder named
-
Mandatory full pass over images
- Open every image in the
folder without skipping.*-images/ - Classify each image into one of:
- chart/plot
- table
- schematic
- flowchart
- body text block (to be excluded from interpretation)
- Open every image in the
-
Figure attribution (alignment)
- Use captions and in-text citations to assign figure identifiers (e.g., Figure 2, Fig. 3B).
- If attribution cannot be determined, label the item as Unassigned.
- Maintain an internal alignment list for processing only; do not generate or export any image list file.
-
Interpretation scope and constraints
- Interpret only chart-type (and other figure-like) images that require analysis; exclude body-text blocks.
- Interpretations must be grounded in:
- visible content in the image (axes, legends, labels, values, trends)
- caption text
- relevant body text citations
- Do not infer beyond the evidence; if information is missing, write “Not specified”.
-
Output rules
- Use
as the structure.assets/figure_interpretation_template.md - Output exactly one Markdown file per input document.
- Save to
with a concise filename (avoid redundant phrases).outputs/ - Do not include the internal image list table; output only the final “Image Interpretation” content.
- Ensure UTF-8 encoding to prevent character corruption.
- Use
-
Quality checks
- Follow detailed requirements and checkpoints in
.references/guide.md
- Follow detailed requirements and checkpoints in