Medical-research-skills literatureimages-interpretation

Interpret figures in academic papers and their captions when the input is a PDF-to-Markdown document with page markers and image links, producing a structured Markdown report for extracting variables, trends, and conclusions.

install

source · Clone the upstream repo

git clone https://github.com/aipoch/medical-research-skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/aipoch/medical-research-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/scientific-skills/Other/literatureimages-interpretation" ~/.claude/skills/aipoch-medical-research-skills-literatureimages-interpretation && rm -rf "$T"

manifest: scientific-skills/Other/literatureimages-interpretation/SKILL.md

source content

Source: https://github.com/aipoch/medical-research-skills

When to Use

You have a paper converted from PDF to Markdown (including
```
## Page XX
```
markers and image links) and need a figure-by-figure interpretation report.
You need to extract key variables, trends, comparisons, and conclusions primarily from charts/plots rather than from the main text.
You want to align images in a
```
*-images/
```
folder with figure numbers (e.g., Figure 1A, Figure 2) using captions and in-text citations.
You need a standardized, UTF-8 Markdown output suitable for downstream summarization, data extraction, or knowledge base ingestion.
You must filter out non-figure images (e.g., scanned body text blocks) and interpret only chart-like content.

Key Features

Parses Markdown image links and locates the corresponding
```
*-images/
```
directory (e.g.,
```
RateSOX2.md
```
→
```
RateSOX2-images/
```
).
Opens every image in the images folder without skipping and classifies each as chart / table / schematic / flowchart / body-text block.
Builds an internal (non-exported) alignment list to map images to figure numbers using captions and body-text citations.
Interprets only chart-type images (and other figure-like visuals when required), excluding body-text blocks.
Produces a single structured “Image Interpretation” Markdown report per input file, saved to
```
outputs/
```
.
Enforces evidence-based interpretation: rely only on captions, body text, and visible image content; do not speculate.

Dependencies

```
pdf-extract
```
(version: not specified) — used only when the source is PDF and must be converted to Markdown first.
Markdown template:
```
assets/figure_interpretation_template.md
```
(version: not specified)
Quality/requirements reference:
```
references/guide.md
```
(version: not specified)

Example Usage

Input layout

skill/
  literatureimages-interpretation/
    inputs/
      RateSOX2.md
    RateSOX2-images/
      image_001.png
      image_002.png
      ...
    assets/
      figure_interpretation_template.md
    references/
      guide.md
    outputs/

Run (conceptual workflow)

If starting from PDF, convert to Markdown first:

pdf-extract RateSOX2.pdf > skill/literatureimages-interpretation/inputs/RateSOX2.md

Ensure the images folder exists and matches the literature name:
- ```
inputs/RateSOX2.md
```
- ```
RateSOX2-images/
```
Execute the interpretation process:
- Read
```
inputs/RateSOX2.md
```
  (captions + in-text citations + image links).
- Open every image in
```
RateSOX2-images/
```
  sequentially.
- Classify images; keep only chart/figure-like items for interpretation.
- Align images to figure numbers (e.g., Figure 1A) when possible; otherwise mark as Unassigned.
- Fill
```
assets/figure_interpretation_template.md
```
  .
- Write exactly one UTF-8 Markdown output to:
  - ```
  outputs/RateSOX2_figure_interpretation.md
```
  (example name; keep it concise)

Output (must be a single Markdown file)

Location:
```
outputs/
```
Content: only the “Image Interpretation” section (do not include the internal image list table)
Encoding: UTF-8

Implementation Details

Input assumptions
- Default input is a PDF-to-Markdown file (
```
.md
```
  ) containing:
  - page markers like
```
## Page XX
```
  - image links
  - captions and surrounding body text
- If only a PDF is provided, convert it to Markdown using
```
pdf-extract
```
  before interpretation.
Image discovery
- Images are typically stored in a folder named
```
*-images/
```
  matching the literature filename.
- Use Markdown image links and/or folder naming to locate the correct images.
Mandatory full pass over images
- Open every image in the
```
*-images/
```
  folder without skipping.
- Classify each image into one of:
  - chart/plot
  - table
  - schematic
  - flowchart
  - body text block (to be excluded from interpretation)
Figure attribution (alignment)
- Use captions and in-text citations to assign figure identifiers (e.g., Figure 2, Fig. 3B).
- If attribution cannot be determined, label the item as Unassigned.
- Maintain an internal alignment list for processing only; do not generate or export any image list file.
Interpretation scope and constraints
- Interpret only chart-type (and other figure-like) images that require analysis; exclude body-text blocks.
- Interpretations must be grounded in:
  - visible content in the image (axes, legends, labels, values, trends)
  - caption text
  - relevant body text citations
- Do not infer beyond the evidence; if information is missing, write “Not specified”.
Output rules
- Use
```
assets/figure_interpretation_template.md
```
  as the structure.
- Output exactly one Markdown file per input document.
- Save to
```
outputs/
```
  with a concise filename (avoid redundant phrases).
- Do not include the internal image list table; output only the final “Image Interpretation” content.
- Ensure UTF-8 encoding to prevent character corruption.
Quality checks
- Follow detailed requirements and checkpoints in
```
references/guide.md
```
  .