Medical-research-skills literatureimages-interpretation

Interpret figures in academic papers and their captions when the input is a PDF-to-Markdown document with page markers and image links, producing a structured Markdown report for extracting variables, trends, and conclusions.

install
source · Clone the upstream repo
git clone https://github.com/aipoch/medical-research-skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/aipoch/medical-research-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/scientific-skills/Other/literatureimages-interpretation" ~/.claude/skills/aipoch-medical-research-skills-literatureimages-interpretation && rm -rf "$T"
manifest: scientific-skills/Other/literatureimages-interpretation/SKILL.md
source content

Source: https://github.com/aipoch/medical-research-skills

When to Use

  • You have a paper converted from PDF to Markdown (including
    ## Page XX
    markers and image links) and need a figure-by-figure interpretation report.
  • You need to extract key variables, trends, comparisons, and conclusions primarily from charts/plots rather than from the main text.
  • You want to align images in a
    *-images/
    folder with figure numbers (e.g., Figure 1A, Figure 2) using captions and in-text citations.
  • You need a standardized, UTF-8 Markdown output suitable for downstream summarization, data extraction, or knowledge base ingestion.
  • You must filter out non-figure images (e.g., scanned body text blocks) and interpret only chart-like content.

Key Features

  • Parses Markdown image links and locates the corresponding
    *-images/
    directory (e.g.,
    RateSOX2.md
    RateSOX2-images/
    ).
  • Opens every image in the images folder without skipping and classifies each as chart / table / schematic / flowchart / body-text block.
  • Builds an internal (non-exported) alignment list to map images to figure numbers using captions and body-text citations.
  • Interprets only chart-type images (and other figure-like visuals when required), excluding body-text blocks.
  • Produces a single structured “Image Interpretation” Markdown report per input file, saved to
    outputs/
    .
  • Enforces evidence-based interpretation: rely only on captions, body text, and visible image content; do not speculate.

Dependencies

  • pdf-extract
    (version: not specified) — used only when the source is PDF and must be converted to Markdown first.
  • Markdown template:
    assets/figure_interpretation_template.md
    (version: not specified)
  • Quality/requirements reference:
    references/guide.md
    (version: not specified)

Example Usage

Input layout

skill/
  literatureimages-interpretation/
    inputs/
      RateSOX2.md
    RateSOX2-images/
      image_001.png
      image_002.png
      ...
    assets/
      figure_interpretation_template.md
    references/
      guide.md
    outputs/

Run (conceptual workflow)

  1. If starting from PDF, convert to Markdown first:

    pdf-extract RateSOX2.pdf > skill/literatureimages-interpretation/inputs/RateSOX2.md
    
  2. Ensure the images folder exists and matches the literature name:

    • inputs/RateSOX2.md
    • RateSOX2-images/
  3. Execute the interpretation process:

    • Read
      inputs/RateSOX2.md
      (captions + in-text citations + image links).
    • Open every image in
      RateSOX2-images/
      sequentially.
    • Classify images; keep only chart/figure-like items for interpretation.
    • Align images to figure numbers (e.g., Figure 1A) when possible; otherwise mark as Unassigned.
    • Fill
      assets/figure_interpretation_template.md
      .
    • Write exactly one UTF-8 Markdown output to:
      • outputs/RateSOX2_figure_interpretation.md
        (example name; keep it concise)

Output (must be a single Markdown file)

  • Location:
    outputs/
  • Content: only the “Image Interpretation” section (do not include the internal image list table)
  • Encoding: UTF-8

Implementation Details

  • Input assumptions

    • Default input is a PDF-to-Markdown file (
      .md
      ) containing:
      • page markers like
        ## Page XX
      • image links
      • captions and surrounding body text
    • If only a PDF is provided, convert it to Markdown using
      pdf-extract
      before interpretation.
  • Image discovery

    • Images are typically stored in a folder named
      *-images/
      matching the literature filename.
    • Use Markdown image links and/or folder naming to locate the correct images.
  • Mandatory full pass over images

    • Open every image in the
      *-images/
      folder without skipping.
    • Classify each image into one of:
      • chart/plot
      • table
      • schematic
      • flowchart
      • body text block (to be excluded from interpretation)
  • Figure attribution (alignment)

    • Use captions and in-text citations to assign figure identifiers (e.g., Figure 2, Fig. 3B).
    • If attribution cannot be determined, label the item as Unassigned.
    • Maintain an internal alignment list for processing only; do not generate or export any image list file.
  • Interpretation scope and constraints

    • Interpret only chart-type (and other figure-like) images that require analysis; exclude body-text blocks.
    • Interpretations must be grounded in:
      • visible content in the image (axes, legends, labels, values, trends)
      • caption text
      • relevant body text citations
    • Do not infer beyond the evidence; if information is missing, write “Not specified”.
  • Output rules

    • Use
      assets/figure_interpretation_template.md
      as the structure.
    • Output exactly one Markdown file per input document.
    • Save to
      outputs/
      with a concise filename (avoid redundant phrases).
    • Do not include the internal image list table; output only the final “Image Interpretation” content.
    • Ensure UTF-8 encoding to prevent character corruption.
  • Quality checks

    • Follow detailed requirements and checkpoints in
      references/guide.md
      .