Medical-research-skills literature-experiment-extract

Extract experimental models, experimental methods, and biomarker information from paper Markdown (typically produced by PDF-to-Markdown tools) when a user provides paper Markdown and needs a structured, evidence-backed summary (1 Markdown + 3 CSVs).

install

source · Clone the upstream repo

git clone https://github.com/aipoch/medical-research-skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/aipoch/medical-research-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/scientific-skills/Evidence Insight/literature-experiment-extract" ~/.claude/skills/aipoch-medical-research-skills-literature-experiment-extract && rm -rf "$T"

manifest: scientific-skills/Evidence Insight/literature-experiment-extract/SKILL.md

source content

Source: https://github.com/aipoch/medical-research-skills

When to Use

You have a paper converted to Markdown (e.g., via PDF-to-Markdown) and need to extract cell/animal models used in experiments.
You need a structured list of experimental methods/protocols described in the paper, with traceable evidence.
You want to compile biomarkers / detection indicators (e.g., genes, proteins, assays, readouts) reported in the study.
You need standardized outputs for downstream analysis: one Markdown summary plus three CSV tables.
The paper Markdown includes page markers (e.g.,
```
## Page XX
```
) and you want evidence organized by page.

Key Features

Extracts three entity groups from paper Markdown:
- Experimental models (cell lines, animal models, strains, genotypes, etc.)
- Experimental methods (assays, protocols, instruments, conditions)
- Biomarkers / indicators (targets, readouts, measured variables)
Produces evidence-backed results (citations/excerpts preserved and traceable to the source).
Supports page-aware evidence organization when the input includes pagination headers like
```
## Page XX
```
.
Outputs are fixed and standardized:
- 1 Markdown summary
- 3 CSV files: models / methods / biomarkers
Uses a predefined template and extraction rules:
- Requirements and consistency rules:
```
references/guide.md
```
- Output template:
```
assets/template.md
```

Dependencies

None (documentation-driven workflow).
Input assumption: paper content is available as Markdown, typically generated by a PDF-to-Markdown tool.

Example Usage

Input

A paper converted to Markdown, ideally with page headers:

## Page 1
... text describing "C57BL/6 mice" and "Western blot" ...

## Page 2
... text describing "ELISA" and "IL-6 levels" ...

Steps

Open the paper Markdown (typically produced by PDF-to-Markdown tools).
Extract models, methods, and biomarkers page by page.
Follow:
- Extraction rules and evidence requirements:
```
references/guide.md
```
- Output template:
```
assets/template.md
```

Output exactly:

outputs/{Paper Abbreviation}-experiment-summary.md

```
outputs/{Paper Abbreviation}-models.csv
```

outputs/{Paper Abbreviation}-methods.csv

outputs/{Paper Abbreviation}-biomarkers.csv

Output (required)

All final outputs must be UTF-8 encoded.
Output must be produced directly (no confirmation steps or optional branches).
Evidence excerpts must remain in the original language of the source literature.

Implementation Details

Input parsing
- Read the paper Markdown as the sole input source.
- If pagination headers like
```
## Page XX
```
  exist, prioritize attaching evidence to the corresponding page.
Extraction rules
- Apply entity definitions, allowed/expected fields, normalization rules, and evidence formatting as specified in
```
references/guide.md
```
  .
Output formatting
- Generate outputs using
```
assets/template.md
```
  as the canonical structure.
- Add rows as needed while preserving evidence citations/excerpts.
- The output set is fixed: 1 Markdown summary + 3 CSVs (models/methods/biomarkers).

Paths and naming

Default output directory:
```
outputs/
```

Naming:

Markdown:

outputs/{Paper Abbreviation}-experiment-summary.md

CSVs:

```
outputs/{Paper Abbreviation}-models.csv
```

outputs/{Paper Abbreviation}-methods.csv

outputs/{Paper Abbreviation}-biomarkers.csv

Language
- Output language should be Chinese by default (or the user-requested language if specified).
- Evidence excerpts must remain in the original language of the source text.