Medical-research-skills literature-experiment-extract
Extracts experimental models, experimental methods, and biomarker information from a paper's Markdown (typically produced by a PDF-to-Markdown tool). Use it when a user provides paper Markdown and needs a structured, evidence-backed summary (1 Markdown file + 3 CSVs).
install
source · Clone the upstream repo
git clone https://github.com/aipoch/medical-research-skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/aipoch/medical-research-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/scientific-skills/Evidence Insight/literature-experiment-extract" ~/.claude/skills/aipoch-medical-research-skills-literature-experiment-extract && rm -rf "$T"
manifest: scientific-skills/Evidence Insight/literature-experiment-extract/SKILL.md
When to Use
- You have a paper converted to Markdown (e.g., via PDF-to-Markdown) and need to extract cell/animal models used in experiments.
- You need a structured list of experimental methods/protocols described in the paper, with traceable evidence.
- You want to compile biomarkers / detection indicators (e.g., genes, proteins, assays, readouts) reported in the study.
- You need standardized outputs for downstream analysis: one Markdown summary plus three CSV tables.
- The paper Markdown includes page markers (e.g., `## Page XX`) and you want evidence organized by page.
Key Features
- Extracts three entity groups from paper Markdown:
  - Experimental models (cell lines, animal models, strains, genotypes, etc.)
  - Experimental methods (assays, protocols, instruments, conditions)
  - Biomarkers / indicators (targets, readouts, measured variables)
- Produces evidence-backed results (citations/excerpts preserved and traceable to the source).
- Supports page-aware evidence organization when the input includes pagination headers like `## Page XX`.
- Outputs are fixed and standardized:
  - 1 Markdown summary
  - 3 CSV files: models / methods / biomarkers
- Uses a predefined template and extraction rules:
  - Requirements and consistency rules: `references/guide.md`
  - Output template: `assets/template.md`
Dependencies
- None (documentation-driven workflow).
- Input assumption: paper content is available as Markdown, typically generated by a PDF-to-Markdown tool.
Example Usage
Input
A paper converted to Markdown, ideally with page headers:
    ## Page 1
    ... text describing "C57BL/6 mice" and "Western blot" ...
    ## Page 2
    ... text describing "ELISA" and "IL-6 levels" ...
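For readers who want to see what page-aware input handling might look like, here is a minimal Python sketch that splits paper Markdown on `## Page N` headers. The skill itself is documentation-driven and ships no such code; the function name `split_pages`, the regular expression, and the convention of filing pre-header text under page 0 are all assumptions made for illustration.

```python
import re

# Assumed header shape, taken from the example input: "## Page <number>".
PAGE_HEADER = re.compile(r"^## Page (\d+)[ \t]*$", re.MULTILINE)


def split_pages(markdown: str) -> dict[int, str]:
    """Return {page_number: page_text}.

    Text before the first header (or the whole document when no header
    exists) is stored under page 0, a convention chosen for this sketch.
    """
    matches = list(PAGE_HEADER.finditer(markdown))
    if not matches:
        return {0: markdown.strip()}

    pages: dict[int, str] = {}
    preamble = markdown[: matches[0].start()].strip()
    if preamble:
        pages[0] = preamble

    for i, match in enumerate(matches):
        # A page runs from the end of its header to the start of the next one.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(markdown)
        pages[int(match.group(1))] = markdown[match.end():end].strip()
    return pages


sample = """## Page 1
... text describing "C57BL/6 mice" and "Western blot" ...
## Page 2
... text describing "ELISA" and "IL-6 levels" ...
"""
pages = split_pages(sample)
```

With the sample above, `pages` holds one entry per page header, so extracted models, methods, and biomarkers can be tagged with the page they came from.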
Steps
- Open the paper Markdown (typically produced by PDF-to-Markdown tools).
- Extract models, methods, and biomarkers page by page.
- Follow:
  - Extraction rules and evidence requirements: `references/guide.md`
  - Output template: `assets/template.md`
- Output exactly:
  - `outputs/{Paper Abbreviation}-experiment-summary.md`
  - `outputs/{Paper Abbreviation}-models.csv`
  - `outputs/{Paper Abbreviation}-methods.csv`
  - `outputs/{Paper Abbreviation}-biomarkers.csv`
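The naming scheme for the four output files can be sketched as a small helper. This is only an illustration of the pattern listed above: the function name `output_paths` and the abbreviation `Demo` are hypothetical, and the skill does not prescribe how the paper abbreviation is derived.

```python
from pathlib import Path


def output_paths(paper_abbreviation: str, out_dir: str = "outputs") -> dict[str, Path]:
    """Build the fixed set of output paths: 1 Markdown summary + 3 CSVs."""
    base = Path(out_dir)
    return {
        "summary": base / f"{paper_abbreviation}-experiment-summary.md",
        "models": base / f"{paper_abbreviation}-models.csv",
        "methods": base / f"{paper_abbreviation}-methods.csv",
        "biomarkers": base / f"{paper_abbreviation}-biomarkers.csv",
    }


# "Demo" is a placeholder abbreviation used only for this example.
paths = output_paths("Demo")
```

Keeping the four names in one place makes it easy to confirm that a run produced exactly the expected set and nothing else.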
Output (required)
- All final outputs must be UTF-8 encoded.
- Output must be produced directly (no confirmation steps or optional branches).
- Evidence excerpts must remain in the original language of the source literature.
Implementation Details
- Input parsing
  - Read the paper Markdown as the sole input source.
  - If pagination headers like `## Page XX` exist, prioritize attaching evidence to the corresponding page.
- Extraction rules
  - Apply entity definitions, allowed/expected fields, normalization rules, and evidence formatting as specified in `references/guide.md`.
- Output formatting
  - Generate outputs using `assets/template.md` as the canonical structure.
  - Add rows as needed while preserving evidence citations/excerpts.
  - The output set is fixed: 1 Markdown summary + 3 CSVs (models/methods/biomarkers).
- Paths and naming
  - Default output directory: `outputs/`
  - Naming:
    - Markdown: `outputs/{Paper Abbreviation}-experiment-summary.md`
    - CSVs:
      - `outputs/{Paper Abbreviation}-models.csv`
      - `outputs/{Paper Abbreviation}-methods.csv`
      - `outputs/{Paper Abbreviation}-biomarkers.csv`
- Language
  - Output language should be Chinese by default (or the user-requested language if specified).
  - Evidence excerpts must remain in the original language of the source text.
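One way to picture "attaching evidence to the corresponding page" is a lookup that finds which page contains a given excerpt verbatim. The following Python sketch is an assumption about how such a check could work, not part of the skill: the helper `locate_evidence`, the whitespace normalization, and the inline `pages` mapping are all illustrative.

```python
from typing import Optional


def locate_evidence(pages: dict[int, str], excerpt: str) -> Optional[int]:
    """Return the first page whose text contains the excerpt, else None.

    Runs of whitespace are collapsed on both sides so that line wraps
    introduced by PDF-to-Markdown conversion do not hide a match.
    """
    needle = " ".join(excerpt.split())
    if not needle:
        return None
    for number in sorted(pages):
        haystack = " ".join(pages[number].split())
        if needle in haystack:
            return number
    return None


# Sample page texts mirroring the example input; real input would come
# from the paper Markdown.
pages = {
    1: '... text describing "C57BL/6 mice" and\n"Western blot" ...',
    2: '... text describing "ELISA" and "IL-6 levels" ...',
}

page_for_model = locate_evidence(pages, '"C57BL/6 mice" and "Western blot"')
page_for_marker = locate_evidence(pages, "IL-6 levels")
missing = locate_evidence(pages, "flow cytometry")
```

An excerpt that resolves to `None` is a useful warning sign: it may have been paraphrased or translated rather than quoted from the source, which the evidence rules above do not allow.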