ClawBio wes-clinical-report-en
git clone https://github.com/ClawBio/ClawBio
T=$(mktemp -d) && git clone --depth=1 https://github.com/ClawBio/ClawBio "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/wes-clinical-report-en" ~/.claude/skills/clawbio-clawbio-wes-clinical-report-en && rm -rf "$T"
skills/wes-clinical-report-en/SKILL.mdWES Clinical Report (English)
Skill for generating professional clinical PDF reports in English from whole exome sequencing (WES) data. Designed for Novogene WES data (GATK + ANNOVAR pipeline) but adaptable to any WES pipeline with equivalent annotations.
Trigger
Fire this skill when the user says any of:
- "generate WES clinical report in English"
- "English exome PDF report"
- "WES report PDF"
- "clinical report from exome data"
- "Novogene report English"
- "exome clinical PDF"
Do NOT fire when:
- User asks for a Spanish report (use
)wes-clinical-report-es - User asks for variant annotation only (use
)variant-annotation - User asks for ACMG classification only (use
)clinical-variant-reporter
Scope
One skill, one task: convert WES markdown reports into professional English-language clinical PDFs with interpretation.
Workflow
- Parse WES markdown report (structured sections 1-7)
- Extract KPI metrics from Exome Summary
- Extract pathogenic variants, PGx alerts, rare damaging variants
- Build interpretive summary paragraph
- Render all sections as styled PDF with clinical tables
- Add ancestry estimation (section 8) if data available
- Add limitations section (section 9)
- Add disclaimer and report metadata
- Output PDF to specified directory
Capabilities
- Clinical interpretation summary: key findings, high-risk PGx alerts, prioritised rare variants, clinical follow-up recommendations.
- Clinically significant variants: ClinVar P/LP, ACMG SF v3.2, cancer predisposition panel, conflicting variants.
- Pharmacogenomics: CPIC star alleles, clinical effects, affected medications with contextualised high-risk alerts.
- Fitness and nutrition traits: genotypes with evidence grades (Corpas et al. 2021).
- Rare damaging variant prioritisation: REVEL, CADD, gnomAD AF.
- Disease and pathway context: OMIM, GWAS, COSMIC, KEGG.
- Institutional logos: configurable left/right logos on cover and header.
Example Output
Page 1 (cover): [Logo Left] [Logo Right] +---------------------------------------------+ | Whole Exome Sequencing Report [SampleN] | | Platform / Reference / Date | +---------------------------------------------+ [KPIs: Total SNPs | Missense | Stopgain | Rare Damaging | ClinVar] Results Interpretation (auto-generated clinical summary paragraph) Pages 2+: 1. Exome Summary 2. Clinically Significant Variants 3. Pharmacogenomics 4. Fitness and Nutrition Traits 5. Prioritised Rare Damaging Variants 6. Disease and Pathway Context 7. Methods 8. Ancestry Estimation 9. Limitations [Disclaimer]
Usage
# Generate reports for all samples python skills/wes-clinical-report-en/wes_clinical_report_en.py \ --report-dir /path/to/REPORTS/ \ --output-dir /path/to/PDF-EN/ \ --logo-left /path/to/logo_left.jpg \ --logo-right /path/to/logo_right.jpg # Generate report for a single sample python skills/wes-clinical-report-en/wes_clinical_report_en.py \ --report-dir /path/to/REPORTS/ \ --output-dir /path/to/PDF-EN/ \ --samples Sample3 # Demo with default Novogene data python skills/wes-clinical-report-en/wes_clinical_report_en.py --demo
Input format
The skill consumes WES reports in markdown format generated by the analysis pipeline (scripts 02-12 in
ANALYSIS/SCRIPTS/). Each markdown
report must follow this structure:
# Whole Exome Sequencing Report: SampleN > **Project** ... | **Platform** ... | ... ## 1. Exome Summary ## 2. Clinically Significant Variants ## 3. Pharmacogenomics ## 4. Fitness and Nutrition Traits ## 5. Prioritised Rare Damaging Variants ## 6. Disease and Pathway Context ## 7. Methods
Gotchas
- Logo paths must exist: if logo files are missing, the report still generates but without institutional branding. The script silently skips missing logos.
- Table truncation: tables with more than 20 rows are truncated in the PDF with a note to consult TSV files. Do not assume all data is visible in the PDF.
- Ancestry data is optional: section 8 requires
in the ancestry output directory. If absent, the section shows "No ancestry data available."ancestry_results.json - ClinVar classifications are time-sensitive: the report reflects ClinVar state at annotation time. Do not treat classifications as permanent.
- PGx star alleles from SNVs only: CYP2D6 CNV analysis is not included. Do not claim complete metaboliser phenotyping.
Safety
ClawBio is a research and educational tool. It is not a medical device and does not provide clinical diagnoses. Consult a healthcare professional before making any medical decisions.
Agent Boundary
The agent dispatches and explains; the skill executes. The agent should not modify PDF generation logic inline. All report customisation goes through CLI flags.
Chaining Partners
: upstream VCF annotation feeding markdown reportsvariant-annotation
: ACMG classification for deeper analysisclinical-variant-reporter
: Spanish language version of the same reportwes-clinical-report-es
Maintenance
- Review cadence: quarterly (aligned with ClinVar release cycle)
- Staleness signals: ClinVar version drift, CPIC guideline updates
- Deprecation: if WES is superseded by WGS-only clinical pipelines
Requirements
- Python 3.9+
- reportlab >= 4.0
- WES markdown reports (see input format above)
- Institutional logos in JPG/PNG (optional)
Privacy
This skill is private and not included in the ClawBio public catalog. It contains institutional report templates that should not be distributed publicly.
References
- Corpas et al. (2021) "Whole Genome Interpretation for a Family of Five" Frontiers in Genetics 12:535123
- CPIC guidelines for pharmacogenomics
- ClinVar / gnomAD / OMIM / COSMIC / KEGG for variant annotation