LLMs-Universal-Life-Science-and-Clinical-Skills- genomics-assembly
install
source · Clone the upstream repo
git clone https://github.com/mdbabumiamssm/LLMs-Universal-Life-Science-and-Clinical-Skills-
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/mdbabumiamssm/LLMs-Universal-Life-Science-and-Clinical-Skills- "$T" && mkdir -p ~/.claude/skills && cp -r "$T/Skills/Genomics/genomics-assembly" ~/.claude/skills/mdbabumiamssm-llms-universal-life-science-and-clinical-skills-genomics-assembly && rm -rf "$T"
manifest:
Skills/Genomics/genomics-assembly/SKILL.mdsource content
🧬 Genome Assembly
De novo genome assembly for short and long reads. Wraps SPAdes, Megahit, Flye, and Canu.
CLI Reference
python omicsclaw.py run genomics-assembly --demo python omicsclaw.py run genomics-assembly --input <reads.fastq> --output <dir>
Why This Exists
- Without it: Assemblies require intense memory management and parameter orchestration per graph build
- With it: Automated contig building and K-mer tuning logic across read modalities
- Why OmicsClaw: Unified containerized or local graph assembler invocation
Workflow
- Calculate: Prepare k-mer frequencies or long-read overlaps.
- Execute: Build de Bruijn or string graphs.
- Assess: Perform contig polishing and scaffolding.
- Generate: Output structural FASTA representations.
- Report: Synthesize N50 stats and completeness metrics.
Example Queries
- "Assemble my isolate using SPAdes"
- "De novo genome assembly using Flye"
Output Structure
output_directory/ ├── report.md ├── result.json ├── assembled.fa ├── figures/ │ └── assembly_graph.png ├── tables/ │ └── quast_metrics.csv └── reproducibility/ ├── commands.sh ├── environment.yml └── checksums.sha256
Safety
- Local-first: Strict offline processing without external upload.
- Disclaimer: Requires OmicsClaw reporting structures and disclaimers.
- Audit trail: Hyperparameters and operational flow states are logged fully.
Integration with Orchestrator
Trigger conditions:
- Automatically invoked dynamically based on tool metadata and user intent matching.
Chaining partners:
— Upstream read trimminggenomics-qc
— Downstream genome annotationannotation